LLMs factor in unrelated information when recommending medical treatments

LLMs factor in unrelated information when recommending medical treatments (news.mit.edu)
from Pro@programming.dev to technology@lemmy.world on 23 Jun 22:41
https://programming.dev/post/32765603

#technology

threaded - newest

lupusblackfur@lemmy.world on 23 Jun 22:57 next collapse

large language model deployed to make treatment recommendations

What kind of irrational lunatic would seriously attempt to invoke currently available Counterfeit Cognizance to obtain a “treatment recommendation” for anything…???

FFS.

Anyone who would seems a supreme candidate for a Darwin Award.

osaerisxero@kbin.melroy.org on 24 Jun 00:10 next collapse

There's a potentially justifiable use case in training one and evaluating its performance for use in, idk, triaging a mass-casualty event. Similar to the 911 bot they announced the other day.

Also similar to the 911 bot, i expect it's already being used to justify cuts in necessary staffing so it's going to be required in every ER to ~~maintain higher profit margins~~ just keep the lights on.

OhVenus_Baby@lemmy.ml on 24 Jun 02:19 collapse

Not entirely true. I have several chronic and severe health issues. ChatGPT provides nearly and surpassing medical advice (heavily needs re-verified) from multiple specialialty doctors. In my country doctors are horrible. This bridges the gap albeit again highly needing oversight to be safe. Certainly has merit though.

notfromhere@lemmy.ml on 24 Jun 03:00 collapse

Bridging the gap is something sorely needed and LLMs are damn close to achieving.

FancyPantsFIRE@lemmy.world on 23 Jun 22:58 next collapse

Their analysis also revealed that these nonclinical variations in text, which mimic how people really communicate, are more likely to change a model’s treatment recommendations for female patients, resulting in a higher percentage of women who were erroneously advised not to seek medical care, according to human doctors.

This is not an argument for LLMs (which people are deferring to an alarming rate) but I’d call out that this seems to be a bias in humans giving medical care as well.

assaultpotato@sh.itjust.works on 23 Jun 23:22 collapse

Of course it is, LLMs are inherently regurgitation machines - train on biased data, make biased predictions.

xep@fedia.io on 23 Jun 23:30 next collapse

LLMs are not Large Medical Expert Systems. They are Large Language Models, and are evaluated on how convincing their output is, instead of how accurate or useful it is.

01189998819991197253@infosec.pub on 24 Jun 01:09 next collapse

Say it with me, now: chatgpt is not a doctor.

Now, louder for the morons in the back. Altman! Are you listening?!

ToastedRavioli@midwest.social on 24 Jun 08:00 collapse

ChatGPT is not a doctor. But models trained on imaging can actually be a very useful tool for them to utilize.

Even years ago, just before the AI “boom”, they were asking doctors for details on how they examine patient images and then training models on that. They found that the AI was “better” than doctors specifically because it followed the doctor’s advice 100% of the time; thereby eliminating any kind of bias from the doctor that might interfere with following their own training.

Of course, the splashy headline “AI better than doctors” was ridiculous. But it does show the benefit of having a neutral tool for doctors to utilize, especially when looking at images for people who are outside of the typical demographics that much medical training is based on. (As in mostly just white men. For example, everything they train doctors on regarding knee imagining comes from images of the knees of coal miners in the UK some decades ago)

ignirtoq@fedia.io on 24 Jun 01:25 next collapse

Why are they... why are they having autocomplete recommend medical treatment? There are specialized AI algorithms that already exist for that purpose that do it far better (though still not well enough to even assist real doctors, much less replace them).

R00bot@lemmy.blahaj.zone on 24 Jun 02:16 next collapse

Because sycophants keep saying it’s going to take these jobs, eventually real scientists/researchers have to come in and show why the sycophants are wrong.

notfromhere@lemmy.ml on 24 Jun 03:02 collapse

Are there any studies done (or benchmarks) that show accuracy on recommendations for treatments given a medical history and condition requiring treatment?

Gbagginsthe3rd@aussie.zone on 24 Jun 03:27 collapse

Im currently working on one now as a researcher. Its a crude tool to measure the quality of response. But its a start

notfromhere@lemmy.ml on 24 Jun 03:32 collapse

Gotta start somewhere, and it won’t ever improve if we don’t start improving it. So many on Lemmy assume the tech will never be good enough so why even bother, but that’s why we do things, to make the world that much better… eventually. Why else would we plant literal trees? For those that come after us.

Traister101@lemmy.today on 24 Jun 05:50 collapse

It’s not an assumption it’s just a matter of practical reality. If we’re at best a decade off from that point why pretend it could suddenly unexpectedly improve to the point it’s unrecognizable from its current state? LLMs are neat, scientists should keep working on them and if it weren’t for all the nonsense “Ai” hype we have currently I’d expect to see them used rarely but quite successfully as it would be getting used off of merit, not hype.

drmoose@lemmy.world on 24 Jun 04:25 collapse

I have used chatgpt for early diagnostics with great success and obviously its not a doctor but that doesn’t mean it’s useless.

Chatgpt can be a crucial first step especially in places where doctor care is not immediately available. The initial friction for any disease diagnosis is huge and anything to overcome that is a net positive.