I feel like I'm going nuts.

There are other commenters saying this is a good practice they've also done for other injuries. You are saying you are an actual radiologist and immediately clock the problems with its advice.

I have seen this pattern over and over again. Anytime someone is an actual expert at anything, AI output appears insufficient or incomplete or outright misleading. Only when you do not know the subject the AI is being asked about are you likely to find the output helpful.

This is itself alarming to me, but no one else seems to find this to be quite damning for the AI services being offered, preferring instead to be wowed by the convenience and speed at which they can be delivered unreviewed and unproven information.

▲ appplication 2 hours ago | parent | next [-]

This is the root of AI psychosis. There’s a lot to unpack here, and I won’t go too deep because you can’t really have a discussion with affected folks because their fundamental basis is not evidence, it’s belief.

It is weirdly religious in a way, because if you were to present contrary evidence (e.g. experts in a field weighing in about how plausible sounding responses are bunk), you would only be told you don’t believe enough in the long term potential and capabilities.

Don’t get me wrong, I think we all agree capabilities will eventually improve (and farther-future capabilities could reasonably surpass experts), but it really is unclear if the current transformer architectures with their probabilistic/hallucinatory outputs will plateau before they surpass current experts’ abilities in all promised fields.

▲ TomasBM 2 minutes ago | parent | next [-]

Why is it psychosis and not lower standards?

While I can understand being skeptical of non-experts' claims that such answers are enough, I don't understand why you call it "psychosis" and not simply naivety or lack of expertise.

At the same time, the new so-called "models" haven't been pure transformer-based LLMs, but entire systems with tools (with access to the Internet), data storage, and the options to trigger additional instances for different tasks.

▲ cheschire 4 minutes ago | parent | prev | next [-]

I was a very early adopter of AI in my circles and I shared it with many people. Strangely, I seem to be the most skeptical about AI in my circles as well, but because I was the gateway for many folks, they want to come back and share their experiences with me.

And it's so much like listening to someone in a church congregation sharing their experiences with god. Clear and obvious gaps are hand-waved away exactly how you're describing.

▲ sublinear 20 minutes ago | parent | prev | next [-]

Human expertise is also improving all the time and not limited to just connecting dots. When AI seems to surpass a particular human, it's just because the human lacks broader knowledge and fails to investigate further.

An expert already knows they don't know everything. That was never the point. Critical thinking cannot be delegated to AI any more than it can be delegated to a book. There is nothing new going on here.

▲ lazide 42 minutes ago | parent | prev [-]

I don’t think they will improve, there is too much incentive to poison the datasets going forward.

A lot of the models up to this point have benefited - like Google did - from an essentially ‘pre SEO’ internet.

Now the same tools are being used to generate nigh infinite good sounding bullshit, which poisons the dataset in all sorts of hard to detect ways.

To add insult to injury, the human experts are also not as naive, and have many incentives to poison their own input in subtle ways too.

▲ brokencode 17 minutes ago | parent | next [-]

I seriously doubt that data set poisoning will be a real limiter in model performance.

For one, if your website/book is poisoned, who is going to trust it for anything at all, much less for training models?

For two, all the major AI labs hire or contract for subject matter experts to create curated data sets, evaluate model performance, etc.

Unless they hire malicious experts, this will provide a growing, high quality data set that should drown out any poisoned pretraining data.

▲ Analemma_ 4 minutes ago | parent | next [-]

I think you underestimate just how much money is being poured into LLM SEO at the moment. It's real quiet because they don't want to draw attention and countermeasures from the frontier labs, but this is getting huge investment, and they will have a monomaniacal focus on juicing product results whereas the attention of the labs necessarily has to be spread out.

▲ microgpt 9 minutes ago | parent | prev [-]

Pretty easy to display one thing to verified browsers (just latest few user-agents from the 10ish different mainstream browsers on the 3 main OSes) and another to anything else. Yes AI scrapers can easily spoof user-agent, but they fall out of date as the browser updates. Bit harder to catch them in tarpits and then serve nonsense to whoever triggered the tarpit.

▲ rvnx 35 minutes ago | parent | prev [-]

Human doctors use LLMs to diagnose too

OpenEvidence claims

    "More than 40% of U.S. physicians use it daily, and it handled around 20 million clinical consultations per month. Over 100 million Americans were treated by a doctor using it in 2025."

https://www.cnbc.com/2026/01/21/openevidence-chatgpt-for-doc...

▲ something98 22 minutes ago | parent | next [-]

This is a very misleading statement; most of those physicians are using LLMs to transcribe notes from visits and/or for billing purposes (e.g., proper billing codes).

▲ brokencode 14 minutes ago | parent [-]

OpenEvidence is specifically meant to help clinicians make evidence-based decisions in the diagnosis and treatment of patients, not note transcription.

▲ sxg 5 minutes ago | parent [-]

It does both: https://www.openevidence.com/user-guide/visits-overview

▲ sambellll 3 minutes ago | parent | prev | next [-]

To me this is like a good software engineer using AI.

The fact that they use it doesn't make what the result is any worse or less trustworthy - arguably it makes it better.

It only becomes a problem if they offload all of the thinking to AI.

▲ sarchertech 11 minutes ago | parent | prev [-]

Ignoring the fact that this number comes from a company press release, it doesn’t say anything about the number of doctors using it to diagnose, just that they use it.

If a physician uses Google to search for a dosage chart for some drug they rarely prescribe, you wouldn’t say they are using Google to diagnose the patient. You wouldn’t say that either if they used Google to search for the most recent studies on a topic.

▲ qnleigh 33 minutes ago | parent | prev | next [-]

Totally agree. I'm a scientist, and like most scientists I have some specialized skills that most of my colleagues don't. AI has empowered them to learn and build things that they might have otherwise needed me for. But there have been quite a few cases where it led them very far down a wrong path. This has started happening way more often in the last few months.*

We've known since the beginning that AIs confidently say incorrect things. But now that they can speak confidently about very complex topics, and mostly say correct things, we are letting our guard down and lots of subtle falsehoods are slipping through.

*In one case, I was able to put things back on track because the AI suggested my colleague talk to me; somehow it figured out we were co-workers.

▲ bitlad 28 minutes ago | parent [-]

> very far down the wrong path.

Absolutely agree. Have seen this first hand.

▲ sbarre 2 hours ago | parent | prev | next [-]

> Anytime someone is an actual expert at anything, AI output appears insufficient or incomplete or outright misleading

Yes, this is exactly so. AI is able to confidently sound plausible enough to convince laypersons or anyone who isn't very familiar with the subject matter, which is a big part of the mass-appeal "magic" of ChatGPT and other similar tools. It's like having a know-it-all friend (who also makes shit up to bridge their own knowledge gaps).

In many non-advanced non-specialized situations, AI is right enough to be at best useful or at worst not harmful (usually landing in the middle somewhere).

But speaking for myself, in areas where I consider myself quite proficient, I can very easily spot the subtle inconsistencies and naive conclusions that AI responses provide, and I have to guide/steer/correct it a lot to get good results when the subject matter is complex enough.

▲ sxg an hour ago | parent | prev | next [-]

I see your argument, but it's not exactly news that an expert found a flaw in a popular tool. You could say the same about Wikipedia--experts have tons of issues with it, but Wikipedia still provides value to non-experts. The most likely alternative to Wikipedia for non-experts is simply not trying to learn anything new.

Similarly with LLMs, you can't just write them off entirely because they sometimes provide misleading or incorrect advice. The positive, utility-maximizing view is to learn when you need to call in an expert. I recently moved into a new house and have used Claude extensively to figure out basic things (e.g., adjusting the garage door height, how to mount a TV). However, when the HVAC suddenly stopped working, I gave Claude a shot for an hour and tried some non-destructive fixes, but then realized I had to call in an HVAC expert.

▲ frereubu 29 minutes ago | parent | next [-]

Slightly OT Nitpick: in regard to experts and Wikipedia, when doing a neuroscience-adjacent MSc, experts in the field actually directed me to Wikipedia as an excellent source for high-level neuroanatomy, including recent research, so I'm not sure your blanket description about experts and Wikipedia is correct.

▲ ohyes an hour ago | parent | prev [-]

The free alternative to Wikipedia is the library, not “don’t learn anything new ever”.

I find Claude is surprisingly similar to a confident but incorrect coworker, with the benefit that Claude will reevaluate when I correct it.

▲ sxg 42 minutes ago | parent | next [-]

I used the phrase "most likely alternative" intentionally. The library is where people should go to get answers in a world without Wikipedia, but the vast majority of people won't. So in practice, most non-experts either learn from Wikipedia or don't try to learn anything at all.

▲ ohyes 2 minutes ago | parent [-]

Sure, if we’re going to go that broad. People are already leaning heavily towards learning nothing instead of using Wikipedia. I guess to me it has to be comparable to be an alternative. Like, I don’t consider doomscrolling x an alternative to reading Wikipedia but I might consider it an alternative to CNN, even though they’re all technically and very broadly activities that I could use to inform myself. In that same way I don’t consider the multitude of ways I could use my free will necessarily alternatives to each other even though they technically are. It kinda sucks but going that broad feels to me like it breaks the concept of alternative and makes it kind of meaningless.

▲ bflesch an hour ago | parent | prev [-]

Claude will do everything to retain you as a user, because that's one of their most important metrics.

▲ ohyes a minute ago | parent [-]

Excellent point; my colleague has the exact opposite incentive.

▲ pwg a minute ago | parent | prev | next [-]

> Anytime someone is an actual expert at anything, AI output appears insufficient or incomplete or outright misleading.

The term for when the press "gets it wrong" is Gell-Mann Amnesia (https://en.wiktionary.org/wiki/Gell-Mann_Amnesia_effect).

In that case, when you have personal knowledge of the facts, or know the specific domain area, you can see where the reporter mixed things up.

AI is no different, it's just a bunch of matrix math substituting for "the reporter" regurgitating what it was previously told. So the Gell-Mann Amnesia effect would apply just the same. If you have domain knowledge, you immediately see where the AI got it wrong. When you do not have domain knowledge, you have less chance of seeing that the AI was wrong.

▲ rapatel0 3 minutes ago | parent | prev | next [-]

You shouldn’t expect frontier models to work on medical imaging. There is much more that goes into building a medical imaging product. First and foremost is data. Medical imaging datasets are not prevalent on the public internet at the scale necessary to have good performance on medical imaging tasks, especially MRI. Also, the labels are super noisy. This is completely different than asking for general medical reasoning, which is more derived from papers, public standards and textbooks. Text exists at the right scale but images don’t.

▲ meowface 37 minutes ago | parent | prev | next [-]

I may be missing something, but I think it's unclear that the parent poster here is necessarily actually contradicting anything the AI said. It may depend on the exact information the OP wrote to Claude and GPT. The full transcripts would be needed. (Though there is definitely a separate point that a doctor would generally better know all the right questions to ask, while current LLMs may be making certain assumptions.)

The LLM may have, from its "perspective", implicitly thought the OP was telling it that he had strong reason to believe there was no calcification and was not considering the bigger picture of possibly receiving an incomplete/poor assessment from the medical staff. In fact, the issue here may be the LLM overly trusting doctors vs. trusting its own expertise.

▲ mattgreenrocks 8 minutes ago | parent | prev | next [-]

You're not. This site was also bullish on using LLMs as therapists, which defeats the very point of them, and reflects a lack of knowledge on what exactly therapists do for people.

More on topic: if the article's author arrived at a definitively negative result would this have shown up on HN?

▲ nlawalker 2 hours ago | parent | prev | next [-]

> no one else seems to find this to be quite damning for the AI services being offered, preferring instead to be wowed by the convenience and speed at which they can be delivered unreviewed and unproven information

"Be wowed by the convenience and speed", or merely "take advantage of the mere availability"? What most people find to be damning about expert advice is that they simply can't get it anywhere, at any cost that they can afford.

▲ whatever1 2 hours ago | parent [-]

So if you want to do a surgery but you don’t see any surgeons around, you ask a grocery butcher to have his way?

▲ sxg an hour ago | parent | next [-]

In certain circumstances, the answer is yes. If an airplane's pilots are incapacitated, do you simply give up and crash the plane because there are no other pilots on board? Or would you rather have someone on the ground try to coach a passenger into at least attempting to land the plane?

▲ close04 31 minutes ago | parent | next [-]

A passenger crashing the plane while trying to avoid a certain crash doesn’t make things any worse. An incompetent doctor trying to save you from certain death can make things so much worse. It’s all about weighing the best/worst outcome compared to where you are now.

▲ microgpt 5 minutes ago | parent [-]

I hate to break it to you but death is certain for everyone. Emotionally realizing this and the complete inability to do anything about it is called an "existential crisis" and if you haven't had one or several yet, you will.

▲ frereubu 33 minutes ago | parent | prev | next [-]

That's an extreme edge case, which I don't think is in the context of the concerns in this thread.

▲ sxg 18 minutes ago | parent [-]

The specific case doesn't matter--it's meant to make you think about the general question throughout this thread: when an expert isn't available, should non-experts use AI (or other tools) to help themselves? Sometimes the answer is yes because the potential benefits outweigh the potential harms (if any harms exist). But sometimes the answer is no because misleading/incorrect advice can cause a net harm.

▲ ChrisMarshallNY an hour ago | parent | prev | next [-]

As long as that passenger didn’t have the fish.

▲ jancsika 33 minutes ago | parent | prev [-]

You can choose a) a calm, level-headed passenger who knows they aren't a pilot, or b) a calm, level-headed passenger who almost has their pilot's license but has a medical condition that prevents them from admitting when they lack certain knowledge.

Who do you choose to be coached by an expert on the ground?

▲ rvnx 30 minutes ago | parent [-]

No thank you, I will ask Claude and then ask ChatGPT to challenge me, and do a couple of rounds like that.

The first: has no clue about anything and therefore has no useful knowledge and cannot challenge me.

The second: is proven to willfully give wrong information and makes even basic mistakes.

The LLMs will do their best, even if imperfect, since they summarize what appeared in books. I prefer to be grounded in what Airbus / Boeing manuals or pilot training books say than in two far more unreliable sources.

▲ EA-3167 an hour ago | parent | prev [-]

People, especially in medical crises, are desperate for answers that they often can't get because their clinicians don't know. The illusion of an all-knowing guru who sounds like their doctor and tells them ANYTHING is extremely alluring. Waiting to hear back from a doctor about test results (which these days probably showed up on your online account the moment they were completed) can be agonizing.

OK, for pain in your shoulder it might not be, but how about a woman with a lump in her breast waiting for the mammogram interpretation? How about someone trying to understand disturbing lab results? People are also often pushed these days to move through visits with doctors at a breakneck speed, but the AI will "hear you out" all day.

Part of this is a problem with the AI, part of it is a problem with our healthcare systems, and part of it is simply human nature. If you think that OpenAI, Anthropic, Google and the rest weren't aware of this going in, you must have very little faith in the intelligence of their members. It's not hard to imagine that the future of LLMs should involve a hell of a lot of liability for the companies running them, but for now it's the Wild West.

▲ bilsbie an hour ago | parent [-]

> but how about

Whatever scenario you come up with, my answer is the same.

As an adult I’d like to be able to choose what tools I use to learn about my condition regardless of how well it works or even if it’s likely to mislead me.

There’s risk in every aspect of life and we can’t baby proof everything.

▲ baconmania 42 minutes ago | parent | next [-]

> choose what tools I use to learn about my condition regardless of how well it works or even if it’s likely to mislead me.

Even if it "works" so poorly that you're not actually learning about your condition?

▲ EA-3167 38 minutes ago | parent | prev [-]

If it's helping you learn about your condition then sure I agree. The issue here is that's not really the case, it's giving you the illusion that you're learning about your condition while feeding you hallucinations and half-truths at best. A recent look at medical advice from these things showed they're no better than a coin flip. So if you MUST have answers that are at most random guesses, I'd suggest saving a few bucks and asking a coin before flipping it.

▲ ffsm8 22 minutes ago | parent | prev | next [-]

> This is itself alarming to me, but no one else seems to find this to be quite damning for the AI services being offered, preferring instead to be wowed by the convenience and speed at which they can be delivered unreviewed and unproven information.

This point has been raised in literally all discussions about LLMs for the whole last year, if not longer.

What it omits is the fact that these people getting suckered into the AI psychosis are using non-specialized models without an agentic loop while knowing nothing about the topic they're using the AI for.

That's down to the fact that this tech hasn't really been integrated yet and people are using it widely (and) irresponsibly, but it's not necessarily something you should blame LLMs for - the cause is likely more down to the model providers' marketing and our collective tendency to like self-affirmation / thinking we ourselves know best.

▲ highfrequency 2 hours ago | parent | prev | next [-]

Seems natural enough. There will always be complexity and nuance that is missed by an AI model or person - the world is just super detailed. The more expertise you have the more you will be aware of that nuance. That doesn't mean the model or person is not useful as a starting point.

▲ je42 36 minutes ago | parent | prev | next [-]

The question is how far off AI is compared to the professionals that we have access to. The world's best experts are not accessible to most of us. :(

▲ kryogen1c an hour ago | parent | prev | next [-]

On the flip side of this problem, novel best practices lag the medical standard of care, other human failures like corruption and competing priorities notwithstanding.

For example, we had to advocate for certain practices during the birth of our first child that became routine during our second several years later.

So, neither side is guaranteed correct, doctor or citizen researcher (which did not include LLMs in my case, for the record). The truest answer is also the most useless one, applicable to all fields: it depends.

The real question is: if you embrace being a layman, whom do you trust more: LLMs/the internet or experts, like doctors? I think the answer is pretty clearly experts.

▲ jstummbillig an hour ago | parent | prev | next [-]

No, it is not the case that anytime someone is an actual expert at anything, AI output appears insufficient. That is why experts in various fields use AI.

Then to say "Aha, but all of that is AI psychosis" obviously makes no sense: Why would we trust experts when they offer critique but not when they say "this is helpful"?

Overall: People are not insane. AI makes mistakes and, often, fails completely. AI also helps them do things better, quicker, increasingly so. The jaggedness of AI is confusing and real.

▲ torben-friis 43 minutes ago | parent | next [-]

How many times have you seen an expert go "yeah these results are good consistently enough for a non expert to trust them without expert assistance"?

There is a huge difference between having a chance of a good result, which can be useful for experts able to filter out the bullshit, and consistent success. I would generate code as a helper, I would never allow a guy from marketing to merge unreviewed AI code.

▲ hectdev 30 minutes ago | parent [-]

That's what I would like to call job security. When you know how to read what is wrong, you can easily catch the mistakes and correct them. AI gets you there faster by doing a lot of things right and you correct the mistakes.

▲ lazide 37 minutes ago | parent | prev [-]

I’ve never seen an expert use AI in their field beyond the initial ‘oh interesting’ stage.

▲ tomaskafka an hour ago | parent | prev | next [-]

Yes. The PM’s “with AI I know enough to be dangerous, haha” means “I’m actually dangerous and I don’t realize”

▲ beering an hour ago | parent | prev | next [-]

TFA doesn’t actually state where the bit about shockwave therapy came from and it wasn’t the main point of the article. The concern was about being given useless therapies. The homeopathic analgesic is concerning, at least to me.

I.e. nothing this radiologist said was related to the LLM’s advice.

▲ Hikikomori 27 minutes ago | parent | prev | next [-]

It's like reading news articles. Seems reasonable until you read an article about something you know, then you see how wrong they can be.

▲ suttontom 41 minutes ago | parent | prev | next [-]

Your instinct is correct, and in a lot of cases it's true. However, I've heard from enough doctors by now (a cardiologist, psychiatrist, and epidemiologist/former physician) that they use medical LLMs and find them extremely helpful, mostly as a way to either bring up knowledge they'd forgotten about or as a way to learn something new and then verify it. I'm extremely skeptical about LLMs in general and the connection to Gell-Mann Amnesia is apt, but I wouldn't necessarily write them off completely like that. There are experts using the models that find them genuinely helpful in their field.

▲ GTP 34 minutes ago | parent [-]

Probably this is the point, and it's a point that has been brought up a lot of times in the past, maybe less in recent times: you need to know the things you're applying an LLM to. In this way, you can keep the good outputs while having the expertise to discard the bad ones.

▲ parineum 2 hours ago | parent | prev | next [-]

> I have seen this pattern over and over again. Anytime someone is an actual expert at anything, AI output appears insufficient or incomplete or outright misleading.

AI isn't even the first instance of this phenomenon, news articles are like this as well.

https://en.wiktionary.org/wiki/Gell-Mann_Amnesia_effect

▲ newsclues 2 hours ago | parent | prev | next [-]

An LLM is not necessarily an expert system. Once there are expert systems for law, healthcare, accounting, governance…

https://en.wikipedia.org/wiki/Expert_system

▲ microgpt 3 minutes ago | parent [-]

Didn't they try that in the 80s and 90s but discover the real world is too variable for that to work?

▲ meindnoch an hour ago | parent | prev | next [-]

We're past the point of Gell-Mann amnesia. This is full blown Gell-Mann psychosis.

▲ grayhatter 27 minutes ago | parent | prev | next [-]

Welcome to the club? This new awareness you've found of the true quality of LLM-based GenAI output has been what "all the haters" have been mad about forever. That the output of LLMs is clearly defective, and that they have merely found a cute trick for making humans think they're less defective than they are actually measured to be.

And the corresponding anger and frustration to push the risks of GenAI output out onto others, while also aggressively pushing it as a feature you should be using already. You're behind, don't you know, and whatever other lie I have to tell to trick you into enough FOMO to pay me 200USD/mo so I can sell FOSS back to you.

An LLM can only output the mean next likely token, and then add a bunch of extra noise on top of that so it feels interesting and not repetitive. None of this is new; the problem is that 50% of humans are below the mean but have no idea. So when an LLM tells them some lie: well, it sounds so helpful! It's impossible for someone who sounds this helpful to lie to me, liars never sound confident! It must be PERFECT! I'm gonna tell everyone how perfect it is.

So the bottom 0-33% think LLMs are fantastic tools that make nearly 0 mistakes in comparison to the bottom 33%. The 33-66%-ish aren't sure: sometimes it's great, but it will make that random mistake sometimes, but I can catch most (or all of them, depending on ego). And the 66%+ are angry about how many people are getting tricked by something so obviously low quality, or are lucky enough to not have to care.

▲ silisili an hour ago | parent | prev [-]

This is natural and even logically expected. It's just Gell-Mann amnesia in action. The world has more people spouting on things than it has people knowledgeable in said things.

Apply that to the Internet at large, and realize where LLMs got their training. They're basically ConfidentlyIncorrect personified.