Remix.run Logo
RyanOD 4 hours ago

Ok, fair. Hard to understand why it would get that wrong.

codebje 3 hours ago | parent | next [-]

Because LLMs aren't sentient, they don't draw on facts, and they don't have nuance. The answer given is similar to answers you might expect to see for similar questions.

It's really amazing we can make machines do that, and it's really depressing that we think a stochastic bullshit machine is going to give us something we can rely on.

Robotbeat 43 minutes ago | parent [-]

Or… the default LLM Google uses for search has been quantized to s**. Ask a proper Thinking model, with browsing enabled, and odds of a correct answer are much higher. There’s been substantial improvement in AI in even the last year.

Ask a human a question like this, and they also have a chance of getting it wrong, even when confident.

nvme0n1p1 24 minutes ago | parent | next [-]

> Ask a human a question like this

Why would a human know specs for a random phone off the top of their head? The human response is either "I don't know" or "let me look that up", not a hallucination.

jazzyjackson 25 minutes ago | parent | prev [-]

*so long as an accurate answer exists on the internet

Claude is OK at saying when it can’t find good information, but it’s still 50/50 on citing a source that has nothing to do with its claim.

Groxx 3 hours ago | parent | prev | next [-]

They are this wrong about everything, but you don't usually notice it when using it to look for things you aren't an expert in. The default stance really does need to be "do not trust, verify" at all times.

They can still be useful, e.g. they're significantly better at finding "I want a thing that does x but not y and it must be blue, or maybe two things that can be glued together to do that" than classic search. But they'll routinely miss extremely obvious answers because the related search it ran didn't find it, or completely screw up what something can actually do. Checking more pages of results by hand or asking humans who know even a little about those fields is still wildly more useful... but they're absolutely slaughtering the sites where people do that, by stealing all the real traffic and sending DDoS-level automated requests.

Barbing 4 hours ago | parent | prev | next [-]

I’d make assumptions about how the cheapest and fastest possible flash model optimized for being extra cheap and extra fast would get something wrong based on its limited context (which can be very incomplete summaries of search results)

bitmasher9 3 hours ago | parent [-]

I often have the expensive models give relatively simple inaccurate answers, even when they cite sources that directly contradict them. The error rate is lower, but you can’t have confidence with llm answers.

pesus 3 hours ago | parent | prev | next [-]

It somehow seems to interpret whatever sources it's grepping as the exact opposite of what those sources say fairly often. I've lost track of how many times I've clicked on the sources it cites, and every single one is in agreement, but the AI claims the opposite.

facemelt2 4 hours ago | parent | prev | next [-]

Did you just agree to a stranger's counterpoint on the internet? This post should be in a museum somewhere

SequoiaHope 3 hours ago | parent | prev [-]

The simple answer is that these systems are very bad at telling the truth reliably.