Remix.run Logo
simonw 3 days ago

Completely agree with you - LLMs with access to search tools that know how to use them (o3, GPT-5, Claude 4 are particularly good at this) mostly paper over the problems caused by a lossy set of knowledge in the model weights themselves.

But... end users need to understand this in order to use it effectively. They need to know if the LLM system they are talking to has access to a credible search engine and is good at distinguishing reliable sources from junk.

That's advanced knowledge at the moment!

johnecheck 3 days ago | parent | next [-]

From earlier today:

Me: How do I change the language settings on YouTube?

Claude: Scroll to the bottom of the page and click the language button on the footer.

Me: YouTube pages scroll infinitely.

Claude: Sorry! Just click on the footer without scrolling, or navigate to a page where you can scroll to the bottom like a video.

(Videos pages also scroll indefinitely through comments)

Me: There is no footer, you're just making shit up

Claude: [finally uses a search engine to find the right answer]

pbhjpbhj 3 days ago | parent [-]

IME, eventually, after a long time, the scrolling stops and you can get to the footer. YMMV!

gf000 3 days ago | parent | prev [-]

Slightly off topic, but my experience is that they are pretty terrible at using search tools..

They can often reason themselves into some very stupid direction, burning all the tokens for no reason and failing to reply in the end.