Agraillo 4 days ago

> finds that they're not credible enough to generate an answer

Credibility is only one side of the story. In many cases, at least in my curiosity-driven research, I search for something very niche, so to find anything related at all, an LLM needs to establish semantic equivalence between the topic in the query and what the found pages are discussing or explaining.

One recent example: in a flat-style web discussion, it may be interesting to somehow visually mark a reply if the comment is from a user who was already in the discussion (at least GP or GGP). I wanted to find some thoughts or discussion about this. I had almost no luck with Perplexity, which probably brute-forced dozens of result pages for semantic equivalence comparison, and I also "was not feeling/getting lucky" with Google using keywords, the AROUND operator, and so on. I'm sure there are a couple of blogs and web-technology forums where this was actually discussed, but I'm not sure the current indexing technology is semantically aware at scale.
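
To make the marking idea concrete, here is a minimal sketch of how a forum could decide which replies to highlight. Everything in it is an assumption for illustration: the `Comment` shape, the `returning_replies` name, and the rule that "already in the discussion" means the author matches the grandparent or great-grandparent comment (distance 2 or 3 up the reply chain).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Comment:
    # Hypothetical minimal comment record; real forums store more fields.
    id: int
    parent_id: Optional[int]
    author: str


def returning_replies(comments, max_distance=3):
    """Return ids of comments whose author also wrote an ancestor
    at distance 2..max_distance (GP, GGP, ...) in the reply chain."""
    by_id = {c.id: c for c in comments}
    marked = set()
    for c in comments:
        ancestor = by_id.get(c.parent_id)  # distance 1: the direct parent
        distance = 1
        while ancestor is not None and distance <= max_distance:
            # Skip the direct parent: only GP and further count here.
            if distance >= 2 and ancestor.author == c.author:
                marked.add(c.id)
                break
            ancestor = by_id.get(ancestor.parent_id)
            distance += 1
    return marked


thread = [
    Comment(1, None, "alice"),
    Comment(2, 1, "bob"),
    Comment(3, 2, "alice"),  # GP (comment 1) is alice
    Comment(4, 3, "carol"),
    Comment(5, 4, "bob"),    # GGP (comment 2) is bob
    Comment(6, 5, "dave"),
]
print(returning_replies(thread))
```

A flat-style renderer could then add a visual marker to the returned ids; whether the direct parent should count as well is a design choice this sketch leaves out.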

It's interesting that Google is sometimes still better, for example when a topic I'm researching has a couple of specific terms one has to know to discuss it seriously. Making them mandatory (with quotes) may produce a result set small enough to scan with my own eyes.