| ▲ | giancarlostoro 3 hours ago | |||||||
One of the things I often wonder is "what will be the minimally viable LLM" that can work from just enough information that if it googles the rest it can provide reasonable answers? I'm surprised something like Encyclopedia Britanica hasn't yet (afaik) tried to capitalize on AI by selling their data to LLMs and validating outputs for LLM companies, it would make a night and day difference in some areas I would think. Wikipedia is nice, but there's so much room for human error and bias there. | ||||||||
| ▲ | embedding-shape 3 hours ago | parent | next [-] | |||||||
Your worry about Wikipedia is that there is "much room for human error and bias", yet earlier you seem to imply that a LLM that has access to the www somehow would have less human error and bias? Personally, I'd see it the other way around. | ||||||||
| ||||||||
| ▲ | intrasight 3 hours ago | parent | prev | next [-] | |||||||
It's not so much a "minimally viable LLM" but rather an LLM that knows natural language well but knows nothing else. Like me - as an engineer who knows how to troubleshoot in general but doesn't know about a specific device like my furnace (recent example). And I don't think that LLM could just Google or check Wikipedia. But I do agree that this architecture makes a lot of sense. I assume it will become the norm to use such edge LLMs. | ||||||||
| ||||||||
| ▲ | utopiah 3 hours ago | parent | prev | next [-] | |||||||
> validating outputs for LLM companies How? They can validate thousands if not millions of queries but nothing prevent the millions-th-and-one from being a hallucination. People who would then pay extra for a "Encyclopedia Britanica validated LLM" would then, rightfully so IMHO, complain that "it" suggested them to cook with a dangerous mushroom. | ||||||||
| ▲ | bee_rider 2 hours ago | parent | prev | next [-] | |||||||
Isn’t that sort of what a RAG is? You’d need an LLM “smart” enough to turn natural-user prompts into searches, then some kind of search, then an LLM “smart” though to summarize the results. | ||||||||
| ||||||||
| ▲ | uniq7 3 hours ago | parent | prev | next [-] | |||||||
Since Google Search already includes an AI summary, your minimally viable "LLM" can be just an HTTP GET call | ||||||||
| ▲ | thinkingtoilet 2 hours ago | parent | prev [-] | |||||||
Wikipedia has proven to be as accurate as encyclopedias for decades now. Also, I'm betting AI companies have illegally trained their models on the Encyclopedia Britanica's data by now. | ||||||||