▲ | crystal_revenge 5 days ago
People also tend not to understand the absurdity of assuming that we can make LLMs stop hallucinating. It would imply not only that truth is absolutely objective, but that it exists on some smooth manifold which language can be mapped to. That means there would be some high-dimensional surface representing "all true things", and any fact could be trivially resolved as "true" or "false" simply by exploring whether or not it was represented on this surface.

Whether or not "My social security number is 123-45-6789" is true could be determined simply by checking whether that statement was mappable to the truth manifold. Likewise, you could wander around that truth manifold and generate output consisting of all true things. If such a thing existed, it would make even the wildest fantasies about AGI seem tame.

edit: To simplify it further, this would imply you could have an 'is_true(statement: string): bool' function for any arbitrary statement in English.
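Concretely, a rough sketch of that impossible oracle (TypeScript, names and body entirely my own, nothing here is real):

    // Purely hypothetical: the total truth oracle a "never hallucinates"
    // guarantee would require. Nothing can actually go in the body, since it
    // would have to decide arbitrary statements: private facts, open
    // mathematical conjectures, claims about the future.
    function isTrue(statement: string): boolean {
      throw new Error(`no general decision procedure exists for: ${statement}`);
    }

    // If such a function existed, calls like these would all have to return an answer:
    // isTrue("My social security number is 123-45-6789");
    // isTrue("There are infinitely many twin primes");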
▲ | jdietrich 5 days ago | parent | next [-]
>People also tend not to understand the absurdity of assuming that we can make LLMs stop hallucinating. It would imply not only that truth is absolutely objective, but that it exists on some smooth manifold which language can be mapped to.

Frankly, this is a silly line of argument. There is a vast spectrum between regularly inventing non-existent citations and total omniscience. "We can't define objective truth" isn't a gotcha, it's just irrelevant. Nobody in the field is talking about or working on completely eliminating hallucinations in some grand philosophical sense; they're just grinding away at making the error rate go down, because that makes models more useful. As shown in this article, relatively simple changes can have a huge effect, and meaningful progress is being made very rapidly.

We've been here before, with scepticism about Wikipedia. A generation of teachers taught their students "you can't trust Wikipedia, because anyone can edit it". Two decades and a raft of studies later, it became clear that Wikipedia is at least as factually accurate as traditional encyclopedias and textbooks. The contemporary debate about the reliability of Wikipedia is now fundamentally the same as arguments about the reliability of any carefully-edited resource, revolving around subtle and insidious biases rather than blatant falsehoods.

Large neural networks do not have to be omniscient to be demonstrably more reliable than all other sources of knowledge; they just need to keep improving at their current rate for a few more years. Theoretical nitpicking is missing the forest for the trees - what we can empirically observe about the progress in AI development should have us bracing ourselves for radical social and economic transformation.
▲ | mqus 5 days ago | parent | prev | next [-]
Well, no. The article pretty much says that any arbitrary statement can be mapped to {true, false, I don't know}. That is still not 100% accurate, but it is at least something that seems reachable. The model just needs to be able to flag unknowns, not verify every single fact.
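Put in code terms, the contract being asked for is much weaker. A loose sketch with made-up names, not anything the article actually specifies:

    // Three-valued verdict: the model is allowed to abstain.
    type Verdict = "true" | "false" | "unknown";

    // Toy illustration: only answer for statements there is actual support or
    // refutation for; everything else is an honest "unknown" rather than a guess.
    function judge(statement: string, supported: Set<string>, refuted: Set<string>): Verdict {
      if (supported.has(statement)) return "true";
      if (refuted.has(statement)) return "false";
      return "unknown";
    }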
▲ | thisoneisreal 5 days ago | parent | prev | next [-]
A great book in this vein is "Language vs. Reality." The main thesis of the book is that language evolved to support approximate, ad hoc collaboration, and is woefully inadequate for doing the kind of work that e.g. scientists do, which requires incredible specificity and precision (hence the amount of effort devoted to definitions and quantification).
▲ | BobbyTables2 5 days ago | parent | prev | next [-]
Agree. I deeply suspect the problem of asking an LLM to not hallucinate is equivalent to the classic Halting Problem.
▲ | beeflet 4 days ago | parent | prev [-]
Maybe if a language model were so absolutely massive, it could <think> enough to simulate the entire universe and determine your social security number.