| ▲ | w0m 2 hours ago |
| I believe his argument is that now that you've defined the limitation, it's a ceiling that will likely be cracked in the relatively near future. |
|
| ▲ | emp17344 2 hours ago | parent [-] |
| Well, hallucinations have been identified as an issue since the inception of LLMs, so this doesn’t appear true. |
| |
| ▲ | w0m 5 minutes ago | parent | next [-] | | I mean, Hallucinations are 95% better now than the first time I heard the term and experienced them in this context. To claim otherwise is simply shifting goalposts. No one is saying it's perfect or will be perfect, just that there has been steady progression and likely will continue to be for the foreseeable future. | |
| ▲ | johnfn 39 minutes ago | parent | prev [-] | | Hallucinations are more or less a solved problem for me ever since I made a simple harness to have Codex/Claude check its work by using static typechecking. | | |
| ▲ | emp17344 30 minutes ago | parent [-] | | But there aren’t very many domains where this type of verification is even possible. |
|
|