binarymax 2 days ago

Well sure. But maybe the token logprobs can be used to help give a confidence assessment.
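To make the logprob idea concrete, here is a minimal sketch, assuming you already have a list of per-token log-probabilities for the generated answer (how you get them depends on your API or model). The function name and the three summary scores are illustrative choices, not an established method, and none of them is a calibrated probability of being correct.

```python
import math

def confidence_from_logprobs(token_logprobs):
    """Summarize per-token log-probabilities into rough confidence scores.

    Hypothetical helper: token_logprobs is a list of natural-log
    probabilities, one per generated token.
    """
    if not token_logprobs:
        raise ValueError("need at least one token logprob")
    mean_lp = sum(token_logprobs) / len(token_logprobs)
    return {
        # Average log-probability per token (length-normalized).
        "mean_logprob": mean_lp,
        # Geometric mean of the token probabilities.
        "geo_mean_prob": math.exp(mean_lp),
        # Probability of the single least likely token.
        "min_token_prob": math.exp(min(token_logprobs)),
    }

if __name__ == "__main__":
    # Made-up logprobs purely for illustration.
    print(confidence_from_logprobs([-0.05, -0.2, -1.6, -0.1]))
```

The minimum is there because one very unlikely token can mark the spot where the model was guessing, even when the average looks fine.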

tyre 2 days ago

Anthropic has a great paper on exactly this!

https://www.anthropic.com/research/language-models-mostly-kn...

The best part is how its confidence plummets as it begins to answer “Why are you alive?”

Big same, Claude.