Remix.run Logo
ants_everywhere 3 days ago

Claude often confidently makes mistakes or asserts false things about a code base. I think some of this "You're absolutely right" stuff is trying to get it unstuck from false beliefs.

By starting the utterance with "You're absolutely right!", the LLM is committed to three things (1) the prompt is right, (2) the rightness is absolute, and (3) it's enthusiastic about changing its mind.

Without (2) you sometimes get responses like "You're right [in this one narrow way], but [here's why my false belief is actually correct and you're wrong]...".

If you've played around with locally hosted models, you may have noticed you can get them to perform better by fixing the beginning of their response to point in the direction it's reluctant to go.