Remix.run Logo
Terretta 35 minutes ago

> One of the most prominent improvements in Opus 4.8 is its honesty. We train all our models to be honest

On the contrary, they appear trained to say "Honestly" or "I have to be transparent with you" at inverse proportion to certainty.

Put another way, if they are certain, they don't use "Honestly", and if they are just wrong, or know they don't know, they don't use "Honestly".

They use "honestly" on the bubble, to the degree it's a tell that whatever it's asserting or doing is shakily grounded, sketchy or lazy work, or a host of other reasons you shouldn't trust it.

This training seems instead to be making it performatively punch up claims it cannot substantiate.

See also: https://news.ycombinator.com/item?id=48312182