▲ | og_kalu 2 days ago | |||||||
The reason is not strange or unknown. The text completion GPT-3 from 2020 often sounds more natural than 4. The reason is the post training processes. Models are more or less being trained to sound like that during RLHF. Stilted, robotic, like a good little assistant. Open AI, Anthropic have said as much. It's not a limitation of the loss function or even state of the art. | ||||||||
▲ | refulgentis 2 days ago | parent [-] | |||||||
I can't give in to misguided pessimism - "Open AI, Anthropic have said as much" is especially not something I can support! I'm hearing some of the ideas on my corner of llm x creativity Twitter expressed clunkily and if its some irrevocable thing. You're right the default is to speak like an assistant. You're wrong that its forced and immutable and a consequence of RLHF and the companies say its so. https://x.com/jpohhhh/status/1784077479730090346 You're especially wrong that RLHF is undesirable https://x.com/jpohhhh/status/1819549737835528555 https://x.com/jpohhhh/status/1819550145522160044. It's also nigh-trivial to get the completion model back https://x.com/jpohhhh/status/1776434608403325331 I don't know when I'll stop seeing surface-level opinions disguised as cold technological claims on this subject. I would have thought, by now, people doing that would wonder why the wide open lane hasn't been taken, at least once. | ||||||||
|