Remix.run Logo
lukev 20 hours ago

I really wish I could see the results of this without RLHF / alignment tuning.

LLMs actually have real potential as a research tool for measuring the general linguistic zeitgeist.

But the alignment tuning totally dominates the results, as is obvious looking at the answers for "who would you vote for in 2024" question. (Only Grok said Trump, with an answer that indicated it had clearly been fine-tuned in that direction.)

jesenator 12 hours ago | parent [-]

Yeah would also be interested to see the responses without RLHF. Not quite the same, but have you interacted with AI base models at all? They're pretty fascinating. You can talk to one on openrouter: https://openrouter.ai/meta-llama/llama-3.1-405b and we're publishing a demo with it soon.

Agreed on RLHF dominating the results here, which I'd argue is a good thing, compared to the alternative of them mimicking training data on these questions. But obviously not perfect, as the demo tries to show.