Remix.run Logo
Bolwin a day ago

In their AMA moonshot said it was mainly finetuning

teaearlgraycold a day ago | parent [-]

OpenAI and the other big players clearly RLHF with different users in mind than professionals. They’re optimizing for sycophancy and general pleasantness. It’s beautiful to finally see a big model that hasn’t been warped in this way. I want a model that is borderline rude in its responses. Concise, strict, and as distrustful of me as I am of it.