Remix clone Hacker News

new | show | ask | jobs Github

	▲	Bolwin a day ago
		In their AMA moonshot said it was mainly finetuning
	▲	teaearlgraycold a day ago \| parent [-]
		OpenAI and the other big players clearly RLHF with different users in mind than professionals. They’re optimizing for sycophancy and general pleasantness. It’s beautiful to finally see a big model that hasn’t been warped in this way. I want a model that is borderline rude in its responses. Concise, strict, and as distrustful of me as I am of it.