Remix clone Hacker News

new | show | ask | jobs Github

	▲	coppsilgold 12 hours ago
		This is more or less what happens. These models are tuned with reinforcement learning from human feedback (RLHF). Humans give them feedback that this type of language is good. The notorious "it's not X, it's Y" pattern is somewhat rare from actual humans, but it's catnip for the humans providing the feedback.