Remix clone Hacker News

new | show | ask | jobs Github

	▲	mjburgess 3 hours ago
		The first anthropomorphization of AI which is actually useful.
	▲	Retr0id 3 hours ago \| parent [-]
		It's not even an anthropomorphization, the reward function in RLHF-like scenarios is usually quite literally "did the user think the output was good"