Remix clone Hacker News

new | show | ask | jobs Github

	▲	djeastm 4 days ago
		I thought reinforcement learning with human feedback was meant to get that quantification of "taste"