Remix clone Hacker News

new | show | ask | jobs Github

	▲	tekne a day ago
		The raw pretrained models make the errors, I believe -- we then reinforcement-learn them out.
	▲	Tomte a day ago \| parent [-]
		That‘s interesting! Do you have a paper or blog post or so at hand that shows examples of raw and RL‘ed output?