Remix clone Hacker News

new | show | ask | jobs Github

	▲	zozbot234 6 hours ago
		Among other things, because you simply can't get those "massive amounts" of text from a SOTA model at reasonable cost. And complex reasoning cannot possibly be trained in a pure one-shot fashion, real post-training takes massive resources. The whole story doesn't add up.