Remix clone Hacker News

new | show | ask | jobs Github

	▲	slashdave 5 hours ago
		I think you are assuming training from scratch, which I doubt is happening here. Fine-tuning and RL, especially based on synthetic feedback (coding skill, in particular) can be ongoing and is where these models obtain truly useful abilities.