Remix clone Hacker News

new | show | ask | jobs Github

	▲	maxwg 18 hours ago
		The training methods are largely published in their open research papers - though arguably some open weight companies are less open with the exact details. Realistically a model will never be "compiled" 1:1. Copyrighted data is almost certainly used and even _if_ one could somehow download the petabytes of training data - it's quite likely the model would come out differently. The article seems to be talking more about the difficulties of fine tuning models though - a setup problem that likely exists in all research, and many larger OSS projects that get more complicated.
	▲	alansaber 17 hours ago \| parent [-]
		Yes the issue is they can embelish the shit out of the papers b/c we only see the final result