Remix.run Logo
maxwg 18 hours ago

The training methods are largely published in their open research papers - though arguably some open weight companies are less open with the exact details.

Realistically a model will never be "compiled" 1:1. Copyrighted data is almost certainly used and even _if_ one could somehow download the petabytes of training data - it's quite likely the model would come out differently.

The article seems to be talking more about the difficulties of fine tuning models though - a setup problem that likely exists in all research, and many larger OSS projects that get more complicated.

alansaber 17 hours ago | parent [-]

Yes the issue is they can embelish the shit out of the papers b/c we only see the final result