| ▲ | maxwg 18 hours ago | |
The training methods are largely published in their open research papers - though arguably some open weight companies are less open with the exact details. Realistically a model will never be "compiled" 1:1. Copyrighted data is almost certainly used and even _if_ one could somehow download the petabytes of training data - it's quite likely the model would come out differently. The article seems to be talking more about the difficulties of fine tuning models though - a setup problem that likely exists in all research, and many larger OSS projects that get more complicated. | ||
| ▲ | alansaber 17 hours ago | parent [-] | |
Yes the issue is they can embelish the shit out of the papers b/c we only see the final result | ||