Remix.run Logo
dminik 6 hours ago

I mean, I'm not sure that's the correct read on this.

If you want an Opus class model, it makes sense that you would train on what Opus outputs. But, if you want something better than Opus, training on the same data that Opus was trained on with the same architecture will only result in an Opus class model. Then, if your dataset also contains Opus outputs, many of which are wrong, then it makes sense that the model would have reduced performance.

All this to say that I don't think there's such a thing as a "Model Collapse," but there likely is a "Model Stagnation."

krustyvonklown 4 hours ago | parent [-]

A model trained on all the data X was trained on should be improved to the extent that X is already out of date. A model trained on X itself has all the errors of X and all of it's own. Society itself seems to show that model collapse is entirely possible today and was presumably a problem in the past given the significance placed on citation and going to original sources that predates obsession with credit.