brokencode 9 hours ago
That just means their pretraining data set was older. You can train as many models as you want on the same data. I’m sure all these AI labs have extensive data gathering, cleanup, and validation processes for new data they train the model on. Or at least I hope they don’t just download the current state of the web on the day they need to start training the new model and cross their fingers.