jumploops 4 days ago
It’s possible they’re using some new architecture to get more up-to-date data, but I think that’d be even more of a headline. My hunch is that this is the same 5.1 post-training applied to a new pretrained base, likely rushed out the door sooner than they initially planned.