| ▲ | morleytj 5 hours ago |
| The behind-the-scenes process of deciding when to release these models has got to be insanely stressful if they're coming out within 30 minutes or so of each other. |
|
| ▲ | meisel 5 hours ago | parent | next [-] |
| I wonder if their "5.3" was continuously being updated, with benchmarks regenerated after each improvement, and they just stayed ready to release it when Claude released. |
| |
| ▲ | morleytj 4 hours ago | parent [-] | | This seems plausible. It would be shocking if these companies didn't have an automated test suite that recomputes these benchmarks on a regular basis and uploads the results to a dashboard for review. Given that they already pre-approved the various language and marketing materials, there's no real reason they couldn't just leave it all lined up behind a function call, ready to go live once the key players make the call. |
|
|
| ▲ | Havoc 5 hours ago | parent | prev [-] |
| It’s also functionally unlikely without some sort of insider knowledge or coordination. |
| |
| ▲ | morleytj 5 hours ago | parent [-] | | Could be. It could also be a situation where things are lined up to launch in the near future, and then a mad dash happens on receiving outside news of another launch. I suppose coincidences happen too, but that just seems too unlikely to believe, honestly. Some sort of knowledge leakage does seem like the most likely reason. |
|