| ▲ | lanthissa 2 hours ago | |
did they not pay them enough to get good ratings on the other 3 models? whats the logic in claiming its a borked metric when everything listed is an anthropic model. | ||
| ▲ | Narretz 2 hours ago | parent [-] | |
There a few benchmarks out there where all existing models have abysmal scores. So it's not actually a problem if Antrophic's older models are bad, especially if the jump to the newest model is huge, and the competition is also way below it. | ||