mwigdahl 9 hours ago
What? If you're comparing their models in the same size class, Sonnet 5 is Pareto-optimal over Sonnet 4.6.
zamadatix 9 hours ago
I think they mean per dollar in the perf/$ charts, not per marketing class. I.e., the new model is a complete Pareto failure in those perf/$ charts, with the sole exception of Sonnet 5 low, which is dumb enough that there's nothing to compare it against at all. Opus 4.8 delivers a better outcome per dollar, regardless of the underlying size of the models. I'd generously assume this is something about the specific category of agentic task presented in the chart... but it does raise the question: "then why is that category the one they chose to highlight here?"
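
For anyone unsure what "Pareto failure" means on a perf/$ chart, here's a minimal sketch of the dominance check. The names `Point`, `dominates`, and `pareto_frontier` are made up for illustration, and the numbers are hypothetical placeholders, not values read off any real chart.

```python
from typing import NamedTuple


class Point(NamedTuple):
    name: str
    cost: float   # dollars per task, lower is better
    score: float  # benchmark score, higher is better


def dominates(a: Point, b: Point) -> bool:
    """a dominates b if it is no worse on both axes and strictly better on one."""
    no_worse = a.cost <= b.cost and a.score >= b.score
    strictly_better = a.cost < b.cost or a.score > b.score
    return no_worse and strictly_better


def pareto_frontier(points: list[Point]) -> list[Point]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical placeholder data, NOT taken from any published chart.
points = [
    Point("A-low", 0.10, 40.0),   # cheapest config, nothing is cheaper
    Point("A-high", 1.00, 70.0),  # costs more than B and scores lower
    Point("B", 0.80, 75.0),
]

frontier = pareto_frontier(points)
print([p.name for p in frontier])
```

In this toy data, "A-high" is dominated by "B" (cheaper and better), while "A-low" stays on the frontier only because nothing else is as cheap. That's the shape of the argument above: a config can be undominated simply by sitting alone at the cheap end.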