Opus 4 beats all other models on my personal eval set for coding and writing.
Sonnet 4 also beats most models.
A great day for progress.
https://x.com/paradite_/status/1925638145195876511