| ▲ | XCSme 6 hours ago | |
What's interesting, is that Sonnet 5 is actually worse[0] than 4.6 without reasoning. It makes some sense, as models are trained more and more with reasoning, than without. [0]: https://aibenchy.com/compare/anthropic-claude-sonnet-4-6-non... | ||