| ▲ | phillipcarter 7 hours ago | |
... sigh. I realize there's little that can be done about this, but I just got through a real-world session determining of Opus 4.7 is meaningfully better than Opus 4.6 or GPT 5.4, and now there's another one to try things with. These benchmark results generally mean little to me in practice. Anyways, still exciting to see more improvements. | ||