| ▲ | culi 3 hours ago | |
LMArena actually has a nice Pareto frontier plot of Elo vs. price for this:
https://arena.ai/leaderboard/code?viewBy=plot&license=open-s... | ||
| ▲ | logicprog 2 hours ago | parent [-] | |
LMArena isn't very useful as a benchmark; however, I can vouch for the fact that GLM 5.1 is astonishingly good. Several people I know who have a $100/mo Claude Code subscription are considering cancelling it and going all in on GLM, because it has finally become (for them) comparable to Opus 4.5/6. I don't use Opus myself, but I can definitely say that the jump from the (imvho) previous best open-weight model, Kimi K2.5, to this is otherworldly, and K2.5 was already a huge jump itself! | ||