| ▲ | vessenes 13 hours ago | ||||||||||||||||||||||
This is super exciting. I've been poking at it today, and it definitely changes my workflow -- I feel like a full three- or four-hour parallel coding session with subagents now generally fits into a single master session. The stats claim Opus at 1M is about on par with 5.4 at 256k -- these needle-in-a-haystack long-context tests don't always track reasoning quality, sadly -- but this is still a significant improvement, and I haven't seen dramatic falloff in my tests, unlike the Q4 '25 models. P.S. What's up with Sonnet 4.5 getting comparatively better as context got longer? | |||||||||||||||||||||||
| ▲ | steve-atx-7600 12 hours ago | parent | next [-] | ||||||||||||||||||||||
Did it get better? I used Sonnet 4.5 1M frequently, and my impression was that it had around the same performance but was a hell of a lot faster, since the 1M model was willing to spend more tokens at each step vs. preferring more token-cautious tool calls. | |||||||||||||||||||||||
| ▲ | mattfrommars 12 hours ago | parent | prev [-] | ||||||||||||||||||||||
Random: are you personally paying for Claude Code, or is it paid for by your employer? My employer only pays for the GitHub Copilot extension. | |||||||||||||||||||||||