| ▲ | Someone1234 7 hours ago | |
Opus 4.7 is also worse at 256K context. Go look at page 195 and page 196. It is across the board regression, not just 1M context. | ||
| ▲ | RobinL 4 hours ago | parent | next [-] | |
Thanks, interesting. Does this make it more surprising that the other benchmarks have improved? I'm not sure I understand the benchmarks well enough - but I'm wondering whether with agentic workflows it's possible to get away with a smaller more focussed context (and hence lower cost) whilst achieving the same or better performance, because of agentic model's ability to decide what the put in context as they work | ||
| ▲ | timvb 3 hours ago | parent | prev [-] | |
what's all this mean in real world use? | ||