| ▲ | reedf1 15 hours ago | ||||||||||||||||||||||
Given that it is the general consensus that a step function occurred with Opus 4.5/4.6 only 3 months ago - it seems like an insane omission. | |||||||||||||||||||||||
| ▲ | jeremyjh 15 hours ago | parent | next [-] | ||||||||||||||||||||||
This has been the general consensus for about three years now. "Drastic increases in capability have happened the last 3-6 months" have been a constant refrain. Without any data from the study past September I think its not unreasonable, if you want to make an argument based on evidence. For me personally, I agree with you, I'm really seeing it as well. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | Toutouxc 15 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
There's a consensus that SOMETHING changed with Opus 4.5. It might have been the "merge rates" metric, it might have not. I'm certainly getting faster and cleaner-looking solutions for certain issues on Opus 4.6 than I was 5 months ago, but I'm not sure about the ability to solve (or even weigh in) the actual hard stuff, i.e. the stuff I'm paid for. And I'm definitely not sure about the supposed big step between 4.5 and 4.6. I'm literally not seeing any. | |||||||||||||||||||||||
| ▲ | 15 hours ago | parent | prev [-] | ||||||||||||||||||||||
| [deleted] | |||||||||||||||||||||||