mohsen1 4 hours ago
I don't know about Mythos, but in recent weeks I've noticed Opus constantly failing to fix things in tsz[0], whereas GPT 5.5 easily churns out fixes that are solid and pass tests. I've stopped paying for Claude for now, and all my money is going to OpenAI at the moment. Either Opus has been massively nerfed or GPT 5.5 is really head and shoulders above it on very difficult tasks. The last percent of conformance tests in tsz are really, really difficult, and I've seen Opus bail again and again. It's so annoying to waste time and tokens only to finally get "this is too involved" or "this requires a multi-week sprint to fix".

[0] https://tsz.dev
_pdp_ 4 hours ago
The new Opus feels like a step backwards. More expensive, thinks more, and it does not get the job done.
dyauspitr 4 hours ago
I've only ever used Codex, never Claude. Does Claude actually say “this is too involved” in response to a prompt?