▲ | CuriouslyC 15 hours ago | |
Oh yeah, Sonnet performance has been in the toilet for me. They claim they've mitigated it but when 4.0 first dropped CC was really impressive, and now I constantly have to babysit it because any time it hits a challenge it'll just stop trying and make a simple toy version and declare false victory. If I don't catch it and I let it build on top of that bullshit, things get nasty in a hurry. It's a shame because the plan is a great deal but the number of all caps and profanity laced messages I'm firing off at Claude is too damned high. | ||
▲ | resonious 8 hours ago | parent | next [-] | |
This hits home for me too. Claude feels like it has gotten more "yes-man"-y. I can no longer trust its judgement. Even if I come in with something dead wrong, I'm "absolutely right" and it finds amazing ways to spin my BS into something vaguely believable. I am also bullying Claude more nowadays. Seeing this thread, I might give Codex another go (I was on Codex CLI before Claude Code. At that time, Claude blew Codex out of the water but something's changed) | ||
▲ | dmazin 10 hours ago | parent | prev | next [-] | |
Yes, this. I feel like I’m going crazy. I pay for the extra Opus usage and I keep checking the model switcher to see if it has automatically switched to Sonnet. It has not. I just have a lot more experiences of it feeling anecdotally dumb lately. | ||
▲ | wahnfrieden 13 hours ago | parent | prev [-] | |
GPT-5 is comparable to Opus without needing to constantly dip back down to Sonnet for cost management |