kingkongjaffa 11 hours ago

Just one more anecdote:

I'm on the enterprise team plan, so I get a decent amount of usage.

In March I could use Opus all day and it was getting great results.

Since the last week of March and into April, I've had sessions where I maxed out session usage in under 2 hours and it got stuck in overthinking loops: multiple turns of realising the same thing, dozens of paragraphs of "But wait, actually I need to do x" with slight variations of the same realisation.

This is not the 'thinking effort' setting in Claude Code. I noticed this happening across multiple sessions with the same thinking effort settings. There was clearly some unpublished underlying change that made the model get stuck in thinking loops more often and for longer, without any escape hatch to stop and prompt the user for additional steering when it gets stuck.

chrsw an hour ago

I'm also an enterprise user and this has been my experience exactly. Same asks, same code bases, same models, much worse results. Everyone on my team is expressing the same thing.

Not only that, but the lack of transparency about what's happening, in clear and simple terms, directly from Anthropic is concerning.

I've already told my org's higher ups that in the current situation we're not close to getting our money's worth with these models.

UqWBcuFx6NV4r 11 hours ago

Whenever I see Opus say “but wait, …”—which is all the time—I get a little bit closer toward throwing my computer out the window. Sometimes I just collapse the thinking section, cross my fingers, and wait for the answer. It’s too frustrating watching the thinking process.

natpalmer1776 7 hours ago

I stop the thinking and manually correct with explicit instructions or direction. I treat my agents like well meaning ivy-league graduate interns. They lack the experience to know what to do sometimes and need a “common sense” direction every now and then.

oldmanhorton 6 hours ago

Have you considered just… writing code? Like we used to in the good old days? If the tool drives you to that point of frustration, maybe it’s time to give the tool a break.

trollbridge 3 minutes ago

A lot of folks aren't "allowed" to write code anymore.

gfody 4 hours ago

This timing matches my experience: enterprise plan, but using Opus from VS Code. I finished a heavy refactor of a large C# codebase in mid-March, tried to do basically the same thing in early April, and couldn't.

adahn 11 hours ago

I’ve seen the point raised elsewhere that this could be the double usage promo that was available from the 13th of March to the 28th, i.e. people getting used to the promo and then feeling impacted when it finished.

Although it seems that enterprise wasn’t included, so maybe not in your case.

https://support.claude.com/en/articles/14063676-claude-march...

cyanydeez 10 hours ago

It sounds like (tinfoil hat) they reduced the quant size of their model and tried to mask the change with the promo. Your theory only addresses the spend, not the reduced reliability.

derangedHorse 10 hours ago

It's probably because you didn't specify "make no mistakes" /s

In all seriousness though, I've observed the same thing with my own usage.