summarity 5 hours ago

Not Claude Code specific, but I've been noticing this on Opus 4.6 models through Copilot and others as well. Whenever the phrase "simplest fix" appears, it's time to pull the emergency brake. This has gotten much, much worse over the past few weeks. It will produce completely useless code, knowingly (because up to that phrase the reasoning was correct) breaking things.

Today another thing started happening: phrases like "I've been burning too many tokens" or "this has taken too many turns". Ironically, overriding that takes even more tokens of custom instructions.

Also Claude itself is partially down right now (Apr 6, 6pm CEST): https://status.claude.com/

andoando 4 hours ago | parent | next [-]

I've been noticing something similar recently. If something's not working out, it'll be like "OK, this isn't working out, let's just switch to doing this other thing instead that you explicitly said not to do".

For example, I wanted to get VNC working with PopOS Cosmic and it'll be like "ah, it's OK, we'll just install sway and that'll work!"

albert_e 3 hours ago | parent | next [-]

Experienced this -- was repeatedly directing CC to use the Claude in Chrome extension to interact with a webpage, and it kept invoking Playwright MCP instead.

robotswantdata 3 hours ago | parent | prev | next [-]

It’s as if it gives up. I respond: keep going with the original plan, you can do it, champ!

rootnod3 4 hours ago | parent | prev [-]

[flagged]

andoando 4 hours ago | parent [-]

?

satvikpendem 2 hours ago | parent [-]

They're saying just do it yourself instead of trying to herd an unpredictable animal like an LLM into doing your bidding.

robwwilliams 4 hours ago | parent | prev | next [-]

Yes, and over the last few weeks I have noticed that on long-context discussions Opus 4.6e does its best to encourage me to call it a day and wrap it up; repeatedly. Mother Anthropic is giving preprompts to Claude to terminate early and in my case always prematurely.

TonyAlicea10 2 hours ago | parent | next [-]

I've noticed this as well. "Now you should stop X and go do Y" is a phrase I see repeated a lot. Claude seems primed to instruct me to stop using it.

logicchains 4 hours ago | parent | prev [-]

Try Codex, it's a breath of fresh air in that regard, tries to do as much as it can.

onlyrealcuzzo 4 hours ago | parent | prev | next [-]

> Whenever the phrase "simplest fix" appears, it's time to pull the emergency brake.

Second! In CLAUDE.md, I have a full section NOT to ever do this, and how to ACTUALLY fix something.

This has helped enormously.

aktenlage a minute ago | parent | next [-]

Where is that? I found "Return the simplest working solution. No over-engineering." which sounds more like the simplest fix.

bowersbros 4 hours ago | parent | prev | next [-]

Any chance you could share those sections of your claude file? I've been using Claude a bit lately but mostly with manual changes, not got much in the way of the claude file yet and interested in how to improve it

onlyrealcuzzo 2 hours ago | parent [-]

https://github.com/cuzzo/easy-vm/blob/master/CLAUDE.md

LeoPanthera 2 hours ago | parent [-]

Typo - "proove". "Prove" only has one O.

onlyrealcuzzo 2 hours ago | parent [-]

Thank you!

causal 3 hours ago | parent | prev | next [-]

I switched from Cursor to Claude because the limits are so much higher but I see Anthropic playing a lot more games to limit token use

talim 4 hours ago | parent | prev [-]

What wording do you use for this, if you don't mind? This thread is a revelation, I have sworn that I've seen it do this "wait... the simplest fix is to [use some horrible hack that disregards the spec]" much more often lately so I'm glad it's not just me.

However I'm not sure how to best prompt against that behavior without influencing it towards swinging the other way and looking for the most intentionally overengineered solutions instead...

twalichiewicz 4 hours ago | parent | next [-]

My own experience has been that you really just have to be diligent about clearing your cache between tasks, establishing a protocol for research/planning, and for especially complicated implementations reading line-by-line what the system is thinking and interrupting the moment it seems to be going bad.

If it's really far off the mark, revert back to where you originally sent the prompt and try to steer it more, if it's starting to hesitate you can usually correct it without starting over.

aforwardslash an hour ago | parent [-]

That is generally my experience as well. Claude half-assing work or skipping stuff because it "takes too much time" is something I've been experiencing since I started using it (May 2025). Forcing it to create and review an implementation plan, and then reviewing the implementation cross-referenced with the plan, almost always produces consistent results in my case.

onlyrealcuzzo 2 hours ago | parent | prev | next [-]

https://github.com/cuzzo/easy-vm/blob/master/CLAUDE.md

imiric 3 hours ago | parent | prev [-]

Make sure to use "PRETTY PLEASE" in all caps in your `SOUL.md`. And occasionally remind it that kittens are going to die unless it cooperates. Works wonders.

mghackerlady 2 hours ago | parent | next [-]

I love how despite how cold and inhuman LLMs are, we've at least taught them to respect the lives of kittens

KaoruAoiShiho 2 hours ago | parent | prev [-]

Can you paste the relevant section in your soul please?

psadauskas 4 hours ago | parent | prev | next [-]

I need to add another agent that watches the first, and pulls the plug whenever it detects "Wait, I see the problem now..."
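
A rough sketch of what such a watcher could look like, assuming the first agent's output arrives as a stream of text chunks. The phrase list and the function names (`find_trigger`, `watch`) are made up for illustration, and how you actually interrupt the agent depends on the tool you run it in:

```python
import re
from typing import Iterable, Optional

# Hypothetical trigger phrases, taken from comments in this thread.
TRIGGER_PHRASES = (
    "simplest fix",
    "wait, i see the problem now",
    "burning too many tokens",
    "taken too many turns",
)


def find_trigger(text: str, phrases: Iterable[str] = TRIGGER_PHRASES) -> Optional[str]:
    """Return the first listed phrase that occurs in text, or None."""
    # Lowercase and collapse whitespace so line wraps don't hide a phrase.
    normalized = re.sub(r"\s+", " ", text.lower())
    for phrase in phrases:
        if phrase in normalized:
            return phrase
    return None


def watch(chunks: Iterable[str]) -> Optional[str]:
    """Scan streamed output; stop at the first trigger phrase and return it."""
    buffer = ""
    for chunk in chunks:
        # Accumulate so a phrase split across two chunks is still caught.
        buffer += chunk
        hit = find_trigger(buffer)
        if hit is not None:
            return hit  # the caller decides how to pull the plug
    return None
```

Plain substring matching is deliberately dumb: it will also fire on harmless uses of the same words, so treat a hit as a cue to go read the reasoning, not as proof that it went wrong.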

iterateoften 3 hours ago | parent | prev | next [-]

Yeah, it’s so frustrating to have to constantly ask for the best solution, not the easiest / quickest / least disruptive.

I have in CLAUDE.md that it’s a greenfield project, to only present complete holistic solutions, not fast patches, etc., but I still have to watch its output.

giwook 4 hours ago | parent | prev | next [-]

I think in general we need to be highly critical of anything LLMs tell us.

pixel_popping 4 hours ago | parent [-]

Claude code shows: OAuth error: timeout of 15000ms exceeded

giwook 4 hours ago | parent [-]

Maybe a local or intermittent issue? Working for me.

pixel_popping 4 hours ago | parent [-]

Seems solved now indeed.

pixel_popping 4 hours ago | parent | prev | next [-]

It's a bit insane that they can't figure out a cryptographic way for the delivery of the Claude Code Token, what's the point of going online to validate the OAuth AFTER being issued the code, can't they use signatures?

mikepurvis 4 hours ago | parent | prev | next [-]

That helps explain why my sessions signed themselves out and won't log back in.

me_vinayakakv 4 hours ago | parent [-]

I just experienced this some time ago and still can't sign in.

Their status page shows everything is okay.

simooooo 3 hours ago | parent | prev | next [-]

How complex are we talking? I one shotted a game boy emulator in <6 minutes today

root_axis 3 hours ago | parent | next [-]

There are countless reference examples online, that's just a slower, buggier, and more expensive git clone.

TimTheTinker an hour ago | parent [-]

Yep. If you ask Claude to create a drop-in replacement for an open-source project that passes 100% of the test suite of the project, it will basically plagiarize the project wholesale, even if you changed some of the requirements.

whateveracct 2 hours ago | parent | prev [-]

try one shotting something actually original and see how it goes

i keep getting nonsense

nikanj 4 hours ago | parent | prev | next [-]

”I can’t make this api work for my client. I have deleted all the files in the (reference) server source code, and replaced it with a python version”

Repeatedly, too. Had to make the server reference sources read-only as I got tired of having to copy them over repeatedly
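
For anyone wanting to do the same, a minimal sketch of the read-only trick, assuming a POSIX-style filesystem. The function name `make_tree_read_only` is made up here, and this is not necessarily how it was done above:

```python
import stat
from pathlib import Path

# All three write bits: owner, group, other.
WRITE_BITS = stat.S_IWUSR | stat.S_IWGRP | stat.S_IWOTH


def make_tree_read_only(root: str) -> int:
    """Clear the write bits on every regular file under root.

    Returns the number of files touched. Directories are left alone.
    """
    changed = 0
    for path in Path(root).rglob("*"):
        if path.is_file() and not path.is_symlink():
            mode = path.stat().st_mode
            path.chmod(mode & ~WRITE_BITS)
            changed += 1
    return changed
```

This only guards against casual overwrites. An agent running as the same user can chmod the files back, or delete and recreate them as long as the directories stay writable, so keep a clean copy somewhere else too.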

mavamaarten 2 hours ago | parent [-]

Haha yeah. I once asked it to make a field in an API response nullable, and to gracefully handle cases where that might be an issue (it was really easy, I was just lazy and could have done it myself, but I thought it was the perfect task for my AI idiot intern to handle). Sure, it said. Then it was bored of the task and just deleted the field altogether.

j45 3 hours ago | parent | prev | next [-]

Certain phrases provoke an over-response: it tries to course-correct, which makes things worse because it's inclined to double down on the wrong path it's already on.

rootnod3 4 hours ago | parent | prev [-]

The cope is hard. At this point, just admit that LLM tech is doomed and sucks.

subscribed 3 hours ago | parent | next [-]

But it was clearly really good before the regression; the original link (analysis) says as much.

randomNumber7 3 hours ago | parent | prev | next [-]

Just because some people try to use a hammer as a screwdriver it doesn't follow that the hammer sucks.

r_lee 3 hours ago | parent | prev [-]

how is it "doomed"?