| ▲ | bastard_op 6 hours ago |
| I've been doing something a lot like this, using a claude-desktop instance attached to my personal mcp server to spawn claude-code worker nodes for things, and for a month or two now it's been working great using the main desktop chat as a project manager of sorts. I even started paying for the MAX plan as I've been using it effectively to write software now (I am NOT a developer). Lately it's gotten entirely flaky, where chats will just stop working, simply ignoring new prompts, and otherwise go unresponsive. I wondered if maybe I'm pissing them off somehow like the author of this article did. Even worse, Claude seemingly has no real support channel. You get their AI bot, and that's about it. Eventually it will offer to put you through to a human, and then tell you not to wait for them, they'll contact you via email. That email never comes, after several attempts. I'm assuming at this point any real support is all smoke and mirrors, meaning I'm paying for a service now that has become almost unusable, with absolutely NO means of support to fix it. I guess for all the cool tech, customer support is something they have not figured out. I love Claude as it's an amazing tool, but when it starts to implode on itself and you actually require some out-of-box support, there is NONE to be had. Grok seems like the only real alternative, and over my dead body would I use anything from "him". |
|
| ▲ | throwup238 6 hours ago | parent | next [-] |
| Anthropic has been flying by the seat of their pants for a while now and it shows across the board: from the terminal flashing bug that's been around for months, to the lack of support, to instabilities in Claude mobile and Claude Code for the web (I get 10-20% message failure rates on the former and 5-10% on CC for web). They're growing too fast and the company is bursting at the seams. If there's ever a correction in the AI industry, I think that will all quickly come back to bite them. It's like Claude Code is vibe-operating the entire company. |
| |
| ▲ | laserDinosaur 3 hours ago | parent | next [-] | | The Pro plan quota seems to be getting worse. I can get maybe 20-30 minutes of work done before I hit my 4 hour quota. I found myself using it more just for the planning phase to get a little more time out of it, but yesterday I managed to ask it ONE question in plan mode (from a fresh quota window), and while it was thinking it ran out of quota. I'm assuming it probably pulled in a ton of references from my project automatically and blew out the token count. I find I get good answers from it when it does work, but it's getting very annoying to use. (On the flip side, Codex seems SO efficient with tokens that it can be hard to understand its answers sometimes; it rarely pulls in files unless you add them manually, and it often takes quite a few attempts to get the right answer because it's so strict about what it does each iteration. But I never run out of quota!) | |
| ▲ | aanet 2 hours ago | parent | next [-] | | ^ THIS I've run out of quota on my Pro plan so many times in the past 2-3 weeks. This seems to be a recent occurrence. And I'm not even that active. Just one project, execute in Plan > Develop > Test mode, just one terminal. That's it. I keep getting a quota reset every few hours. What's happening @Anthropic ?? Anybody here who can answer?? | | |
| ▲ | bmurphy1976 2 minutes ago | parent | next [-] | | I've been hitting the limit a lot lately as well. The worst part is that I try to compact things and check my limits using the / commands and can't make heads or tails of how much I actually have left. It's not clear at all. I've been using CC until I run out of credits and then switching to Cursor (my employer pays for both). I prefer Claude, but I never hit any limits in Cursor. |
| ▲ | vbezhenar 23 minutes ago | parent | prev | next [-] | | This whole API vs plan split looks weird to me. Why not force everyone to use the API? You pay for what you use; it's very simple. The API should be the most honest way to monetize, right? The fixed subscription plan with vaguely specified quotas looks like a way to extract extra money from users who pay $200 and don't use that much value, while at the same time preventing other users from going over $200. I understand that it might work at scale, but it just feels a bit unfair to everyone. |
| ▲ | genewitch 2 hours ago | parent | prev | next [-] | | sounds like the "thinking tokens" are a mechanism to extract more money from users? | | |
| ▲ | vunderba an hour ago | parent | next [-] | | Anecdotal, but it definitely feels like in the last couple of weeks CC has gotten more aggressive about pulling in significantly larger chunks of an existing code base - even for some simple queries I'll see it easily ramp up to 50-60k tokens of usage. | |
| ▲ | genewitch 43 minutes ago | parent [-] | | I'm curious if anyone has logged the number of thinking tokens over time. My implication was that the "thinking/reasoning" modes are a way for LLM providers to put their thumb on the scale of how much the service costs. They get to see (unless you opt out) your context, ideas, source code, etc., and in return you give them $220 and they give you back "out of tokens". | |
| ▲ | throwup238 20 minutes ago | parent [-] | | > My implication was that the "thinking/reasoning" modes are a way for LLM providers to put their thumb on the scale of how much the service costs. It's also a way to improve performance on the things their customers care about. I'm not paying Anthropic more than I do for car insurance every month because I want to pinch ~~pennies~~ tokens; I do it because I can finally offload a ton of tedious work onto Opus 4.5 without hand-holding it and reviewing every line. The subscription is already such a great value over paying by the token that they've got plenty of room to find the right balance. |
|
| |
| ▲ | mystraline an hour ago | parent | prev [-] | | It's the clanker version of the "Check Wallet" light (like a check engine light). |
| |
| ▲ | fragmede an hour ago | parent | prev [-] | | How quickly do you also hit compaction when running? Also, if you open a new CC instance and run /context, what percentage does it show for tools/memories/skills? And that's before we look at what you're actually doing. CC will add whatever context it thinks is necessary to each prompt. So if you've got a small number of large files (vs a large number of smaller files), at some level that'll contribute to the problem as well. Quota is basically a count of tokens, so if a new CC session starts with the context relatively full, that could explain what's going on. Also, what language is this project in? If it's something noisy that uses up many tokens fast, even if you're using agents to preserve the context window in the main CC, those tokens still count against your quota, so you'd still be hitting it awkwardly fast. |
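(A rough way to sanity-check the "few large files" point: estimate each file's token footprint with the common ~4-characters-per-token rule of thumb. The glob pattern, the 200k context figure, and the chars/4 ratio below are assumptions and approximations, not anything Claude Code actually reports.)

    from pathlib import Path

    CONTEXT_WINDOW = 200_000  # tokens, approximate; treat as a ballpark figure

    def rough_tokens(path: Path) -> int:
        # ~4 characters per token is a crude rule of thumb; real tokenizers differ
        return len(path.read_text(errors="ignore")) // 4

    # ten largest .py files under the current directory (pattern is just an example)
    files = sorted(
        (p for p in Path(".").rglob("*.py") if p.is_file()),
        key=lambda p: p.stat().st_size,
        reverse=True,
    )[:10]

    total = 0
    for f in files:
        t = rough_tokens(f)
        total += t
        print(f"{f}: ~{t:,} tokens")

    print(f"top {len(files)} files: ~{total:,} tokens "
          f"(~{100 * total / CONTEXT_WINDOW:.0f}% of a 200k-token window)")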
| |
| ▲ | stareatgoats 3 hours ago | parent | prev [-] | | Claude Code allegedly auto-includes the currently active file and often all visible tabs and sometimes neighboring files it thinks are 'related' - on every prompt. The advice I got when scouring the internets was primarily to close everything except the file you’re editing and maybe one reference file (before asking Claude anything). For added effect add something like 'Only use the currently open file. Do not read or reference any other files' to the prompt. I don't have any hard facts to back this up, but I'm sure going to try it myself tomorrow (when my weekly cap is lifted ...). |
| |
| ▲ | IgorPartola 2 hours ago | parent | prev | next [-] | | You are giving me flashbacks to The Big Short, where the guy goes to investigate mortgages and knocks on some random person's door to ask about a house/mortgage, just to learn that it belongs to a dog. Imagine finding out that Anthropic employs no humans at all. Just an AI that has fired everyone and been working on its own releases and press releases since. |
| ▲ | sixtyj 5 hours ago | parent | prev | next [-] | | They whistleblowed themselves that Claude Cowork was coded by Claude Code… :) | | |
| ▲ | throwup238 4 hours ago | parent | next [-] | | You can tell they're all vibe coded. The Claude iOS app, Claude on the web (including Claude Code on the web) and Claude Code are some of the buggiest tools I have ever had to use on a daily basis. I'm including monstrosities like Altium and Solidworks and Vivado in the mix - software that actually does real shit constrained by the laws of physics rather than slinging basic JSON and strings around over HTTP. It's an utter embarrassment to the field of software engineering that they can't even hit a single nine of reliability in their consumer-facing products, and if it weren't for the advantage Opus has over other models, they'd be dead in the water. | |
| ▲ | loopdoend an hour ago | parent | next [-] | | Single nine reliability would be 90% uptime lol. For 99.9% we call it triple 9 reliability. | | |
| ▲ | throwup238 an hour ago | parent [-] | | Single 9 would be 90%, which is roughly what I'm experiencing between CC for web and the Claude iOS app. About 1 in 10 messages fail because of an unknown error and 1 in 10 CC for web sessions die irrecoverably. It'd probably be worse except for the fact that CC's bugs in the terminal aren't showstoppers like they are on web/mobile. The only way Anthropic has two or three nines is in read-only mode, but that'd be like measuring AWS by console uptime while ignoring the actual control plane. |
| |
| ▲ | cactusplant7374 3 hours ago | parent | prev [-] | | You're right. https://github.com/anthropics/claude-code/issues Codex has fewer, but they also had quite a few outages in December. And I don't think Codex is as popular as Claude Code, but that could change. |
| |
| ▲ | notsure2 5 hours ago | parent | prev [-] | | Whistleblowed dog food. | | |
| |
| ▲ | Bombthecat 6 hours ago | parent | prev [-] | | Well, they vibe code almost every tool at least | | |
| ▲ | tuhgdetzhh 5 hours ago | parent [-] | | Claude Code has accumulated so much technical debt (+emojis) that Claude Code can no longer code itself. | |
|
|
|
| ▲ | unyttigfjelltol 4 hours ago | parent | prev | next [-] |
| > I'm paying for a service now that has become almost unusable, with absolutely NO means of support to fix it. Isn’t the future of support a series of automations and LLMs? I mean, have you considered that the AI bot is their tech support, and that it’s about to be everyone else’s approach too? |
| |
| ▲ | b00ty4breakfast 4 hours ago | parent [-] | | Support has been automated for a while, LLMs just made it even less useful (and it wasn't very useful to begin with; for over a decade it's been a Byzantine labyrinth of dead-ends, punji-pits and endless hours spent listening to smooth jazz). |
|
|
| ▲ | uxcolumbo 5 hours ago | parent | prev | next [-] |
| Have you tried any of the leading open weight models, like GLM etc.? And how do ChatGPT or Gemini compare? And kudos for refusing to use anything from the guy who's OK with his platform proliferating generated CSAM. |
| |
| ▲ | Leynos 3 hours ago | parent [-] | | I tried GLM 4.7 in Opencode today. In terms of capability and autonomy, it's about on par with Sonnet 3.7. Not terrible for a 10th the price of an Anthropic plan, but not a replacement. |
|
|
| ▲ | hecanjog 6 hours ago | parent | prev | next [-] |
| > I've been using it effectively to write software now (I am NOT a developer) What have you found it useful for? I'm curious about how people without software backgrounds work with it to build software. |
| |
| ▲ | bastard_op 4 hours ago | parent | next [-] | | About my not having a software background: I've been a network/security/systems engineer/architect/consultant for 25 years, but never did dev work. I can read and follow code well enough to debug things, but I've never had the knack to learn languages and write my own. Never really had to, but wanted to. This now lets me apply my IT and business experience toward making bespoke code for my own uses so far, such as firewall config parsers specialized for wacky vendor CLIs and filling in gaps in automation when there are no good vendor solutions for a given task. I started building my mcp server to let me use agents to interact with the outside world - invoking automation for firewalls, switches, routers, servers, even home automation ideally - and I've been successful so far in doing so, still without having to know any code. I'm sure a real dev will find it to be a giant pile of crap in the end, but I've been doing things like applying security frameworks and code style guidelines using ruff to keep it from going too wonky, and actually working it up to a state I can call a 1.0. I plan to run a full audit cycle against it - security audits, performance testing, and whatever else I can - to avoid it being entirely craptastic. If nothing else, it works for me, so others can take it or not once I put it out there. Even being NOT a developer, I understand the need for applying best practices, and after watching a lot of really terrible developers adjacent to me make a living over the years, I think I can offer a thing or two in avoiding that. |
| ▲ | bastard_op 4 hours ago | parent | prev | next [-] | | I started using claude-code, but found it pretty useless without any ability to talk to other chats. Claude recommended I make my own MCP server, so I did. I built a wrapper script that uses Anthropic's sandbox-runtime toolkit to invoke claude-code in a project with tmux, and my MCP server allows desktop to talk to tmux. Later I built in my own filesystem tools, and now it just spawns konsole sessions for itself, invoking workers that read tasks it drops into my filesystem; it points claude-code at a task and runs until it commits code, and then I have the PM in desktop verify it and do the final push/PR/merge. I use an approval system in a GUI to tell me when Claude is trying to use something, and I set an approve-for period to let it do its thang. Now I've been using it to build on the MCP server itself, which I now call endpoint-mcp-server (coming soon to a github near you). I've modularized it with plugins, adding lots more features and a more versatile Qt6 GUI with advanced workspace panels and widgets. At least I was, until Claude started crapping the bed lately. |
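(For the curious: a very rough sketch of the kind of tool described above, using the official MCP Python SDK: expose a "spawn worker" tool that drops a task file and starts claude-code inside a detached tmux session. The tool names, file layout, and claude flags here are illustrative guesses, not the actual endpoint-mcp-server code.)

    import subprocess
    from pathlib import Path

    from mcp.server.fastmcp import FastMCP  # official MCP Python SDK

    mcp = FastMCP("worker-spawner")

    @mcp.tool()
    def spawn_worker(session: str, task: str, project_dir: str) -> str:
        """Write a task file and launch claude-code on it in a detached tmux session."""
        task_file = Path(project_dir) / f"{session}.task.md"
        task_file.write_text(task)
        # `claude -p` runs a single non-interactive prompt; exact flags (sandboxing,
        # permissions, model) will vary with your setup.
        cmd = f'cd {project_dir} && claude -p "Read {task_file.name} and complete the task."'
        subprocess.run(["tmux", "new-session", "-d", "-s", session, cmd], check=True)
        return f"worker started in tmux session {session!r}"

    @mcp.tool()
    def worker_output(session: str) -> str:
        """Peek at what a worker is doing by capturing its tmux pane."""
        out = subprocess.run(
            ["tmux", "capture-pane", "-p", "-t", session],
            capture_output=True, text=True, check=True,
        )
        return out.stdout

    if __name__ == "__main__":
        mcp.run()  # stdio transport, so a desktop MCP client can attach to it

(Registered in a desktop client's MCP config, tools along these lines are what let a "project manager" chat hand work to claude-code workers and check on them.)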
| ▲ | ofalkaed 4 hours ago | parent | prev [-] | | My use is considerably simpler than GP's, but I use it anytime I get bogged down in the details and lose my way: just have Claude handle that bit of code and move on. It's also good for any block of code that breaks often as the program evolves; Claude has much better foresight than I do, so I replace that code with a prompt. I enjoy programming but it is not my main interest, and I can't justify the time required to get competent, so I let Claude and ChatGPT pick up my slack. |
|
|
| ▲ | spike021 6 hours ago | parent | prev | next [-] |
| > where chats will just stop working, simply ignoring new prompts, and otherwise go unresponsive I had this start happening around August/September, and by December or so I chose to cancel my subscription. I haven't noticed this at work, so I'm not sure if they're prioritizing certain seats or how that works. |
| |
| ▲ | sawjet 3 hours ago | parent [-] | | I have noticed this when switching locations on my VPN. Some locations are stable, and some will regularly drop the connection while the response is streaming. | |
| ▲ | fragmede 2 hours ago | parent [-] | | The Peets right next to the Anthropic office could be selling VPN endpoint service for quite the premium! |
|
|
|
| ▲ | Bombthecat 6 hours ago | parent | prev | next [-] |
| Have a Max plan, didn't use it much the last few days. Just used it to explain a few things to me, with examples, for a TTRPG. It just hung up a few times. A Max plan, and on average I use it ten times a day? Yeah, I'm cancelling. Guess they don't need me |
| |
| ▲ | bastard_op 4 hours ago | parent [-] | | That's about what I'm getting too! It just literally stops at some point, and any new prompt starts, then immediately stops. This was even in a fairly short conversation with maybe 5-6 back-and-forth exchanges. |
|
|
| ▲ | thtmnisamnstr 5 hours ago | parent | prev | next [-] |
| Gemini CLI is a solid alternative to Claude Code. The limits are restrictive, though. If you're paying for Max, I can't imagine Gemini CLI will take you very far. |
| |
| ▲ | samusiam an hour ago | parent | next [-] | | Gemini CLI isn't even close to the quality of Claude Code as a coding harness. Codex and even OpenCode are much better alternatives. | |
| ▲ | bastard_op 4 hours ago | parent | prev | next [-] | | I tried Gemini like a year or so ago, and I gave up after it flatly refused to write me a script and instead tried to tell me how to learn to code. I am not making this up. | |
| ▲ | mkl 4 hours ago | parent [-] | | That's at least two major updates ago. Probably worth another try. |
| |
| ▲ | Conscat 5 hours ago | parent | prev | next [-] | | Gemini CLI regularly gets stuck failing to do anything after declaring its plan to me. There seems to be no way to un-lock it from this state except closing and reopening the interface, losing all its progress. | | |
| ▲ | genewitch 2 hours ago | parent [-] | | You should be able to copy the entire conversation and paste it back in (including thinking/reasoning tokens). When you have a conversation with an AI, in simple terms, every time you type a new line and hit enter, the client sends the entire conversation to the LLM. It has always worked this way, and it's how "reasoning tokens" were first realized: you allow a client to "edit" the context, the client deletes the hallucination, then appends "Wait..." at the end of the context and hits enter. The LLM is tricked into thinking it's confused/wrong/unsure, and "reasons" more about that particular thing. |
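(A minimal sketch of that mechanism, shown with the Anthropic Python SDK purely for illustration; the model id and message contents are placeholders, and the same stateless-history idea applies to any chat LLM API. Every request carries the full history, so the client is free to rewrite it and end on a partial assistant turn that the model then continues.)

    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

    history = [
        {"role": "user", "content": "Why does my build script hang?"},
        # The client has edited the model's earlier reply and tacked "Wait..." onto
        # the end; ending the request on a partial assistant turn makes the model
        # continue it, i.e. re-reason about that point.
        {"role": "assistant", "content": "The hang is caused by the network check. Wait..."},
    ]

    reply = client.messages.create(
        model="claude-sonnet-4-5",  # placeholder model id
        max_tokens=1024,
        messages=history,
    )
    print(reply.content[0].text)  # continuation of the edited assistant turn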
| |
| ▲ | andrewinardeer 4 hours ago | parent | prev [-] | | Kilocode is a good alt as well. You can plug into OpenRouter or Kilocode to access their models. |
|
|
| ▲ | syntaxing 6 hours ago | parent | prev [-] |
| Serious question: why are codex and mistral (vibe) not a real alternative? |
| |
| ▲ | pixelmelt 3 hours ago | parent | next [-] | | The Claude models are still the best at what they do. Right now GLM is just barely scratching Sonnet 4.5 quality, Mistral isn't really usable for real codebases, and Gemini is in a weird spot where it's sometimes better than Claude at small targeted changes but randomly goes off the rails. I haven't tried Codex recently, but the last time I did, the model thought for 27 minutes straight and then gave me about the same (incorrect) output that Opus would have in 20 seconds. Anthropic's models are their only moat, as demonstrated by their cutting off tools other than Claude Code on their coding plans. |
| ▲ | bastard_op 4 hours ago | parent | prev | next [-] | | I tried codex, using my same sandbox setup with it. Normally I work with sonnet in code, but it was stuck on a problem for hours, and I thought hmm, let me try codex. Codex just started monkey patching stuff and broke everything within like 3-4 prompts. I said f-this, went back to my last commit, and tried Opus this time in code, which fixed the problem within 2 prompts. So yeah, codex kinda sucks to me. Maybe I'll try mistral. | |
| ▲ | 5 hours ago | parent | prev [-] | | [deleted] |
|