| ▲ | aroman 4 hours ago |
| This makes me think they really are quite capacity constrained at the moment. I had assumed they were primarily limiting it to entice people to upgrade, but I feel like these limits are so low and so temporary (especially over July 4th weekend in the US) that people will barely get a chance to get "used to it" and then think: "man, I can't live without this, I'll pay for API pricing". |
|
| ▲ | timpera 4 hours ago | parent | next [-] |
| That's strange, because they were seemingly way less capacity constrained lately, raised limits and removed the peak hours usage. It's crazy to think that even spending $1.25 billion a month to rent GPUs from SpaceX didn't do much to improve the situation. |
| |
| ▲ | atonse 3 hours ago | parent [-] | | I don't know, I feel like for a few weeks before the SpaceX datacenter, I was just constantly checking my weekly limits. And now after that miraculously, I rarely even come close to hitting my weekly limits. and I still have 5-7 claudes open a day (defaulting to Opus 4.8 xhigh, sometimes ultracode). So I feel that the additional datacenter caused them to just ease up a bit. But demand is also insane, so who knows... | | |
| ▲ | unshavedyak 3 hours ago | parent [-] | | Yea, i'm on x20 and while it has been up and down in terms of token-usage-UX, i feel like its the best its ever been. Context: I entirely use Opus 4.8 fwiw. Now is that because 4.8 is nerfed compared to 4.6 and thus more token efficient? No idea. I just know on x20 with a pretty plain workflow i struggle to use my tokens every week. |
|
|
|
| ▲ | echelon 4 hours ago | parent | prev [-] |
| If it's API pricing, I'm going to ditch Claude Code and switch to a harness that can jump between GLM and Claude Code. Cheap pricing is why I use Claude Code. The minute they fumble that, I'm using Chinese models for 90% of the work. |
| |
| ▲ | matheusmoreira 4 hours ago | parent | next [-] | | Yeah. Their cheap subscriptions are the only reason to keep using them. If they ruin the plans there's nothing holding us back anymore. | |
| ▲ | holoduke 4 hours ago | parent | prev | next [-] | | I dont believe they can afford to switch to API pricing. Everyone will leave.
I am easily spending the equivalent of 1000 dollars a day on tokens with two max subscriptions. that about 400 dollars a month. Thats acceptable for my position. But thats like 30k per month. Totally not viable. | |
| ▲ | Natfan 3 hours ago | parent | prev [-] | | how can a harness switch between glm (a model) and claude code (another harness)? | | |
| ▲ | aroman 3 hours ago | parent [-] | | I've been doing this for ages - you just spin up harness B as a subprocess/tool call from harness A. For example, I had a "/codex-review" claude skill for ages that did exactly that. Technically you're right it wouldn't be switching, since you're right the two ideas are at different altitudes, but I think in practice it has the same impact: within one harness, you can delegate certain tasks to certain models or harnesses. |
|
|