Wow, bad enough for them to actually publish something and not cryptic tweets from employees.

Damage is done for me though. Even just one of these things (messing with adaptive thinking) is enough for me to not trust them anymore. And then their A/B testing this week on pricing.

▲

saghm 9 hours ago | parent | next [-]

The A/B testing is by far the most objectionable thing from them so far in my opinion, if only because of how terrible it would be for something like that to be standard for subscriptions. I'd argue that it's not even A/B testing of pricing but silently giving a subset of users an entirely different product than they signed up for; it would be like if 2% of Netflix customers had full-screen ads pop up and cover the videos randomly throughout a show. Historically the only thing stopping companies from extraordinarily user-hostile decisions has been public outcry, but limiting it to a small subset of users seems like it's intentionally designed to try to limit the PR consequences.

	▲	lifthrasiir 9 hours ago \| parent [-]
		The best possible situation that I can imagine is that Anthropic just wanted to measure how much value does Claude Code have for Pro users and didn't mean to change the plan itself (so those users would get CC as a "bonus"), but that alone is already questionable to start with.

▲

polishdude20 5 hours ago | parent | prev | next [-]

Bruce here from the Twitter team.

I got finally fired.

▲

mannanj 9 hours ago | parent | prev [-]

so who do you trust and go to? (NotClearlySo)OpenAI?

▲

carlgreene 9 hours ago | parent | next [-]

I "subconsciously" moved to codex back in mid Feb from CC and it's been so freaking awesome. I don't think it's as good at UI, but man is it thorough and able to gather the right context to find solutions.

I use "subconsciously" in quotes because I don't remember exactly why I did it, but it aligns with the degradation of their service so it feels like that probably has something to do with it even though I didn't realize it at the time.

	▲	cageface 5 hours ago \| parent \| next [-]
		Codex does better if you ask it to take screenshots and critique its own UI work and iterate. It rarely one-shots something I like but it can get there in steps.
	▲	GenerWork 8 hours ago \| parent \| prev \| next [-]
		Anthropic definitely takes the cake when it comes to UI related activities (pulling in and properly applying Figma elements, understanding UI related prompts and properly executing on it, etc), and I say this as a designer with a personal Codex subscription.
	▲	snissn 9 hours ago \| parent \| prev \| next [-]
		it's been frustrating how bad it is at UI. I'm starting to test out using their image2 for UI and then handing it to codex to build out the images into code and I'm impressed and relieved so far
	▲	cmrdporcupine 7 hours ago \| parent \| prev [-]
		Codex isn't great at UI, but you might find Gemini is competent enough as an adjunct. I've had some luck with that.

▲

simlevesque 9 hours ago | parent | prev | next [-]

I went with MiniMax. The token plans are over what I currently need, 4500 messages per 5h, 45000 messages per week for 40$. I can run multiple agents and they don't think for 5-10 minutes like Sonnet did. Also I can finally see the thinking process while Anthropic chose to hide it all from me.

I'm using Zed and Claude Code as my harnesses.

▲

Robdel12 9 hours ago | parent | prev | next [-]

At the moment, yeah. If Google ever figures out how to build an agentic model, I would use them as well.

However you feel about OpenAI, at least their harness is actually open source and they don’t send lawyers after oss projects like opencode

▲

IncreasePosts 7 hours ago | parent [-]

Is Gemini cli not an agentic model? Or are you just saying it's built poorly? Gemini 2.5 didn't really work for me but Gemini 3 seems fairly solid

	▲	cmrdporcupine 7 hours ago \| parent [-]
		Gemini fairs poorly at tool use, even in its own CLI and even in Antigravity. It gets into a mess just editing source files, it's tragic because it's actually not a bad model otherwise.

▲

parliament32 7 hours ago | parent | prev | next [-]

Self-hosted models are the one true path.

▲

bensyverson 9 hours ago | parent | prev | next [-]

Anecdotally, I know many people who have supplemented Claude with Codex, and are experimenting with models such as GLM 5.1, Kimi, Qwen, etc.

▲

irthomasthomas 9 hours ago | parent | prev [-]

I like chutes because they always use the full weights, and prompts are encrypted with TEE.