Remix.run Logo
trueno 5 hours ago

noticing sharp uptick in "i switched to codex" replies lately. a "codex for everything" post flocking the front page on the day of the opus 4.7 release

me and coworker just gave codex a 3 day pilot and it was not even close to the accuracy and ability to complete & problem solve through what we've been using claude for.

are we being spammed? great. annoying. i clicked into this to read the differences and initial experiences about claude 4.7.

anyone who is writing "im using codex now" clearly isn't here to share their experiences with opus 4.7. if codex is good, then the merits will organically speak for themselves. as of 2026-04-16 codex still is not the tool that is replacing our claude-toolbelt. i have no dog in this fight and am happy to pivot whenever a new darkhorse rises up, but codex in my scope of work isn't that darkhorse & every single "codex just gets it done" post needs to be taken with a massive brick of salt at this point. you codex guys did that to yourselves and might preemptively shoot yourselves in the foot here if you can't figure out a way to actually put codex through the ringer and talk about it in its own dedicated thread, these types of posts are not it.

andai 19 minutes ago | parent | next [-]

Well, I can share my experience from a few days ago. Gave the same task (a major refactor) to both Claude and Codex.

Codex finished in 5 minutes, Claude was still spinning after 20 minutes. Also it used up all my usage, about twice over (the 5-hour window rolled over in the middle of the task, so the usage for one task added up to 192%). Codex usage was 9%. So, 21x difference there, lol

They're saying there's bugs lately with how usage is being measured, but usage being buggy isn't exactly more encouraging...

So I was on task #4 with Codex while Claude was still spinning on #1.

I didn't like the results Codex gave me though. It has the habit of doing "technically what you asked, but not what a normal human would have wanted."

So given "Claude is great but I can't actually use it much" and "Codex is cheap and fast but kinda sucks", the current optimum seems to be having Claude write detailed specs and delegate to Codex. (OpenAI isn't banning people for using 3rd party orchestration, so this would actually be a thing you could do without problems. Not the reverse though.)

Jcampuzano2 5 hours ago | parent | prev | next [-]

No, I assure you you are not being spammed because legitimately many people prefer codex over claude right now. I am one of those people. And if you go on tech social media spaces you'll see many prominent well known devs in open source say the same. And of course others praise claude as well.

At my job we have enterprise access to both and I used claude for months before I got access to codex. Around the time gpt-5.3-codex came out and they improved its speed I was split around 50/50. Now I spend almost 100% of my time using Codex with GPT 5.4.

I still compare outputs with claude and codex relatively frequently and personally I find I always have better results with codex. But if you prefer claude thats totally acceptable.

malfist 5 hours ago | parent | prev | next [-]

I don't know, I think java is the best programming language. I use it for everything I do, no other programming language comes close. Python lost all my trust with how slow it's interpreter is, you can't use it for anything.

^^^^ Sarcastic response, but engineers have always loved their holy wars, LLM flavor is no different.

solenoid0937 3 hours ago | parent | prev | next [-]

OAI marketing/PR in overdrive:

1. Subsidize compute unsustainably

2. Trick a bunch of people into thinking you're more pro-developer than the other guy [we are here]

3. Rug pull when you have enough market share.

rafaelmn 2 hours ago | parent | prev | next [-]

GPT 5.4 xhigh thinking was really good at teasing out problems in multi step flows of a process I was refactoring, caught higher level/deeper problems than Opus 4.6. However getting it to write the code is just not a good experience for me, it changes the style/does not follow surrounding code, codes in a sloppy way and creates subtle bugs that I don't see from Opus. So I use codex for review and opus to write code. Testing the new Opus 4.7 still to see if the review/reasoning catches more/better stuff. I frequently fire off all 3 (Gemini 3.1 pro, Opus, Codex xhigh) on same code than have them cross reference each other and stuff like that. Gemini is so bad it's not even funny, not sure why I keep it running.

agentifysh 5 hours ago | parent | prev | next [-]

i think you are being needlessly paranoid here

openai doest offer affiliate marketing links

the reason you see lot of users switching to codex is for the dismal weekly usage you get from claude

what users care about is actual weekly usage , they dont care a model is a few points smarter , let us use the damn thing for actual work

only codex pro really offers that

4 hours ago | parent | prev | next [-]
[deleted]
vessenes 4 hours ago | parent | prev | next [-]

I use and pay for both. Currently I use 4.6 (well as of yesterday) to do broad strokes creation. I use codex for audit. Generally first two or three audit cycles claude completes. There is often a subtlety that only codex can fix, but I usually do that at the end.

IME, codex is sort of somehow more .. literal? And I find it tangents off on building new stuff in a way that often misses the point. By comparison claude is more casual and still, years later, prone to just roughing stuff in with a note "skip for now", including entire subsystems.

I think a lot of this has to do with use cases, size of project, etc. I'd probably trust codex more to extend/enhance/refactor a segment of an existing high quality codebase than I would claude. But like I said for new projects, I spend less time being grumpy using claude as the round one.

Computer0 3 hours ago | parent | prev | next [-]

I use both but I find even the way the model writes in codex to be harder to read. The usage limits in Codex were very generous the past year until this week.

blueblisters 4 hours ago | parent | prev | next [-]

Yeah it's weird, almost like we're seeing two cults form in real-time.

I imagine there's a benign explanation too - the intelligence of these models is very spiky and I have found tasks were one model was hilariously better than the other within the same codebase. People are also more vocal when they have something to complain about.

In my general experience, Opus is more well-rounded, is an excellent debugger in complex / unfamiliar codebases. And Codex is an excellent coder.

enraged_camel 4 hours ago | parent | prev | next [-]

>> are we being spammed? great. annoying.

Yeah, very. Every single time this happens here, where there's a thread about an Anthropic model and people spam the comments with how Codex is better, I go and try it by giving the exact same prompt to Codex and Opus and comparing the output. And every single time the result is the same: Opus crushes it and Codex really struggles.

I feel like people like me are being gaslit at this point.

6thbit an hour ago | parent [-]

this is exactly how the other side feels

frankdenbow 5 hours ago | parent | prev [-]

we arent bots because we disagree with you. I switch between codex and opus, they have their differing strengths. As many people have mentioned, opus in the past few weeks has had less than stellar results. Generally I find opus would rather stub something and do it the faster way than to do a more complete job, although its much better at front end. I've had times where I've thrown the same problem at opus 4/5 times without success and codex gets it first shot. Just my experience.

solenoid0937 3 hours ago | parent [-]

If you comment on a post about a new Anthropic model within a couple hours of release and say "well I prefer Codex!", I hate to say it, but you're little different from a bot.

frankdenbow 3 hours ago | parent [-]

So what am i then? i only replied to someone claiming people are bots for having an opinion. I use opus regularly and its great.