Thank you for coming on HN and offering to answer questions.[a]

This is a fantastic piece, very timely, evidently well-researched, and also well-written. Judging by the little that I know, it's accurate. Thank you for doing the work and sharing it with the world.

OpenAI may be in a more tenuous competitive position than many people realize. Recent anecdotal evidence suggests the company has lost its lead in the AI race to Anthropic.[b]

Many people here, on HN, who develop software prefer Claude, because they think it's a better product.[c]

Is your understanding of OpenAI's current competitive position similar?

---

[a] You may want to provide proof online that you are who you say you are: https://en.wikipedia.org/wiki/On_the_Internet%2C_nobody_know...

[b] https://www.latimes.com/business/story/2026-04-01/openais-sh...

[c] For example, there are 2x more stories mentioning Claude than ChatGPT on HN over the past year. Compare https://hn.algolia.com/?dateRange=pastYear&page=0&prefix=tru... to https://hn.algolia.com/?dateRange=pastYear&page=0&prefix=tru...

▲

ronanfarrow 11 hours ago | parent | next [-]

Thank you for this, very much appreciate the thoughtful response.

The piece captures some of the anxieties within OpenAI right now about their competitive position. This obviously ebbs and flows but of late there has been much focus on Anthropic's relative position. We of course mention the allegations of "circular deals" and concerns about partners taking on debt.

▲

cs702 9 hours ago | parent [-]

Thank you. Yes, I saw that. The company's always been surrounded by endless talk about insane hype, speculative bubbles, and financial engineering. I wasn't asking so much about that.

I was asking more about your informed view on how OpenAI's technology, products, and roadmap are perceived, particularly by customers and partners, in comparison to those of competitors.

If you have an opinion about that, everyone here would love to hear about it.

▲

Ericson2314 4 hours ago | parent | next [-]

Ronan Farrow's expertise is investigations into elite amorality, not evaluating technical products. Why are you asking this question?

	▲	cs702 3 hours ago \| parent \| next [-]
		I didn't asking him to evaluate them. I asked him how customer and partners perceive them. He's had so many conversations that he likely has a sense of how perceptions of the company and its offerings have changed. I'm curious.
	▲	bloppe 21 minutes ago \| parent \| prev [-]
		Much of the article and general palace intrigue is predicated on the idea that OpenAI has a singularly revolutionary product. If it later turns out to be a commodity, or OpenAI is simply outcompeted nonetheless, then the idea that Sam Altman's personal shortcomings are something to stress about would seem quaint. Just another hubristic tech billionaire acting in bad faith doesn't really pry attention the same way as someone "controlling your future".

▲

irishcoffee 4 hours ago | parent | prev [-]

My guess is that the answer to your question, fantastic question, is that nobody knows. I remember having the same thoughts when Covid was first “arriving” if you will: we wanted people in the know to throw us a nugget of information, and they just didn’t know.

As it turns out, and what I’m kind of going with for this LLM shit, is that it’ll play out exactly how you think it will. The companies are all too big to fail, with billionaire backers who would rather commit fraud than lose money.

▲

unsupp0rted 7 hours ago | parent | prev | next [-]

Many of us prefer OpenAI's Codex, because we think it's a better product.

No comment on the CEO: I just find the product superior in everything but UI/UX and conversation. It's better at quality code.

▲

mliker 7 hours ago | parent | next [-]

Who is “us”? It does seem that some scientists prefer Codex for its math capabilities but when it comes to general frontend and backend construction, Claude Code is just as good and possibly made better with its extensive Skills library.

Both codex and Claude code fail when it comes to extremely sophisticated programming for distributed systems

▲

keldaris 4 hours ago | parent | next [-]

As a scientist (computational physicist, so plenty of math, but also plenty of code, from Python PoCs to explicit SIMD and GPU code, mostly various subsets of C/C++), I can confirm - Codex is qualitatively better for my usecases than Claude. I keep retesting them (not on benchmarks, I simply use both in parallel for my work and see what happens) after every version update and ever since 5.2 Codex seems further and further ahead. The token limits are also far more generous (and it matters, I found it fairly easy to hit the 5h limit on max tier Claude), but mostly it's about quality - the probability that the model will give me something useful I can iterate on as opposed to discard immediately is much higher with Codex.

For the few times I've used both models side by side on more typical tasks (not so much web stuff, which I don't do much of, but more conventional Python scripts, CLI utilities in C, some OpenGL), they seem much more evenly matched. I haven't found a case where Claude would be markedly superior since Codex 5.2 came out, but I'm sure there are plenty. In my view, benchmarks are completely irrelevant at this point, just use models side by side on representative bits of your real work and stick with what works best for you. My software engineer friends often react with disbelief when I say I much prefer Codex, but in my experience it is not a close comparison.

	▲	ricksunny 2 hours ago \| parent [-]
		>As a scientist (computational physicist, Is there one that you prefer for, i dunno, physics?

▲

the__alchemist 3 hours ago | parent | prev | next [-]

Claude Code, Codex, and Cursor are old news. If you're having problems, it's because you're not using the latest hotness: Cludge. Everyone is using it now - don't get left behind.

	▲	outside1234 an hour ago \| parent [-]
		Cludge has been left behind by Clanker, that’s the new hotness. 45B valuation!

▲

zeroxfe 6 hours ago | parent | prev | next [-]

I'm in that camp -- I have the max-tier subscription to pretty much all the services, and for now Codex seems to win. Primarily because 1) long horizon development tasks are much more reliable with codex, and 2) OpenAI is far more generous with the token limits.

Gemini seems to be the worst of the three, and some open-weight models are not too bad (like Kimi k2.5). Cursor is still pretty good, and copilot just really really sucks.

▲

unsupp0rted 7 hours ago | parent | prev | next [-]

Us = me and say /r/codex or wherever Codex users are. I've tried both, liked both, but in my projects one clearly produces better results, more maintainable code and does a better job of debugging and refactoring.

▲

sampullman 7 hours ago | parent | next [-]

That's interesting, I actively use both and usually find it to be a toss up which one performs better at a given task. I generally find Claude to be better with complex tool calls and Codex to be better at reviewing code, but otherwise don't see a significant difference.

▲

SOLAR_FIELDS 5 hours ago | parent | next [-]

If you want to find an advocate for Codex that can give a pretty good answer as to why they think it's better, go ask Eric Provencher. He develops https://repoprompt.com/. He spends a lot of time thinking in this space and prefers Codex over Claude, though I haven't checked recently to see if he still has that opinion. He's pretty reachable on Discord if you poke around a bit.

▲

aswanson 6 hours ago | parent | prev [-]

Any difference in performance on mobile development?

	▲	sampullman 4 hours ago \| parent [-]
		For that I'm not so sure. I tried both early 2025 and was disappointed in their ability to deal with a TCA based app (iOS) and Jetpack compose stuff on Android, but I assume Opus 4.6 and GPT 5.4 are much better.

▲

rocketpastsix 5 hours ago | parent | prev [-]

yea Im not in this "us" you speak of.

▲

zem 5 hours ago | parent | prev | next [-]

I've found claude startlingly good at debugging race conditions and other multithreading issues though.

	▲	josephg 5 hours ago \| parent [-]
		My rule of thumb is that its good for anything "broad", and weaker for anything "deep". Broad tasks are tasks which require working knowledge of lots of random stuff. Its bad at deep work - like implementing a complex, novel algorithm. LLMs aren't able to achieve 100% correctness of every line of code. But luckily, 100% correctness is not required for debugging. So its better at that sort of thing. Its also (comparatively) good at reading lots and lots of code. Better than I am - I get bogged down in details and I exhaust quickly. An example of broad work is something like: "Compile this C# code to webassembly, then run it from this go program. Write a set of benchmarks of the result, and compare it to the C# code running natively, and this python implementation. Make a chart of the data add it to this latex code." Each of the steps is simple if you have expertise in the languages and tools. But a lot of work otherwise. But for me to do that, I'd need to figure out C# webassembly compilation and go wasm libraries. I'd need to find a good charting library. And so on. I think its decent at debugging because debugging requires reading a lot of code. And there's lots of weird tools and approaches you can use to debug something. And its not mission critical that every approach works. Debugging plays to the strengths of LLMs.

▲

7thpower 6 hours ago | parent | prev [-]

Not a scientist and use codex for anything complex.

I enjoy using CC more and use it for non coding tasks primarily, but for anything complex (honestly most of what I do is not that complex), I feel like I am trading future toil for a dopamine hit.

▲

bko 4 hours ago | parent | prev | next [-]

I also find Codex much more generous in terms of what you get with a Pro ($20/mo) subscription. I use it pretty much non-stop and I have yet to hit a limit. Weekly reset is much better as well.

▲

enraged_camel 5 hours ago | parent | prev [-]

Yeah, there are dozens of you. Dozens!

▲

brightbeige 15 hours ago | parent | prev | next [-]

He’s replying on this twitter thread - perhaps someone with an account can ask there and link his comment here?

https://xcancel.com/RonanFarrow/status/2041127882429206532#m

▲

jamiequint 10 hours ago | parent [-]

Here is the actual link, not a link to some weird third-party site that can't be trusted.

https://x.com/RonanFarrow/status/2041127882429206532

▲

rounce 5 hours ago | parent | next [-]

FYI xcancel is just a mirror that allows reading replies without needing an account.

▲

SwellJoe 8 hours ago | parent | prev [-]

Whereas X can be trusted?

▲

jamiequint 4 hours ago | parent [-]

Yes? It's the data source, not a third-party. How is this even a question?

	▲	minimaxir 2 hours ago \| parent \| next [-]
		There's pedantic, and then there's needlessly pedantic. xcancel is a valid workaround for X links on Hacker News and is sufficient for original attribution.
	▲	SwellJoe 2 hours ago \| parent \| prev [-]
		X restricts what you can view without logging in. Many folks don't want to log in to X, for obvious reasons. Posting an xcancel link is kinda like folks posting various `archive` URLs to bypass paywalls, work around overloaded servers, etc. That's an extremely common practice here that usually goes without comment.

▲

ed 6 hours ago | parent | prev | next [-]

It's worth noting Codex has 2x more stories than Claude https://hn.algolia.com/?query=codex

	▲	cloverich an hour ago \| parent [-]
		But by page 5, those stories have around 50-60 karma, while claude page five is still 500+ (i found your comment surprising based on my daily hn reading recollection - i mostly read top N daily and feel i only occassionally see codex stories).

▲

ATMLOTTOBEER 3 hours ago | parent | prev | next [-]

Yeah we moved to Claude a few months ago, mostly because the devs kept using it anyway. Altman stuff is interesting but at the end of the day you just go with whatever tool works

▲

georgemcbay 12 hours ago | parent | prev [-]

> You may want to provide proof online that you are who you say you are

Unfortunately it probably doesn't even matter here on HN considering how brigaded down this story is predictably getting.

But yeah, it was a fantastic piece.

	▲	dang 7 hours ago \| parent \| next [-]
		It wasn't getting "brigaded down" - it set off a software penalty called the flamewar detector. I turned that off as soon as I saw it.
	▲	ronanfarrow 11 hours ago \| parent \| prev [-]
		Fair request, here you go: https://x.com/RonanFarrow/status/2041203911697068112