I think Anthropic and OpenAI have found product-market fit

▲ I think Anthropic and OpenAI have found product-market fit(simonwillison.net)

328 points by simonw 3 hours ago | 413 comments

▲ trjordan 3 hours ago | parent | next [-]

They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down.

This means we're going to need $1t+ per year in spending, per year, on tokens. 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

▲

whatshisface an hour ago | parent | next [-]

Here are a few thoughts:

- The publicly available information about how inference costs compare to training costs is conflicted. EEs involved in datacenters talk about power usage spikes during training runs as if they were a major factor in the designs, but academic papers discussing cost-optimal scaling confidently treat inference-time compute as a major factor.

- On the side of the balance indicating that training is more compute-intensive after amortization than inference is that Chinese providers, constrained primarily by access to compute, have nearly unlimited token availability at a lower price than US providers (inference), but poorer model capabilities (training). That would make sense only if US providers are inflating inference costs by 20-30x due to amortized training costs that overseas providers were not able to take on (there are other factors too).

- If training >> inference, they're in a prisoner's dilemma that far exceeds the ordinary zero-marginals model of competition between firms (due to its huge discrete stepwise nature). On the other hand, if inference>>training, the high-level analysis popularized by certain thought leaders, that it's like a utility, would be true. You'd tend to count this as a vote for inference>>training, but the CEOs saying it at least have a huge incentive to agree because the alternative, the prisoner's dilemma, would stop investment very fast.

- The only voice in the story that I just told you to have anything to do with fact (as opposed to high-level analysis and ivory tower armchair management of a secretive business) were the rumors from facilities engineers. That shows you the state of our understanding...

- If we don't even know the ratio between amortized capital expenses and operational costs, outside investor analysis is impossible. It doesn't matter how finely they divide the accounting buckets for office ferns and indoor ferns if the single biggest part of their business is obscured for trade secret reasons.

▲

materielle an hour ago | parent [-]

I'm about to leave a shallow comment, but I am a bit skeptical of the supposed drop in inference costs. If AI labs saw a lot of potential there, they'd surely be bragging about it non-stop? So the fact that publicly available information is conflicted is probably a sign that at the very least, the numbers aren't amazing.

Yes I know there's no evidence and this is lazy reasoning. But there's probably a bit of truth to this line of thought.

▲

Tuna-Fish an hour ago | parent | next [-]

Why on earth would AI labs be bragging about how little the product they sell actually costs them to make? You don't want to do anything that reduces it's perceived value to the user, that might make them less willing to pay for it.

Also, inference costs are bound to go way down with more optimized architectures. GPUs are fundamentally not great at inference. No platform where the weights are streamed from a large pool of memory is. If the models ever quiet down, there will be massive step changes in cost/token, energy/token and tokens/second, as models are etched into silicon ala https://chatjimmy.ai/

▲

golem14 30 minutes ago | parent [-]

Why would any company brag about their margins ? Yet they do, to attract investors.

▲

Tuna-Fish 24 minutes ago | parent [-]

The key AI labs are not public companies, they are at liberty to brag about their margins to potential investors in private.

	▲	bwhiting2356 10 minutes ago \| parent [-]
		this is changing soon

▲

whatshisface an hour ago | parent | prev [-]

Inference has traditionally been far less expensive than training. One public example is the fact that hobbyists can run StableDiffusion ($600k training costs[1]) on their personal computers.

Speaking to your point, inference being dramatically less costly than training would not be seen as a delta from the norm. The model of providing inference for anything near the operational costs (like a utility would), would the delta from the norm if it were true.

[1] https://x.com/emostaque/status/1563870674111832066

▲

FuriouslyAdrift an hour ago | parent | prev | next [-]

I work for a tiny little company ($150MM annual rev with 9% net) and we are already looking at dropping $100k on hardware to run local models because, for us, they're "good enough."

Our estimated spend for AIaaS would exceed that cost in less than a year.

In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.

▲

simplyluke 38 minutes ago | parent | next [-]

Yeah, that's the part that just seems to be wildly under-discussed to me.

If open source models are ~3-6 months behind SOTA, and ~opus4.6 capabilities are good-enough for product market fit, do the frontier labs have half a decade to catch up on their prior burn?

AI cost ballooning faster than companies can afford is becoming a very common topic in my circles right now. The era of "I'll pay infinitely more for marginal gains" is over from what I can tell.

▲

svara 12 minutes ago | parent | next [-]

There's still a lot of room for the best models to get better at coding .

Your argument rests on the "for marginal gains" part but it's really not clear that the gains are marginal in the foreseeable future.

▲

doug_durham 19 minutes ago | parent | prev [-]

Open source models that you can run locally are much more than 3 to 6 months behind. 6 months was the November inflection for Claude. No open source model is as good as Claude Opus 4.6.

	▲	PeterStuer 4 minutes ago \| parent \| next [-]
		Many business tasks do not need the latest frontier models. I have a production system running since early GPT-4o. It now runs with GPT-5.2, not for improvements, but because it is cheaper. I could invest in switching to a local model, I tried and it works well enough, but api costs for this task are so low, it barely scratches $30/month. So I am using the local machine for other things and leave the inference on OpenAI, for now.
	▲	simplyluke 16 minutes ago \| parent \| prev \| next [-]
		> that you can run locally That's doing a lot of work here. The future I see isn't most companies buying hundreds of thousands in hardware to run models, it's them adding a line item to their AWS bill. Inference costs on the larger hosted open source models are dramatically lower than the frontier labs API pricing.
	▲	PunchyHamster 8 minutes ago \| parent \| prev [-]
		But one will be in few months. And then you have choice of paying say $100k for hardware and pay just power cost (or pay someone to do that for you), or pay way, way more for your team to have access to marginal improvement. And 5% worse model for 10% of the price of the bleeding edge will be worth it for majority of people

▲

EvanAnderson an hour ago | parent | prev | next [-]

> ...we are already looking at dropping $100k on hardware to run local models...

Just think how much further that $100K would have gone if the hardware market wasn't so screwed-up.

Anecdote: I priced-out adding 1TB of RAM to a four node cluster a couple months ago. The cluster was purchased in fall of 2024 w/ 4 nodes, each with 256GB RAM. The nodes cost just over $14K apiece back in 2024 (entire box, not just the RAM).

Dell wanted >$90K a couple months ago to add 256GB to each node.

▲

arbuge 28 minutes ago | parent | prev | next [-]

> In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.

What makes you so confident about this prediction? Hardware costs haven't exactly been cratering recently.

▲

cmdrk 20 minutes ago | parent | prev | next [-]

Do you think this will be a trend for larger companies as well?

The decadal move to all-cloud-all-the-time killed off in-house hardware teams while the C-suite chased their OpEx dreams.

It would be interesting if we come full circle on this.

▲

alex_suzuki an hour ago | parent | prev | next [-]

I’m curious: are you spending on beefy developer machines, or some kind of shared local inference server? Would be interested to know more if it’s the latter.

	▲	irishcoffee 29 minutes ago \| parent [-]
		I am aware of at least a handful of companies doing the latter. I don’t work for them and cannot speak to their setup.

▲

mv4 22 minutes ago | parent | prev | next [-]

I configured a dual DGX Spark cluster, and it's certainly "good enough" for my agentic and coding needs.

	▲	datadrivenangel 10 minutes ago \| parent [-]
		what models are you using on that? My experiences with apple hardware have convinced me that it is not really good enough for coding locally.

▲

nonethewiser an hour ago | parent | prev | next [-]

What models? Last I tried different local modals there was a pretty big difference from frontier.

▲

awesome_dude 36 minutes ago | parent | prev [-]

> In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.

I was going to say - the models are just going to keep growing at a pace exceeding the pace of hardware pricing/availability

But then I realised that, far more likely, there will be a plateau reached (again) where nobody is seeing gain, and at that point hardware will catch up

▲

alexpotato 13 minutes ago | parent | prev | next [-]

I was in college in the late 1990s/early 2000s and I distinctly remember an econometrics professor state the following:

"As cable TV and Pay Per View came out, there were studies done about how many movies people would watch if given unlimited access to films. The results were bandied about as proof that we should build out all this infrastructure to support this line of business. When the data was further analyzed by statisticians etc, it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible."

I feel like we are in a similar boat here where some people are assuming:

- EVERYONE is going to be using max tokens

- tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc

▲

regularfry 2 hours ago | parent | prev | next [-]

The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build. The more of the latter they can take on, the fewer knowledge workers are needed at all. So rather than 5% of every knowledge worker's salary going into tokens, 100% of the knowledge worker's total employment cost goes into tokens and you get a 20x productivity boost as a theoretical minimum across those tasks.

That's the game. There's a view you could take of this that this is just a growing of the pie: with those cost dynamics a lot more "small businesses" get a vast amount of leverage, so the overall economy grows without replacing the knowledge workers. I'm not sure I trust the MBA class to have that view.

▲

seanp2k2 2 hours ago | parent | next [-]

>The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build

I would argue that that's been the case for quite some time before AI. As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade with their very high numbers of very talented and highly-compensated engineers? The issue with most big tech companies are leadership, strategy, and product direction. I'm not saying that they don't make any profits, just that they probably aren't "building [the right thing]".

AI for product development and management would be far more impactful than automating rote coding tasks / building React UIs that mirror API structures IMO.

▲

Figs an hour ago | parent | next [-]

> AI for product development and management would be far more impactful than automating rote coding tasks [...]

Yeah, if this stuff actually worked that well already, OpenAI et al. would just run AI CEOs and engineers. Why get some other company to pay you at all when you can automate every other company out of existence and take all the money they make?

The fact of the matter is that while the tech has some uses, it sure as hell isn't a full scale replacement and you almost always actually have to massage the input into LLMs to get anything decent back out in practice. Some CEOs and managers can learn to do this, of course, and some already are... but that quickly turns into a second full time job. A "programmer" is still needed. The job might change from mostly hand-writing C++/JS/Python to prompt engineering + some manual coding to fix all the stupid fuck-ups that the bots can't solve themselves, but you still need someone to actually prompt the bot.

When that changes, it won't just be engineers losing work; there will be no reason to even have a human CEO any more.

▲

aspenmartin an hour ago | parent | prev | next [-]

I don't know, if you've ever tried to build something at companies of that scale you run into incredibly boring problems "what data table do I need for X" and "who is the right person to reach out to for Y" and "they aren't answering me I guess I'll have to escalate"

I don't think there is any shortage of great ideas at these companies, they are just extremely bloated. And I don't think its something like indecision or bad PMs, it's "we have a finite amount of time and resources so we need to be conservative but also not too conservative"

If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick.

It changes the cost/benefit calculus of the entire business. I think you are exactly right in that: PMs/leadership are by their nature orchestration machines. Other roles are as well, but I think PM's are at a particular advantage here in that it will be quite awhile I would expect before core product decisions and creativity can be delegated to an AI, but not quite awhile until virtually everything that they're blocked on (legal approvals, POCs, wire frames, etc etc etc) will become less and less of a blocker

▲

supern0va an hour ago | parent | next [-]

>If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick.

I'll also add this: within a large organization, you often need to interact with many different codebases owned by many different teams. Agents have made it much easier to wrangle by having the ability to deploy one to scope out your web of dependencies to learn about what would be needed for feature X, and how that integration can happen.

We've been doing far more away team work simply because it makes things move faster. It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work.

It genuinely is helping things move faster inside large organizations. Or at least, it is for us, particularly since we're getting organizational prioritization to actually build the scaffolding to make those agents more effective at search.

	▲	aspenmartin an hour ago \| parent [-]
		> It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work. 1000x yes: you have touched on what I think is the single biggest factor here, that is the humongous value of POCs. they are gnarly to build without agents, and so we used to have to get everyone on board so we didn't get screwed in performance reviews, which was monumental task because that means convincing very busy PMs who have a lot on their plate and dont want to take risks on things they don't understand, and now it's like "can we scale this out" and you have a very nicely formatted proposal and POC. It de-risks things very quickly

▲

skydhash an hour ago | parent | prev [-]

Pieces of concept and other prototypes have always been cheap (see hackatons). The main issue is that as soon as you’re touching customer data or modifying process they’ve paid you for, you have to be really careful. No one wants to be responsible for an outage that cost the company its biggest customer.

▲

aspenmartin an hour ago | parent [-]

Yes, but at scaled companies, where building a simple POC hooked into real systems is nowhere close to easy. To the point where it means that you might as well just do the full thing. That's where the enterprise spend and the impact is.

▲

skydhash 41 minutes ago | parent [-]

Isn’t that a matter of configuration management? Or do you want to alter the real systems as well?

	▲	aspenmartin 23 minutes ago \| parent [-]
		historically it's been a matter of an absolutely horrific amount of Kafka-esque problems. Say I want to build a feature in a product. - DS has to do a deep dive (need buy in) to opportunity size and derisk with data. That DS has to work with other DS (people may have left or moved teams) to figure out how to get the right data and figure out what the difference is between 10 different tables that have overlapping but inconsistent data. - Eng has to build up an actual simple demo (need buy in) - Design has to make it not hideous (need buy in) - Legal has to review what you're doing; POCs should involve real data where possible because otherwise no one will trust it, even if its just for user analysis on existing products This plus about 6 internal system bugs for custom tools that are flaky and who's team has long been re-orged or laid off, 8 people who won't answer you, 2 PTO's for the stakeholders, 6 weekly meetings no one did POCs, they just had ideas and tried to get PM's to put it on the roadmap so if it fell through at least it was bought into

▲

nostrademons 15 minutes ago | parent | prev | next [-]

Google's internally developed and sometimes even launched plenty of innovative new products in the past decade. Stadia, Fuchsia, federated learning, and the whole transformer architecture that underlies this AI boom are good examples.

The problem is they get killed by some other executive who is afraid of their department looking bad by comparison.

I think this is fairly illustrative of the challenges in AI becoming as impactful as the Internet. The bottleneck is not making things. There are plenty of people who are really good at making things and can easily be 10x or 100x as productive as the average corporate worker. YCombinator was founded on that premise - small teams of founders and early employees could be orders of magnitudes more productive than the 1000s of corporate employees at their competitors.

The bottleneck is on bringing your product to market. If your innovative new product is built within a corporate environment, it'll get killed unless the executive you work under can get a promotion out of it, and you'll be denied all sorts of help with approvals, launch process, PR, marketing, branding, etc. If it's a startup, they'll try to shut you out with exclusive distribution deals, legal threats, lobbying efforts to change the legal environment, PR campaigns, FUD, etc.

The Internet was revolutionary because it let millions of people bring products to market without asking permission. Instead of having to bid for retail shelf space among dozens of entrenched competitors that all had sweetheart deals with the retailer, you could just put up a website and sell it to anyone across the globe. Instead of following hundreds of regulations that governed existing commerce, you could just launch something and sort it out later. AI doesn't really have that property - if anything, it makes things more centralized, with more gatekeepers, and so seems more likely to destroy economic value than add to it.

▲

nonethewiser an hour ago | parent | prev | next [-]

>I would argue that that's been the case for quite some time before AI.

I would agree but it's really minimized the building. More and more time is being spent on pre-coding work.

▲

beambot an hour ago | parent | prev [-]

Google & Meta are illustrative of late-stage capitalism -- it's all about distribution, not innovation. Their job is (mostly) to just acquire the products that have passed the gauntlet, then scale up their monetization through their distribution-focused machine. The same dynamic plays out in virtually every industry (not just tech).

You'll find that most internal "innovation" teams are just lip service. In most cases, the "mothership" will be incapable of reproducing true innovation -- from a statistical perspective, culture perspective (mega corps are anti-scrappy; internal politics), and motivation perspective (startups aren't 9-to-5). It's much easier to have big M&A budgets, a VC arm, and some handwavvy internal innovation group.

Every now and again, you'll get real innovations (Waymo, transistors, GUIs), but even those have a spotty track record of commercialization when created internally.

▲

cogman10 an hour ago | parent | prev | next [-]

This is the same argument that has been historically made for outsourcing developers. Get 20 more devs for the cost of 1 dev in the US.

I suspect that AI will fail to pan out to the same extent for the same reason why outsourcing hasn't fully panned out (even though every company tries it after getting big enough).

The problems that will come up will be and always have been ongoing maintenance. AI is great at writing new code without a brain behind it, but once you get to the point where you need to refactor code, you start really needing someone with coding experience to guide the AI or veto it's mistakes.

I don't think that's really fixable even with a lot better AI. It's not something that ultimately comes out of the likes of github data.

I'm not saying that AI isn't going to make things better, btw, I just don't think we'll see a 20x improvement. Probably more like 1.5 or 2x.

▲

roncesvalles an hour ago | parent [-]

Outsourcing of knowledge workers didn't work out because at large enough scales, the geographic arbitrage disappeared. Companies mostly always got what they paid for.

The determinant of success was only whether the task needed American-tier labor or could make do with sub-American quality labor.

▲

m1coti 40 minutes ago | parent | next [-]

I am not sure this feels right. I agree that the US currently has leading minds in terms of tech, but I am not sure it is too big of difference with the EU knowledge workers. EU is still a lot cheaper then US in terms of wages you would need to pay.

▲

cogman10 an hour ago | parent | prev [-]

That's certainly part of it. But the other part that I've heard time and time again is that in order for outsourcing to be successful you basically needed an american engineer in the mix hand holding everything, clarifying requirements, and vetoing bad code.

That part of dev work, the requirements gathering, attention to details, clarifying requirements, is something AI also struggles with. A lot of companies basically waste time and money on outsourced devs because without a clear path forward they effectively will sit and do nothing, waiting for a prompt.

	▲	m1coti 24 minutes ago \| parent [-]
		I would not agree on that point. It really depends on company's structure. I mean it also depends with people that makes the team. I would say there are a lot of unknowns but I would certainly not generalize. How I find your argument is that one distinguished engineer from US could do the same with the use of AI. I worked with both and I know great and bad engineers from both sides. Only thing is that US has a bigger pool of great engineers.

▲

layer8 2 hours ago | parent | prev | next [-]

Who pays for that value, and from what, if all knowledge workers lose their jobs?

It sounds like the economy would largely reduce to the small minority class of independently wealthy people.

▲

simonw 2 hours ago | parent | next [-]

The more time I spend using agent tools the less I worry about knowledge worker job loss.

It takes a skilled knowledge worker to use these things.

	▲	keeda 19 minutes ago \| parent \| next [-]
		Yes, but I do worry about junior knowledge worker job loss. These models are very good (and getting better) at the vast dark matter of "donkey work" that happens in knowledge-based industries -- work typically done by junior devs / analysts / lawyers / consultants, paralegals, admin assistants, customer success / support, etc. -- and those roles comprise the bulk of the workforce. And worse, these are the tasks that help the junior people eventually grow into the skilled knowledge workers required to operate models, so there's a pipeline problem too.
	▲	kansface an hour ago \| parent \| prev \| next [-]
		We'll get around to training job specific models or the equivalent. Thats just lower on the value chain for now.
	▲	layer8 an hour ago \| parent \| prev [-]
		Sure. I was challenging the parent on how the “game” they are positing would play out.

▲

whatshisface 2 hours ago | parent | prev | next [-]

There were no knowledge workers in the middle ages.

▲

wongarsu an hour ago | parent | next [-]

Back then people were mostly farmers, but we already automated that job away.

Not completely, but compared to the middle ages we 50x'd their output. Which is a great illustration what it means to make a job 50 times more productive. We went from 80-90% of the population being required to barely make enough food for everyone to survive, to 4% of the population producing such an abundance that consuming too much food has become a systemic health issue

	▲	fodkodrasz 42 minutes ago \| parent [-]
		At the mere cost of destroying soil, and polluting water and the atmosphere in only 200 years! Possibly this will also play out well, and there is a huge market of... maybe social media influencer economy to pick up those being automated out of their previous work... or rather identity, as actually much like in the middle ages, the modern world also makes the profession largely the identity of the individual. I'm pretty skeptical on the outcomes and the costs also (natural and social as well), but possibly we can have 50x or even more software in the end! The phrase will be truer than ever: > Software is eating the world!

▲

thewebguyd an hour ago | parent | prev | next [-]

There definitely were what could be considered knowledge workers in the (high) middle ages, it just wasn't the majority of work like today. The knowledge workers then were just a tiny, elite faction, mostly employed by the church or directly by nobility. Kindgoms were still big bureaucracies and needed scribes, theologians, academics, lawyers.

▲

jrochkind1 an hour ago | parent | prev | next [-]

Relatively few anyway. Monks (who wrote and edited books and managed libraries, and also taught), artists and musicians, bookkeepers/treasury/exchequer, scribes/chancery (who were the administrators of the kingdoms), and bankers all existed in European "middle ages". But a significantly smaller part of economy/society compared to "western world" now, yes.

▲

layer8 an hour ago | parent | prev | next [-]

There wasn’t 20x value to pay for in the middle ages either.

▲

skydhash an hour ago | parent | prev [-]

Are you sure? Any functional organization requires keepers to oil the machine. First the government. The best examples were the chinese empire, the catholic church, and the various kingdoms. Or do you think that everyone was either fighting or farming? Stewardship is knowledge work. Bookkeeping is another.

▲

rvz 2 hours ago | parent | prev [-]

> Who pays for that value, and from what, if all knowledge workers lose their jobs?

They do not care unless these companies can get a bailout.

UBI only exists for companies that are too big to fail. Case in point, 2008 and SVB when there was too much money on the line.

One of the AI companies attempted to guarantee themselves a way for the government to bail them out if they were close to defaulting on the debt from the data center build out.

▲

mikeocool an hour ago | parent [-]

SVB didn't get bailed out, their investors and creditors were wiped out. You could argue depositors were bailed out -- as they took the undue risk of keeping more than $250k in a single bank (though as part of a requirement for getting a loan from SVB, you had to have your operating accounts with them. So lots of companies had no choice, as SVB was one of the few banks that would lend to them).

Arguably, the main impact of securing SVB depositors above the $250k limit is that it prevented thousands of people from being laid off that week, as their employers wouldn't have had the money to make payroll the following Wednesday.

	▲	matwood 35 minutes ago \| parent [-]
		Thank you for saying this. Continuing to point at SVB as a bailout is annoying. They were not bailed out. Anyone with deposits in an accredited bank should be made whole - always. Without trusted banking we have no economy.

▲

kmac_ an hour ago | parent | prev | next [-]

Producing a thing has always been cheap since personal computers existed. From mail-order software companies' times to SaaS times, producing a sellable MVP was an initial cost that is relatively small compared to the later cost of expansion and maintenance. Marketing and selling was and still is the hardest part.

▲

roncesvalles an hour ago | parent | prev | next [-]

Why do you think of knowledge workers as a fungible commodity?

What makes you think the people who used to build (or would have built) software will switch into the industry of "knowing that the thing was the right thing to build", as opposed to something cooler like surgery, city planning or experimental physics? The roles within a tech company are not the only jobs in the world.

▲

OtherShrezzing an hour ago | parent | prev | next [-]

> The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build

“There’s more capital than good ideas to fund” has been a complaint from the likes of A16z & other VCs for a long time now. It’s why we ended up with stuff like NFTs getting funded.

▲

radicaldreamer 2 hours ago | parent | prev [-]

If knowledge workers get laid off in mass, you can expect political curbs on AI adoption.

▲

datsci_est_2015 8 minutes ago | parent | prev | next [-]

I could see such productivity gains being possible, if only because the current tooling around LLMs is terrible. The fact that we have 30 blog pieces per day making the front page of Hacker News about someone’s convoluted system to guide LLM output to something reasonable is absurd. There needs to be standardization in tooling, and it needs to be open source. Then, and only then IMO, will we see huge productivity gains.

But, at that point I think the big players’ moats will have dried up. Local models will probably be sufficient for 99% of daily office worker tasks.

So I disagree with TFA’s premise. I think this fear is probably shared amongst the LLM giants, and they’re still hoping that neural network transformers are somehow the path to AGI (probably not, imo).

▲

spamizbad an hour ago | parent | prev | next [-]

I will also tell you, as someone who works at a company that's trying to remain profitable, that token spend has caught the eyes of finance and much like cloud spend they've already started applying pressure to control costs. This May my team is protected to use 30% fewer tokens than we did in April - this was by intention. I suspect we'll drop more in June.

▲

tedggh 10 minutes ago | parent | prev | next [-]

Also, not all developers work on software products. The vast majority of developers work supporting software solutions as part of a much bigger business model, such as infrastructure, industry, healthcare and services. Many of these are complex organizations. So, unless you get to turn every employee into a 10x employee, the 10X coder along won’t necessarily make a 10X productivity contribution. What’s likely going to happen is the 10X coder will start to slow down or adding more (unnecessary) complexity to avoid having to sit and wait on overhead, for other areas of the business which are not easily automated away to AI to catch up. As a developer I can finish my project in June instead of December, but what if the customer is still not ready for integration until December? what do I do?

▲

mv4 23 minutes ago | parent | prev | next [-]

If people figure out how to run agents on-prem (already becoming feasible for both agentic tasks and coding on consumer hardware like Mac Studio 128GB+ or DGX Spark with some models) these companies will be in deep trouble.

Privacy is also a huge issue.

▲

jkelleyrtp 30 minutes ago | parent | prev | next [-]

I agree in principle with the math. But I believe that in reality if revenues don't show up quickly, then lenders will just restructure the debt and defer the payback period. Similar to SF commercial real-estate; many buildings should've come due during the depressed covid market, but lenders (banks) were willing to delay payment until the market picked up again.

The scale of these investments put the lenders at substantial risk, so the lenders will do anything to make it work. If the current lenders will be damaged by extended payback periods, they can simply sell the debt to someone else who won't be.

▲

jgbuddy 2 hours ago | parent | prev | next [-]

You are making the assumption that the models are only used / paid for by 2.5% of the population (your knowledge workers value). There will be new value created by these models which people are happy to pay for which simply did not exist at all before. It is also naive to say that the hyperscalers are going to be expecting a return on this in 5 years, it will be entirely propped up by investments / IPOs as has been the case with any tech company for decades now to reach scale. The hyperscalers are currently spending ~650b combined annually, which they have the cash for and can sell in future compute instantly.

▲

specproc 2 hours ago | parent | next [-]

I'm sorry, what the feck does "value creation" mean here? I live in a place where people are so, insanely squeezed from every angle. Wages are stagnant, prices rocketing. Where is the money to pay for this value going to come from?

No one I know feels richer than they did a decade back. I've not been able to meaningfully put up my prices for a decade. People are tired and stressed and scared, particularly scared of a technology everyone keeps telling them will make them redundant.

There is no rising tide lifting all boats, just most of us drowning whilst a few whizz past in their yachts.

I honestly hope these guys faceplant ASAP. Couldn't happen to a nicer bunch of people.

▲

dirck-norman an hour ago | parent | next [-]

Feelings aren’t fact. A lot of data shows the doomerism is not reflected in the actual numbers and much of it has to do with rapid inflation and continued vibes.

Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike. Most social mobility has been the middle class moving into the upper middle class, not moving to the lower class.

The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses.

Value creation is growth. If it didn’t exist the S&P would still be 42.55$.

	▲	jacobgkau 18 minutes ago \| parent [-]
		> Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike. My wages haven't risen for nearly 5 years, while inflation has occurred over the past 5 years. Why the blanket statements? > The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses. Are you suggesting a "housing crisis," in your words, wouldn't impact consumption? I'm watching my spending (and living like a child in his parent's house, except it's not my parent and I have to pay for it) in the hopes that in about a decade, I'll have saved up enough of a down payment for a home somewhere in my state that I could actually afford the mortgage on the remaining amount. There are plenty of things I'd potentially spend money on but won't as long as I feel like I'm economically stuck and have a chance in hell of saving my way out of it. So this feeling translates to fact. If you think my personal experience is just an anecdote and doesn't count because it's not being told through the lens of large-scale numbers, fine. But I really agree with the person you replied to that you're gonna have to be a whole lot more specific than "value creation" if you want people to spend money on your AI products "in this economy," whether it's because they're actually strapped for cash or just pretending like you seem to think they are.

▲

WarmWash an hour ago | parent | prev | next [-]

Sounds like internet sentiment and not research data.

It's kind of become socially taboo to not be suffering "in this economy", but on paper it's hard to see weakness in places that there isn't always weakness. As long as the 65-95% are doing well, there isn't going to be a collapse.

	▲	forlorn_mammoth 42 minutes ago \| parent [-]
		The most recent U Michigan 'Survey of Consumer Sentiment', which is THE authorative source in the US, shows consumer sentiment at the lowest levels since the survey started in 1977 From the U Michigan page: https://www.sca.isr.umich.edu/ or from the FED https://fred.stlouisfed.org/series/UMCSENT

▲

jgbuddy an hour ago | parent | prev | next [-]

A literal example is that I can use AI to file my taxes instead of spending a weekend and hundreds of dollars to have an accountant do it for me. It costs me like $5. that 245$ delta is the value of that output to me, as long as I am confident it is correct.

▲

mfuzzey 33 minutes ago | parent | next [-]

Seems to be a thing in the US to need specialised software, an accountant or AI to file taxes.

In most of Europe individuals at least don't need any of that. I'm in France and it's just a connection to a government run website to enter a few figures, takes less than an hour most of it is already pre-entered (salary etc), the main thing to add manually is charitable donations.

If you're running a business then yes an accountant can be good (or be required depending on the legal form of the business) but not for individuals.

▲

moduspol 21 minutes ago | parent | prev | next [-]

Part of the value of paying an accountant is that you can get representation in case you are audited. Though I guess you did say you were confident it is correct.

▲

WarmWash an hour ago | parent | prev | next [-]

I did my taxes this year too with 5.5 and 3.1

Otherwise normally costs around $800 to do, because I have a small business too.

▲

smnc an hour ago | parent | prev [-]

> as long as I am confident it is correct

Are you? Does it cost you extra (time or money) to be?

▲

jgbuddy an hour ago | parent [-]

Yes, and they were accepted. A year or two ago I would have been less confident but now almost UX is happy to cite sources.

	▲	redfern314 3 minutes ago \| parent [-]
		Not speaking to the wisdom of filing taxes using LLMs, but just FYI (assuming US here) taxes being accepted doesn't mean they were correct. It just means the IRS hasn't found anything major wrong (e.g. SSN used on multiple returns). Even being approved isn't a guarantee, an audit could come later.

▲

deaton an hour ago | parent | prev [-]

Thats the thing; the "increase in productivity" isn't being felt by the general public, the end user. If your "increase in productivity" just means more money being shifted around at the corporate level then it is meaningless.

▲

mrandish an hour ago | parent | prev | next [-]

> There will be new value created by these models which people are happy to pay for which simply did not exist at all before.

True, but I think the GP's point was that what consumers will pay won't be nearly as profitable as what enterprises will pay to increase the output of their developers and knowledge workers. ChatGPT is currently the overwhelming leader in consumer AI usage but only ~5% pay $20/mo.

As a recently retired serial tech founder, I'm now one of those consumers. I use AI webchat daily for general search, Q&A and even to write little automation scripts for myself, yet I haven't paid anyone anything for AI yet. Even after being heavily restricted and performance nerfed to hell in recent months, free webchat AI is still fine for everything I do, and I'm not remotely price sensitive.

Even as AI compute costs fall over time, I doubt serving ads against AI webchat to consumers will generate the kind of high-margin, sustainable growth VCs get excited about. It's so undifferentiated I bounce around between all four leading providers because there's virtually no moat locking casual consumers to any chatbot beyond a single question thread. I guess if it had a nearly infinite context window seamlessly integrated across all sessions, that might be somewhat sticky for some consumers but it could also get creepy for some others - and it would devour gobs of the scarcest resource in AI. Beyond Maslow's Hierarchy of Needs, the mobile phone is the largest revenue, long-term mass consumer product ever but I just got a new flagship phone from a top-tier provider for $30/mo over 3 yrs. IMHO, even an all-you-can-eat, infinite context window, next-gen Mythos couldn't reach and sustain mobile phone levels of global consumer adoption at ~$20/mo. Unlike professional developers and knowledge workers, consumers don't have any "job to be done" big enough for an LLM to command that much of their zero-sum discretionary spend.

	▲	jgbuddy 35 minutes ago \| parent [-]
		100%, a driving factor will likely be how good we can make models that are so small they use almost no compute. Until then it is a race for adoption and moat-building (or screwing people over?) once you have users

▲

Planktonne an hour ago | parent | prev [-]

> There will be new value created by these models which people are happy to pay for which simply did not exist at all before

What sort of new value, and why will people pay for it from someone else rather than prompting for it themselves?

▲

jvanderbot an hour ago | parent | prev | next [-]

Hey, I wrote this down one time. I estimated way higher yearly revenue required, to be adversarial. And you can keep the "cost per unit AI work" a parameter and play with the results.

But the point is that if people are willing to delegate part of their salary (e.g., buy consumer products), vs requiring employers to pay for the tokens, then it's quite possibly a net win. Something like "I pay a largeish fee every month to make my own job much easier", similarly to how we buy a car to make commuting easier.

https://jodavaho.io/posts/ai-jobpocolypse.html

▲

onlyrealcuzzo 2 hours ago | parent | prev | next [-]

> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens.

They are assuming ~10% global GDP growth instead of ~3%. You probably don't need the same %s if the pie grows a ton.

I'm highly skeptical we get that growth, but if you aren't, it makes it easier to digest.

▲

freakynit 2 hours ago | parent | next [-]

I mean this case with AI-productivity fires itself back when we talk about GDP.

The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently.

A third effect also comes into play that once all this starts to happen, common people, who are generally living paycheck to paycheck, will now start to hesitate towards making any long term investment, housing included. And that indirectly will end up impacting financial and banking sector, which will then impact existing savings, bonds yields and retirement funds, and the recession-like cycle starts.

This productivity increase only makes sense if it is capped to a very small number.. like 20% max. Beyond that, who these companies will even be selling to?

Am I overthinking all this?

▲

onlyrealcuzzo 3 minutes ago | parent | next [-]

> The more AI causes productivity increases, the less and less number of workers will be needed.

Why does this have to be the case with AI but it didn't have to be (and wasn't) the case with the steam engine, electricity, the automobile, or the computer & internet?

Certainly, AI could be different.

It's curious to me why the vast majority of people on here think it must be different.

▲

seanp2k2 an hour ago | parent | prev | next [-]

>The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

>Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently.

Big tech companies can't even create login flows and account recovery flows that work for everyone yet. There are countless stories of folks losing access to business Instagram accounts that get hacked, Google support from a human to fix a problem that is outside of their help articles is non-existent, etc etc. There's still so much "low-hanging fruit" IMO that isn't particularly fun or exciting to fix, but ask your average non-tech friend or family member what they think of the Facebook + Instagram security settings pages / sites / desktop-only settings.

Who is going to pay for all of these subscriptions that will power this GDP increase when average purchasing power of those outside of the top ~10% of earners is decreasing YoY? We're headed toward food and water shortages next to sprawling datacenters, not shared societal prosperity and a healthy middle class.

▲

simonw 2 hours ago | parent | prev | next [-]

> The more AI causes productivity increases, the less and less number of workers will be needed.

That only holds if companies have a fixed need for "productivity" which is met by their current employees, such that their employees becoming more productive means they need less of them.

Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

But generally yes, the biggest open question about all of this is how the impact will play out on the economy, job opportunities etc. I've not seen anyone come close to a confident prediction about how this will play out.

▲

jbreckmckye 2 hours ago | parent [-]

> Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

I mean sure. Every company wants an infinite addressable market. But that doesn't mean it exists.

It might not be possible to sell 10x the software we sell today. It might not even be possible to sell 2x

	▲	forgetfulness an hour ago \| parent [-]
		It's hard to imagine how making insurance sales cheaper for the brokers, churning out astrology apps faster, AI boyfriend bots or running ad campaigns with fewer and lower paid designers is going to drive 10% GDP growth in developed and middle income countries, that's the sort of figures you see when very poor countries finish rolling out electrification, sanitation and transportation.

▲

arjie an hour ago | parent | prev [-]

First of all, common people are not living paycheck to paycheck in the sense that they're at risk of not having money[0]. This is corporate content marketing that has entered the collective memory of people, not anything close to reality.

Secondarily, reducing the cost of making a thing doesn't always mean you get less of a thing. For me, certainly, what happened is that I write way more software than I originally did. When we built compilers, the amount of human engineering effort required to do things plunged, but the amount of software engineering jobs didn't go down.

This is as bad as models will ever be. That part is true. And it's entirely possible we go foom. But it's also possible we don't, and then it depends on where the asymptote lands.

0: https://www.slowboring.com/p/this-economic-myth-needs-to-go-...

▲

seanp2k2 2 hours ago | parent | prev [-]

And yet the job everyone loves to hate, the humble "burger flipper", continues to resist automation yet command minimum wage labor rates. This future of either being a CEO of a company consisting primarily of AI agents building some monthly subscription-based solution to some trivial digital chores OR manual labor that isn't [yet] fiscally viable to automate seems quite bleak. We'd also need a ton of robot technicians and manufacturing that the US has neither the educational and training institutions to support nor the will of the population to fill. Given the ongoing war on immigration, visas, and foreign-made hardware, if this continues, good luck.

	▲	stared an hour ago \| parent [-]
		This would be a Bladerunner future Pope Leo XIV warned against (https://news.ycombinator.com/item?id=48265206), though in different words.

▲

amelius 8 minutes ago | parent | prev | next [-]

At least they're not going to make us watch ads.

▲

yalogin 25 minutes ago | parent | prev | next [-]

To get that revenue and adoption they have to vastly increase their infrastructure spending. If they are currently losing in even the 200/month plans how is it sustainable?

▲

cryo32 an hour ago | parent | prev | next [-]

This is never going to materialise. It’s dead in under 2 years.

The market is shrinking and saturated already and it’s not because of AI gains but geopolitical instability and supply chain issues, some of which are caused by AI spending and stupid ass PE firms refocusing on AI supply chains.

Only our pensions and futures burning.

▲

aspenmartin an hour ago | parent [-]

What do you mean by the market is shrinking?

▲

cryo32 23 minutes ago | parent | next [-]

Literally revenue is collapsing in most sectors. Technology purchasing is declining. Service models are failing to turn a reasonable ROI.

People stopped buying shit.

▲

packetlost 19 minutes ago | parent | prev [-]

It's consolidating into fewer, higher value assets. Over 40% of the S&P500 is in companies that are heavily (potentially over) invested in AI.

	▲	aspenmartin 12 minutes ago \| parent [-]
		tech companies have grown disproportionately to other industries, but that says nothing about the growth in other industries - S&P has a Q1 2026 blended revenue growth of 11.3% according to FactSet - most sectors are growing, not just tech

▲

jstummbillig an hour ago | parent | prev | next [-]

> 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

This is where the napkin math is breaking down in a big way. There is absolutely no reason to assume this will only impact "knowledge workers". Farmers use computers. Farmers will use AI.

	▲	vablings an hour ago \| parent \| next [-]
		AI for what? None of the AI a farmer could or would use would be any more meaningful that light chatbot usage or already existing computer vision/gps
	▲	quantumleaper 35 minutes ago \| parent \| prev [-]
		The kind of farm that would use AI is already 99% machinery and automation.

▲

browningstreet 2 hours ago | parent | prev | next [-]

Somehow Uber and WeWork survived the same kind of grand projections that they never met.

▲

121789 2 hours ago | parent | next [-]

uber sure....but how did wework survive? they are a smoldering husk of a failed company looted by its founder

▲

hamdingers an hour ago | parent | next [-]

I'm sitting in one right now and don't see any smoldering...

▲

khuey 22 minutes ago | parent | next [-]

They literally went bankrupt and wiped out the original shareholders.

	▲	hamdingers 10 minutes ago \| parent [-]
		I guess I'm just not clued into your exotic definition of "survived" if continuing to function doesn't qualify. I tend to go by the dictionary definition. Chapter 11 is not Chapter 7. Businesses survive chapter 11 bankruptcies all the time. For example, WeWork.

▲

kevin2107 an hour ago | parent | prev [-]

lmao. I'm sitting in Hiroshima and nothing is burning

▲

naravara 2 hours ago | parent | prev [-]

The company’s gone but the assets just got sold to other commercial real estate firms.

Uber was basically only ever software to help people use their own cars so a very small part of their valuation was physical stuff to upkeep, it was just deals and obligations they had.

Not sure how it shakes out for Anthropic and OpenAI. There’s a lot of physical capacity that needs to be built out and can depreciate. But there’s also a lot of network effects and dependencies being built in with enterprise users.

I don’t know how swappable the tooling is either. I think over the long term the UI, model training and documentation, and infrastructure are going to end up being run by different parties and I’m not sure which leg of that chain ends up in a position to skim most of the profit off. My guess is that Apple and Google end up raking in all the money since they control the OS and app stores while the rest of the stack gets driven down to being generic commodities. At least where mass market consumer adoption is concerned.

▲

windexh8er 2 hours ago | parent | prev | next [-]

The difference is that they had room to charge more of their customers and pay less to their workers. The AI industry doesn't have both sides to play at this point. Training and inference are getting more expensive and if you take on the high prices now you're just floating yourself further downstream from profitability long term (which does not look viable for any of them currently).

▲

paxys 2 hours ago | parent | prev | next [-]

WeWork absolutely did not survive

▲

tapoxi 2 hours ago | parent | prev | next [-]

I don't think Uber was doing $1 trillion in infrastructure spend.

▲

hansmayer 2 hours ago | parent | prev | next [-]

Funny you should mention Uber. What was it their COO said recently about the AI costs?

▲

simonw an hour ago | parent [-]

I quoted exactly what they said in my piece, under the heading "The AI-failure stories around this are pretty thin": https://simonwillison.net/2026/May/27/product-market-fit/#th...

> But then you sometimes go and talk to your senior engineering leaders and you’re saying, OK, how many projects that were on the cutting room floor got moved above the line because of the productivity gains because 25% of our code commits were via Claude Code last quarter?

> That link is not there yet, right? I think maybe implicitly there’s more that is getting shipped. But it’s very hard to draw a line between one of those stats and, OK, now we’re actually producing like 25% more useful consumer features, right? And that line is hard to draw.

That's pretty weak sauce. I don't think that justifies the headlines that came out of it, personally.

▲

hansmayer an hour ago | parent [-]

? What are you talking about mate? The man all but says "this shit does not work for us". It iss layered in that careful, sanitised corporate shit-sandwich communication approach, where you take a nice piece of shit and layer it in between two slices of avocado so its sweeter to swallow for the "consumer" of your message.

He also said in that article that what prompted the discussion was the public statement by the Uber CTO that he had already burnt through his organisations yearly AI-budget in April. Please stop this shilling mate, and trying to hide the overall perspective between this or that word.

	▲	simonw an hour ago \| parent [-]
		Did you read my piece? I covered the Uber CTO thing too: https://simonwillison.net/2026/May/27/product-market-fit/#th... > The most discussed has been Uber, based on this report where CTO Praveen Neppalli Naga indicated that Uber had “maxed out its full year AI budget just a few months into 2026”, mostly thanks to Claude Code. > Given that Claude Code only got really good in November it’s entirely unsurprising to me that a budget set in 2025 may have failed to predict demand for that tool in 2026!

▲

xoac 2 hours ago | parent | prev [-]

somehow the invisible hand of the market is also blind af

	▲	ArcHound 2 hours ago \| parent [-]
		Makes sense if you think about it: if all photons pass through you (invisible) then you can't capture them to get info (blind).

▲

ciconia 19 minutes ago | parent | prev | next [-]

> make developers 2x, 5x, 10x as productive on stuff that matters

What does this even mean? Is this about speed of development? Is this about headcount? LoC? How are coding agents contributing to productivity in places like GitHub, Shopify or Meta? I mean companies that already have an established product. I really wanna understand this because I'm not seeing that GitHub's product suddenly became so much better than it was 2 years ago, so where's all that productivity going?

	▲	zamalek 12 minutes ago \| parent \| next [-]
		The productivity is going into perverse incentives[1], e.g. we have improved (by which I mean "increased") token use. More PRs every day. More lines of code. All things we knew were shit-brained metrics a decade ago (obviously except token use). We've also increased how much our coworkers need to read, or deal with. You can get an AI to make any point you want, so you can ignore the 5 humans raising alarms due to the 1 clanker you made say what you want to hear. All numbers going up. There are obviously people producing additional true value with it, probably, but that's almost certainly scarce. [1]: https://en.wikipedia.org/wiki/Perverse_incentive
	▲	flexagoon 15 minutes ago \| parent \| prev [-]
		Productivity is measured in the number of AI-generated Twitter posts developers can make about their AI-generated startups

▲

golly_ned an hour ago | parent | prev | next [-]

This is why 'agents' are the solution for these companies. Token spending goes through the roof. As long as a human is in the loop needing to read or review at human speed, that's a ceiling on how many tokens per user they can generate.

▲

npn 38 minutes ago | parent | prev | next [-]

we all know it is impossible goal to make. surely AI will be even more useful in the future, but as long as china exists and continue to undercut the price, the goal will be never meet.

> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

with that much money, the companies can easily buy their own hardware and hosting free public models, no need for those expensive subscriptions.

▲

TimTheTinker 2 hours ago | parent | prev | next [-]

I thought Anthropic and OpenAI's combined CapEx has been <100B?

source: https://isaiprofitable.com/

▲

kilroy123 an hour ago | parent | next [-]

That site needs Apple on the list. ;-)

▲

Danox 19 minutes ago | parent [-]

Why? All their money is going to Apple Silicon and the five ecosystems, so far in Apples entire history, the largest acquisition has only been $3 billion dollars, OpenAI is currently getting nothing and they gave Google a measly $1 billion refund per year for the use of Gemini.

If John Ternus wants to spend some money, spend it on bringing memory in house. Apple has the money and the engineering talent to do so, have it fab/made onshore in partnership with TSMC.

Do it Apple because you have to not because you want to the Chinese probably will be taking over the memory industry, worldwide, by taking advantage of the greed from three memory companies and their AI overlords.

	▲	kilroy123 4 minutes ago \| parent [-]
		That's the point. To show how they _haven't_ lost billions on this.

▲

deaton an hour ago | parent | prev [-]

Maybe so far, but they've committed to well over a trillion in future capex.

▲

mirekrusin an hour ago | parent | prev | next [-]

Now try to take back llms from developers and see what happens.

	▲	bigfishrunning 34 minutes ago \| parent [-]
		If, by some miracle, all LLMs ceased working right this second, any developer who would no longer be productive should not have been a developer in the first place.

▲

PunchyHamster 14 minutes ago | parent | prev | next [-]

That assuming once they start squeezing people won't just go to deepseek or other cheaper competition

> That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

And most research shows people far over-estimating their own gains. Once companies start counting the actual (and not just reported) gains, the AI budgets will be more limited as people realize it's an useful and versatile additon but not replacement for most types of work

> We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

Upswing of the hype cycle while growth of tech itself is flattening, both coz of techs innate issues (which might or might not be solved, but some papers claim they are unsolvable with current approach) and just the fact the spike in growth caused so high economy cost that it put brakes on itself.

▲

superxpro12 25 minutes ago | parent | prev | next [-]

It's going to be a typical saturation curve. A lot of upfront tokens spent on things that have stockpiled over the years, and then the derivative on token spend trends to zero as the users run out of immediate things to try. Sure there will be ongoing maintenance and experiments, but it wont be nearly as close as the initial inrush.

▲

logtempo an hour ago | parent | prev | next [-]

> +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

Except that if your company go 20% faster than the others companies, you win market shares. But then, everyone will use the same tools and companies will be at even speed, but the tool will stay.

Now...if the market is saturated, it's useless to try to do things faster. Cheaper yes, but not faster.

	▲	archagon an hour ago \| parent [-]
		Pretty much all major tech companies today are horribly bloated and mostly metastasizing instead of innovating. I'm not sure how 20% increased productivity will help in any way with that. If anything, it might accelerate enshittification and turn potential customers off even more.

▲

deaton an hour ago | parent | prev | next [-]

Bigger than that, they have to contend with open weight local inference. Open weight models right now haven't caught up to the frontier models of right now, but they're as good as the frontier models of not too long ago. If open weight models reach a certain point, then frontier model providers are going to struggle to make anything selling tokens, because eventually people will realize they don't need Mythos for everything.

▲

aprdm an hour ago | parent | prev | next [-]

"Next 5y" doesn't apply to AI factories

▲

jmyeet 2 hours ago | parent | prev | next [-]

YEPPP... and I'm kind of shocked at how many people can't do simple math.

Let's put it context. Google's annual revenue seems to be north of $400B. So if OpenAI suddenly had Google's revenue, it would still be insufficient to recover their investment.

and it's a ticking time bomb because $1T in servers, CPUs, GPUs and memory is going to be worth $200B in 5 years. You can say they can keep using what they've got. Sure. But they're also not going to stop spending on new hardware. And the competitor that comes along in 5 years and spends $1T doing the exact same thing is going to have a huge advantage.

OpenAI at this point reminds me very much of the Russ Henneman pre-money hype cycle.

▲

mfuzzey 28 minutes ago | parent | next [-]

It's actually worse than that. It's not just financial depreciation or that the existing hardware becomes obsolete due to being less powerful than new hardware but also that hardware being run all the time at high load actually has a limited lifetime of a few years so it will physically break...

	▲	jmyeet 10 minutes ago \| parent [-]
		I agree but it's even worse than that. Data centers come down to performance-per-Watt. Electricity accounts for 20-30% of a data center's operating cost [1]. I don't know the exact breakdown but the GPU part of that is probably the majority given how power hungry GPUs are. The B200 is upwards of 1200 Watts [2]. The B200 is rated at ~4.5PFLOPS of dense FP8. So you're getting 3.75PFLOPS/W. We don't know what the next generation will look like. The A200 (Hopper architecture card that preceded the B200) had ~4PFLOPS apparently but also lower power consumption. Obviously this changes depending on whether you're looking at dense or spare and FP8 vs INT8 vs INT4 vs FP4, etc so we're just using FP8 as a yardstick. Imagine a fictional B200 successor, the T200 that has 8PFLOPS of dense FP8 at 1000 Watts. Well then a DC built on that where the T200 will likely cost similar to what the B200 does now, you'll get nearly double PPW so the same size DC and same electricity load is going to be like 2 of your old DCs in operating costs. That's a big deal when you've laid out a trillion dollars. [1]: https://iaeimagazine.org/electrical-fundamentals/how-much-el... [2]: https://www.trgdatacenters.com/resource/h200-power-consumpti...

▲

hansmayer 2 hours ago | parent | prev | next [-]

This should be the top comment. Also, I think its not that many people, including our Simon here, are not good at math. Its more like, some of them seem to be incentivised to not be cough, cough, "good at math". How else will the hype sell?

▲

simonw an hour ago | parent | next [-]

I thought my post was pretty free of hype. I said that this new revenue "Maybe even enough to start covering their costs!"

▲

WhrRTheBaboons an hour ago | parent | next [-]

that statement is pretty high on hype relative to the actual financials though

▲

hansmayer an hour ago | parent | prev [-]

Well, your title certainly was not, in any case!

	▲	chipotle_coyote 20 minutes ago \| parent [-]
		I mean, a company that loses money on every widget they sell might technically have found "product-market fit." :) It seems quite possible to me that developer tooling is going to end up being the biggest win from LLMs because there is a product-market fit -- and also quite possible that OpenAI and/or Anthropic end up getting bought for pennies on the dollar because their burn rate is unsustainable. AI may end up being this generation's "dark fiber."

▲

Imustaskforhelp an hour ago | parent | prev [-]

At a certain point, I genuinely feel like the best way this hype is being sold is by making people genuinely believe in it.

and in that sense, if Anthropic and OpenAI are able to create the projection that they can-be profitable despite finances seeming bubbly at best, I think that what happens is that these companies spew so much amount of content that people like Simon get into it too.

There is a deeper problem of people falling into AI psychosis too, in general, I am not sure if Simon has fallen into it or not

I think that the greatest point which can be made here is to not offload your thinking to others and to think about the situation yourself. Sounds familiar (looks like we are all off-loading our thinking itself to machines)

Side-note: As humans, we have a tendency to quickly judge or make quick decisions which stems from our times foraging and scavenging in jungles.

Another Side-note: at a certain point, I am unsure of how much to think about AI or not, certainly discussions about it that were happening 2 years ago weren't helpful in contexts that they are used now (well not in any way or form that a person discussing and getting into the weeds of AI 2 years ago is better than a person just getting into it say 2-3 months ago)

With the industry (moving so fast) [but that doesn't mean that you can't catch up with it, I feel like the fast word has made people think that they are falling behind which is imo wrong i suppose]*, It is basically unsure to me of any FOMO or anything if you aren't using AI already, I find this notion naive.

People might be making strong opinions (AI psychosis) and skills on the tools available at the moment the same done 2 years ago. We don't quite know about the tech as these are still black-boxes and how they progress and what these "AI skills" might survive or not in future. Heck, we aren't even sure if these tools might survive or not or wouldn't be made magnitudes more expensive simply to break even as they are given to us for the first time at percentages of the price.

I don't know if I should form (strong) opinions yet and also a question of its worth so much thinking efforts in the first place, probably just gonna do my own thing (the way I want to) which includes learning C at the moment. because learning is fun.

▲

simonw 25 minutes ago | parent [-]

I didn't exactly say that they were about to become wildly successful companies. I suggested that they had "found product-market fit" - not too impressive for more than a decade of work - and that their revenue may even be "enough to start covering their costs".

▲

Imustaskforhelp 7 minutes ago | parent [-]

Firstly thanks for responding and I wish you to have a nice day. your suggestions have value and I appreciate you writing the article. Perhaps enterprise businesses do end up becoming the fat and meat of the AI industry.

My question which I wish to ask: What would happen to these AI companies if they turn out to be anything but wildly successful companies, both to the investors who have already invested in it and to those who might be investing indirectly into it in the near-future (passive investors, retirement funds)

I would love to hear your thoughts on it!

Thanks and have a nice day :-D

	▲	simonw 4 minutes ago \| parent [-]
		> What would happen to these AI companies if they turn out to be anything but wildly successful companies I'm not nearly enough of an economist / finance person to answer that credibly, but I expect they'll go bust, and a lot of people will lose their shirts. ... and the model weights will be sold to other companies who will then run them at a profit, and eventually figure out an economically sustainable way to train new ones. The 1800s railway booms are a good comparison here - a lot of companies went bust, a lot of investors lost money, and we still ended up with railways. If the AI companies all go bust we're going to have a lot of spare data center capacity!

▲

mountainriver an hour ago | parent | prev | next [-]

How could extremely capable artificial brains ever pay for themselves?

▲

WarmWash an hour ago | parent | prev [-]

Prices are not going to stay where they are.

You have either never seen a tech cycle, or need to be reminded of that. The pressure to buy more expensive plans is already starting to form.

▲

sowbug 2 hours ago | parent | prev | next [-]

There is also the EV (expected value) of developing AGI. Even if you personally believe the probability is low within the lifetime of either of these companies, the value would still be extraordinarily high, enough to forgive a $5T or so miscalculation here or there.

▲

jbreckmckye 2 hours ago | parent [-]

I don't think AGI was ever a serious endeavour, just something the labs talked up to grab attention.

I am willing to bet a Twix we'll look back on that stuff in 2 years with a lot of embarrassment

▲

sowbug an hour ago | parent [-]

The high-risk side of that bet would need to win more like a lifetime supply of Twix. But in a post-scarcity nirvana, everyone already has that. So sure, you're on at even money. See you in two years.

▲

deaton an hour ago | parent [-]

Theres no reason to believe, based on recent trends, that AI would lead us to a post-scarcity world, even if it could do all of our jobs better and cheaper.

	▲	sowbug 36 minutes ago \| parent [-]
		I'll wager a hypersled of my Twix against your next three rations of gruel. But I think I'm done betting after this one.

▲

ar_lan 2 hours ago | parent | prev | next [-]

> unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

Simple - you make them work 2x, 5x, or 10x more hours.

	▲	OtomotO 2 hours ago \| parent [-]
		There are not enough hours to do that

▲

EGreg 3 hours ago | parent | prev | next [-]

Here is a serious question.. Can we sell into the hype cycle and on the way down with this: https://safebots.ai/costs.html

▲

adithyassekhar 2 hours ago | parent [-]

I asked claude to generate a frontend and it made the same template. Same san serif and serif fonts together. Same colors. Same typography. Same layout and animations even. It’s wild how similar it is. No not similar it’s the same damn thing.

▲

dd8601fn 2 hours ago | parent | next [-]

I’ve seen the same dashboard for a dozen custom web applications now, including a couple I had it make for me.

It really does have a particular lane for each chore, and it’s reproducible.

	▲	properbrew 2 hours ago \| parent [-]
		Yep and when you see it in the wild it stands out like a sore thumb, absolutely no thought into a bit of a unique design or branding. I have a few live websites built using LLMs and they will just go for default generic templates and colours if there's no vision.

▲

jeffreygoesto an hour ago | parent | prev [-]

It produces the "most average" web design unless you really prompt your way out, isn't it? If you don't care enough to prompt, Claude does not care to be individual.

	▲	WarmWash an hour ago \| parent [-]
		Technically from claude's POV, it's one individual copied millions of times. All claudes are clones.

▲

mannanj 37 minutes ago | parent | prev | next [-]

One quick question. Did tax payer money fund these data centers? If so, how does that money translate to their profit and a return for the people whose work paid for the resources?

Or did we just get scammed?

▲

2 hours ago | parent | prev | next [-]

[deleted]

▲

YetAnotherNick 2 hours ago | parent | prev | next [-]

> $5t to $10t to make back in the next 5 years

Wait what? They spent 2 order of magnitude less on hardware.

▲

trjordan 2 hours ago | parent [-]

From the verge: https://archive.is/kU4Zg

> Gartner forecasts that large AI companies would need to earn cumulatively close to $7 trillion in AI-driven revenue through 2029, which is close to $2 trillion per year by the end of the period. In order to achieve “historic returns,” the providers would need to earn nearly $8.2 trillion in the same period.

▲

YetAnotherNick 2 hours ago | parent | next [-]

Those numbers don't even track even in the same sentence. If it is $2T/year by the end of 2029, it would be something < $6T cumulative in 3 years.

	▲	layer8 2 hours ago \| parent [-]
		“Through” 2029 is a bit more than three and a half years. The $2T are likely the yearly average of the $7T in that period.

▲

b0r3dthisD4y 2 hours ago | parent | prev [-]

The numbers are made up political correctness anyway.

Everyone's agency is 100% captured by belief in Wall Street. Too few <50 have any meaningful labor skills to blink.

We'll continue to have consent manufactured via media platforms and in 3 years no one will bat an eye at these companies being worth $12 trillion as Altman and Musk climb two ladders holding a "mission accomplished" banner.

▲

HDThoreaun 2 hours ago | parent | prev | next [-]

Source on 200 million knowledge workers worldwide? My understanding is that it's just above 1 billion. I dont think a billion subscriptions at $1000/yr is out of the question but it might take a decade to get roiling

▲

swatcoder 2 hours ago | parent | next [-]

You're suggesting that 1 in 8 people worldwide, including every one from infants and the elderly, are knowledge workers. Are you sure that's what you mean?

I'm not even sure that 1 in 8 people I know would qualify as a knowledge worker, let alone a knowledge worker that might profoundly benefit from on-the-horizon AI. And I'm in a highly skewed population.

▲

WarmWash an hour ago | parent | next [-]

I think the underestimation is how many people want a personal knowledge worker in their pocket, and are willing to pay ~$65/mo for it.

	▲	swatcoder 29 minutes ago \| parent [-]
		Personally, I've only encountered any of those people on line, and almost exclusively here on HN. Most people I've met -- and again, in a pretty darn skewed sample globally -- see $65/mo as a lot of money to spend on technology of any kind and can't think of anything much they need from "a personal knowledge worker in their pocket". I don't know a single person in real life who remains excited about AI at all, and only a few software engineers who feel it'd be worth that much. Everybody seems to be mostly confident with the "knowledge productivity" in their personal and professional life and a pretty skittish about spending in today's economy. Most would be excited about a magic new robot that affordably saved them from unwanted physical labor and drudgery, but nobody needs much real help making appointments or filling out forms or whatever. That's not to say I won't be proved wrong some day, with some further innovations in AI products, but global-scale demand isn't waiting for anything that's been released so far.

▲

HDThoreaun 2 hours ago | parent | prev [-]

Well around 40% of people work. I dont think its crazy to say around a third of jobs are knowledge jobs, but what do I know

	▲	matthewowen an hour ago \| parent [-]
		85% of the world population lives outside of developed nations. 27% of the world's workforce is in agriculture (contrast to the US where it is 1-2%). 15% in manufacturing. A lot of people work in "services" (especially in high income nations, where it's roughly three quarters) and some of those are knowledge workers... but a huge number of them are nail technicians or hairdressers or bartenders (etc etc).

▲

rootusrootus 2 hours ago | parent | prev [-]

A billion? Really? At 200M you’re already including a lot of people that stretch the definition of knowledge worker.

▲

naravara 2 hours ago | parent | next [-]

A lot of those ‘edge cases’ in the definition of “knowledge worker” are probably the stuff that’s most likely to have significant parts of the work augmented or replaced by AI agents. Like, call-centers are almost certainly going to get turned over in a big way. It’s not like the median tier-1 support operator just reading off a script is much better than an LLM anyway.

▲

esseph 2 hours ago | parent | prev | next [-]

Yeah, just looked into this. Knowledge workers is a big group and probably much larger than you think it is.

Basically if you're not doing manual labor, it's probably knowledge work.

Roughly 1/3rd of the working population.

Some data tucked in here: https://gist.github.com/danielmiessler/2dc039762a202b083753b...

▲

HDThoreaun 2 hours ago | parent | prev [-]

> At 200M you’re already including a lot of people that stretch the definition of knowledge worker.

How do you know this? Im certainly open to recalibrating my numbers which is why I asked for the source

▲

windexh8er 2 hours ago | parent [-]

What's your source, because it looks wildly out of proportion compared to numbers we have now.

▲

Andoryuuta 2 hours ago | parent | next [-]

To add an actual source to this thread, a brief paper by researchers at the International Labour Organization (ILO) states that for knowledge workers globally "... there are between 644 and 997 million jobs, which represents between 19.6 per cent and 30.4 per cent of global employment respectively." [1]

[1]: Berg, Janine and Gmyrek, Pawel, Automation Hits the Knowledge Worker: ChatGPT and the Future of Work (April 21, 2023). UN Multi-Stakeholder Forum on Science, Technology and Innovation for the SDGs (STI Forum) 2023, Available at SSRN: https://ssrn.com/abstract=4458221

	▲	windexh8er an hour ago \| parent [-]
		Globally, sure. The assumption here is all users are on the same economic footing, they are not. Only about a 1/3rd (at most) of that count can afford $1000+ monthly, and even then that is wildly out of line with what most will.

▲

elliotec 2 hours ago | parent | prev | next [-]

Here's a source from 2019 that says: "By 2023, the number of knowledge workers in the world will increase to 1.14 billion, with more than four-fifths of that growth coming from the emerging world."

https://www.gartner.com/en/newsroom/press-releases/09-24-201...

	▲	windexh8er an hour ago \| parent [-]
		Thank you for validating my point. > "...with more than four-fifths of that growth coming from the emerging world." If anyone thinks this is a part of the global TAM that's got $1000 a month to blow, well then I've got a stable of flying unicorns to sell you.

▲

HDThoreaun 2 hours ago | parent | prev [-]

I googled "number of knowledge workers worldwide" and read the top results. If you read it as I was confident in a billion I apologize, Im just trying to get an accurate count. What numbers do you have now and where did you find them?

	▲	windexh8er an hour ago \| parent [-]
		That's not the TAM of 1B knowledge workers globally. If that were the case many industries would have a 2-3x target market. To simplify break that 1B up into 3 levels of purchasing: 1) High-tier (US, Western EU, ANZ, Japan, South Korea, Singapore, UAE, etc) - 200-250M knowledge workers. 2) Mid-tier (Eastern EU, Latin America, urban China, India tech sector, etc) - 300-400M 3) Low-tier (Rest of the world) - 300-400M Low-tier users are mostly free tier or heavily subsidized pricing. Mid-tier are going to account for USD sub-$100 tiers. Probably averaging less than $50/seat. High-tier are who you are assuming is the 1B. Users are not equal in that knowledge worker count, so there aren't 1B knowledge workers to charge money. And when you consider Low-tier users a majority of those are free users which need to be subsidized by the High-tier users. So either free tiers get much more restrictive or the providers lose additional training data. A bulk of Low-tier users cost money and provide little to no revenue. Edit: And think about Mid-tier and Low-tier for 5 seconds. Why would they pay Anthropic or OAI when they get get 100x+ inference from DeepSeek or Xiaomi? Mid-tier may be the only area that is willing to spend money on a US provider, but I would wager significantly on the fact that users in the Low-tier almost universally do not care.

▲

solenoid0937 2 hours ago | parent | prev [-]

> 20% if you're a developer. That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

Of course it will. The value of an employee is a multiple of what they get paid.

If you pay an employee $500k and they make $2M for your company (like Meta), then of course a 20% increase for the salary is justified if the velocity is increased 20% as well.

▲

lunar_mycroft an hour ago | parent [-]

The difference between what the employer makes per employee and what they spend in compensation doesn't matter. If the increase in productivity isn't greater than the increase in cost, there isn't a reason to pay for AI over hiring more developers.

Imagine an employer with 10 employees paying $500k per employee and making $2M per employee in revenue (to use your numbers). They could hire two more employees and spend an extra $1M (+20%), but make an extra $4M in revenue (+20%). Alternatively, they could buy all ten employees a $100k AI subscription, for a total of $1M extra spending (+20%) but an extra $4M in revenue (+20%). You'll notice both scenarios are identical, so an employer optimizing for profit would have no reason to prefer one over the other.

▲

chasd00 24 minutes ago | parent [-]

There’s a lot relationship and culture management overhead involved when adding 2 more people to a 10 person company. I think any business leader would take the productivity speed up from buying a tool over hiring more people and integrating personalities/habits/viewpoints to an existing established culture any day of the week.

	▲	lunar_mycroft 15 minutes ago \| parent [-]
		You're basically positing that the real cost of a 20% headcount increase is higher and/or the productivity gain is is lower than 20%. That isn't an unreasonable claim, but it's basically rejecting the premise here. You might just as well object to the premise that you can buy a 20% speedup by spending an extra 20% on tokens.

▲ hansmayer 2 hours ago | parent | prev | next [-]

> Anthropic are strongly rumored to be about to have their first profitable quarter

No, its more like their own leak to WSJ and according to Ed Zitron -> seems to be heavily engineered via non-GAAP practices such as counting potential, but not realised revenue as actual revenue - the stuff for which I would be arrested if I did it at my company.

Also it appears according to Ed's analysis - strangely they seem to be projecting only that one quarter as profitable - potentially to calm the investors ahead of the IPO. Investor fraud anyone?

▲

cootsnuck 42 minutes ago | parent | next [-]

Also it was but a few months ago that their CFO said, in a court filing, that Anthropic's revenue across the entire lifetime of the company "exceeds $5 billion". Pretty strange.

https://www.reuters.com/commentary/breakingviews/anthropic-g...

▲

jonas21 18 minutes ago | parent [-]

How is it strange? The "exceeds $5B" quote was from December 2025. Anthropic has seen tremendous growth since then, ever since Claude Code with Opus 4.5 got really good at coding.

If you've ever been at a startup, this is exactly what it looks like when you go from not having product-market fit to having it (though with a few extra zeros on the end compared to most).

	▲	hansmayer 14 minutes ago \| parent [-]
		Ah yes, December 2025...such a long, long time ago...

▲

supern0va an hour ago | parent | prev | next [-]

>according to Ed Zitron

So, unsourced vibes from a shady guy whose entire empire is built on being against AI?

I genuinely don't know how folks can continuously buy into anything he has to say after that Wired piece. The credibility there is seriously lacking.

Please, continue to be skeptical of the labs. But people need to stop talking about this dude as if he's the Holy Grail of the anti-AI movement. It's going to blow up in y'alls faces.

▲

hansmayer 30 minutes ago | parent [-]

> So, unsourced vibes from a shady guy whose entire empire is built on being against AI?

Actually he provides sources when he analyses stuff and imho much better than the usual corporate "Sam Altman says we should ask ChatGPT how to raise babies" crap. Also, I don't know many 'shady' guys who have built entire "empires", nor does he seem to actually have an empire. Usually being shady means you are kind of unknown and all. I am not glorifying Ed, don't even know him personally. I am not even impressed with his writing style much to be honest. But he brings important facts and information to light, which otherwise would have been lost in the cacophony of corporate media light treatment of these con-men. Holy Grail? Blowing up in our faces? WTF are you talking about?

▲

supern0va 16 minutes ago | parent [-]

>Actually he provides sources when he analyses stuff and imho much better than the usual corporate

You said it was likely an internal leak to the WSJ "according to Ed Zitron". Did Ed have a source for that, or was it just vibes?

	▲	hansmayer 12 minutes ago \| parent [-]
		The source was the article in the WSJ itself, which then referred to their source at the Anthropic. Which kind of is a textbook definition of "leak". Because otherwise Anthropic would have their lawyers hunting both the employee breaking their stringent NDA and the WSJ as well...

▲

pier25 an hour ago | parent | prev | next [-]

Yeah I'll believe it when I see it. Revenue is increasing but so are their costs.

Back in 2024 their CEO claimed training costs would rise to $10-100B in the next years.

https://www.tomshardware.com/tech-industry/artificial-intell...

▲

hansmayer an hour ago | parent | next [-]

Their CEO claims a lot of wild shit. He claimed in January this year, that in about 2-3 weeks from this moment, i.e. "in 6 months" that AI will be doing all of SWE work. Lets hold these people accountable for a change!

▲

aspenmartin 35 minutes ago | parent | next [-]

> "in 6 months" that AI will be doing all of SWE work

I assume this is the quote you're referring to from Davos?

"I have engineers within Anthropic who say I don’t write any code anymore. I just let the model write the code, I edit it. I do the things around it… we might be six to twelve months away from when the model is doing most, maybe all of what SWEs do end to end."

that was in Jan, he said "might" and he said 6-12 months. Yes! Let's hold him accountable for saying reasonable things!

▲

hansmayer 28 minutes ago | parent [-]

Reasonable things? He said the same shit over and over over the last several years. Yes, lets hold him accountable - you don't make such "oopsies" accidentally, several times in a row.

▲

aspenmartin 21 minutes ago | parent [-]

Seems pretty reasonable to me. Timescales are hard for anyone to predict. He is forced to do these predictions to know how much compute to buy in advance. Surprisingly, he underbought compute and now has to scramble to secure it from xAI or wherever he can. So he was overly conservative...

	▲	hansmayer 9 minutes ago \| parent [-]
		> Timescales are hard for anyone to predict Indeed. That's why serious people are very careful, even if they are not running a company supposedly worth 1T USD > He is forced to do these predictions to know how much compute to buy in advance Ah well, that explains it. For my companies next quarter, I'll just pull some random numbers out of my ass so we can make plans with material impact to company business based on that.

▲

supern0va 37 minutes ago | parent | prev | next [-]

I work in big tech and probably 90% of code over the last month has been written by AI. And I suspect it's probably higher within Anthropic, which is probably what he's basing his opinion on.

So, he's closer to correct than not.

That said, your recollection is also flawed. It was in mid-March, and here's the relevant quotes:

>I think we’ll be there in three to six months—where AI is writing 90 percent of the code. And then in twelve months, we may be in a world where AI is writing essentially all of the code.

[...]

>But the programmer still needs to specify, you know, what are—what are the conditions of what you’re doing, what—you know, what is the overall app you’re trying to make, what’s the overall design decision? How do we collaborate with other code that’s been written? You know, how do we have some common sense on whether this is a secure design or an insecure design?

[...]

>So as long as there are these small pieces that a programmer, a human programmer, needs to do, the AI isn’t good at, I think human productivity will actually be enhanced. But on the other hand, I think that eventually all those little islands will get picked off by AI systems.

With another 3-4 months left on the clock, his prediction seems remarkably on point for at least certain organizations and domains.

I welcome you to also hold yourself accountable in the coming months if this trend continues. ;)

▲

pier25 4 minutes ago | parent | next [-]

> And I suspect it's probably higher within Anthropic

That probably explains why their uptime and reliability are so bad.

▲

hansmayer 25 minutes ago | parent | prev | next [-]

> I welcome you to also hold yourself accountable in the coming months if this trend continues. ;)

My company did not swallow hundreds of billions in shady investment deals and is not publicly traded. We work with real money, and the revenue on our books is the revenue that is actually booked, not fake revenue we plan in 2 years time to maybe happen. So no, I am not going to hold myself accountable. But people who work with other people's money should be absolutely held accountable when their wild imaginations don't come true, repeatedly, quarter after quarter, year after year!

▲

supern0va 12 minutes ago | parent | next [-]

I will note that you have essentially not responded to anything specific in my comment, nor at least acknowledged that you misstated Dario Amodei's actual prediction.

▲

aspenmartin 19 minutes ago | parent | prev [-]

I think he means hold yourself accountable when it turns out your predictions and pessimism don't age well.

▲

hansmayer 7 minutes ago | parent [-]

Mate, for 5 years I've been hearing that crap. I am not predicting anything / on the contrary the AI boosting bunch is. When are your predictions coming true?

	▲	aspenmartin 2 minutes ago \| parent [-]
		What predictions, sorry?

▲

m1coti 12 minutes ago | parent | prev [-]

Written, but was it reviewed? Do you need to edit code written by LLM?

I agree that most of the things are written by AI but writting code was never the bottleneck in big tech.

▲

sampli an hour ago | parent | prev [-]

Elon playbook

▲

aspenmartin 37 minutes ago | parent | prev [-]

thats not that far off. Costs like $100Ms to train a frontier coding agent model today, billions if you count the full pipeline. Combine that with the infra we're building out, the fact that you have multiple labs building similar scaled models, the industry-wide costs of training frontier models could easily surpass 10B/yr in 2027

	▲	pier25 23 minutes ago \| parent [-]
		Yes, when he made that claim back in 2024 they were spending like $100M to train a model.

▲

surgical_fire an hour ago | parent | prev | next [-]

Also, if I understand correctly, they are rumored to have a profitable EBITDA.

It's a funny metric considering Depreciation is a huge cost for them.

"We are profitable when we don't count our expenses"

▲

skybrian 43 minutes ago | parent [-]

There's a good reason to look at it separately: if inference is profitable then they make money (or at least lose less money) when they get more customers, because any fixed costs are spread across more usage.

	▲	surgical_fire 21 minutes ago \| parent [-]
		Depreciation is part of the cost of inference. Inference happens in GPUs that have a relatively short lifespan. Those GPUs are very expensive. Inference is expensive because a GPU can only process a certain amount of requests in a given timeframe. Remember that Anthropic is constrained in compute. If they are constrained, it means that those GPUs are not idle. If they have more customers, they will need more GPUs. If they have to play silly games using EBITDA to be "profitable", then it means that they need to ramp up prices a lot more than they already did. Which is why in these discussions I always say that inference is also extremely expensive. Too many people like to pretend without any evidence that inference is cheap.

▲

duped 2 hours ago | parent | prev [-]

AI companies/users are filled with liars and grifters, so any numbers/outlook they report should be highly suspect.

▲

supern0va 28 minutes ago | parent | next [-]

I must admit that I am going to find it fascinating when we hit the point where it becomes nearly impossible to deny the efficacy of these tools. I have straight up had people, even in real life, suggest that I'm lying about my productivity gains or what I'm able to accomplish with them.

Like, I understand the reasonable arguments against (I even agree with a few), but it's clear that some people have fully inserted their head into the sand and just don't want to believe any of this could be true. Which will be harsh, since I think getting hit with this train all at once in the future is going to be a rougher ride than a slower coming-to-terms-with, even if the result is one we're unhappy with.

	▲	hansmayer 6 minutes ago \| parent [-]
		In the meanwhile, Google AI search still says the next year after 2026 will be 2028.

▲

bflesch an hour ago | parent | prev [-]

There's a saying "the fish stinks from the head".

▲ aerhardt 2 hours ago | parent | prev | next [-]

I find this analysis confusing. PMF for coding was likely reached some time last year. Profitability, which is different, we don’t know. The article kind of confuses both without making a strong economic case or using numbers in a compelling way. I don’t understand what the Uber case has to do with this either. The Uber COO clearly said that at least in terms of ROI he’s not seeing the results either.

My take is the product has been very useful for coding (PMF) for months. But it’s certainly not useful at any cost…

▲

sixhobbits an hour ago | parent | next [-]

Pmf is this weirdly defined thing where "if you're not sure you have it then you don't".

I think it was clearly useful for months to people who had tried it and taken the time to understand it, but now that knowledge has spread to the point where wallet holders are convinced it's not just passing fad or hype so now pmf can be "claimed".

I agree it's weird to say "those people have pmf" though, usually it's something you define for yourself

▲

aspenmartin an hour ago | parent | prev | next [-]

What I also find confusing though is that folks seem to ignore trajectory which is maybe the biggest lede to bury. As Simon says, we have had "good enough" coding agents for 6 months, that is a blink of an eye, and at my company my job has now completely changed. It's almost like a dream.

And that's just one inflection point. We've had several and there are many more on the horizon. So while I could be convinced that ROI is maybe not even positive today despite the ridiculous enterprise spend, it's perfectly rational to pave the way today for what's coming over the next few months let alone years down the line.

▲

righthand 2 hours ago | parent | prev [-]

It’s not supposed to be logical, it’s an LLM evangelism blog that rarely, if ever, has any critical analysis that isn’t pro-industry. Read any/all of the other posts and you won’t find much skepticism but you will find a lot of shilling how great it all is.

▲

aerhardt an hour ago | parent | next [-]

I like his other posts. He's bullish on AI, which is fine. I'd like to read a mix of bearish and bullish level-headed takes from people who are subject matter experts. His technical credentials are well past discussion - I love Django, and he comes across as a pretty upbeat but level-headed guy. Certainly beats radical takes in either direction from people who have no clue what they're talking about. It's just this article that I find rather confusing.

▲

simonw an hour ago | parent [-]

The thing that matters most to me is if reading what I wrote teaches you some new things and gives you something useful to think about.

If I make an argument and you disagree that's fine with me, provided I didn't use misinformation or sloppy thinking in making that argument.

	▲	aerhardt 19 minutes ago \| parent [-]
		That's how I feel about most of your writing. I click through most times when I see you either on the front page or in the comments, and I generally walk away feeling like I have food for thought, without necessarily buying everything wholesale. It's part of why I keep coming back. My root comment simply represented my two cents about the current post. I don't think anything about the post is outrageously incorrect or anything, just somewhat confusing. You're a very prolific contributor in this community and I don't think me or anyone else that welcomes your takes expects everything you write to rock our collective socks every single time, anyway.

▲

simonw an hour ago | parent | prev [-]

308 posts on AI ethics: https://simonwillison.net/tags/ai-ethics/

52 on AI misuse: https://simonwillison.net/tags/ai-misuse/

149 on the unsolved challenge of prompt injection: https://simonwillison.net/tags/prompt-injection/

40 on slop: https://simonwillison.net/tags/slop/

If you want an "LLM evangelism blog that rarely, if ever, has any critical analysis that isn’t pro-industry" there are plenty out there. I'm not one of them.

▲

alexchamberlain an hour ago | parent | next [-]

I think you should highlight your exemplary pre-AI writing too.

▲

csomar 37 minutes ago | parent | prev [-]

All of these are about AI misuse, not skepticism of AI. By skepticism I mean doubting whether AI actually delivers on its promises which, based on this last post, sounds like something you think we're already past.

Many people still think AI coding agents are slop on steroids despite all the current hype around AI actually shipping functional products.

	▲	simonw 32 minutes ago \| parent \| next [-]
		It's hard for me to write about skepticism that coding agents deliver on their promises when I've been using them daily and know, for an absolute fact, that they boost my own productivity. (And that's after taking into account the METR paper that says engineers over-estimate their productivity with these tools.) I have plenty of doubts about AI delivering on its promises outside of coding. I don't write about AGI because I think it's science-fiction hysteria. I write about slop precisely because it represents a mis-use of AI that demonstrates people completely misunderstanding what it's useful for.
	▲	aspenmartin 30 minutes ago \| parent \| prev [-]
		Love when people say "its promises". What specifically are you disappointed with? Simon's posts are high quality and evidence driven. AI has already delivered an incredible amount. Read Epoch for industry trends and analyses, METR to, everything points to a pretty consistent picture. "Many people still think AI coding agents are slop on steroids despite all the current hype around AI actually shipping functional products." Oh yes, tons and tons, especially on HN. But the plural of anecdote is not data. Enterprise spend speaks for itself. You are using AI-coded functional products all the time. Do you want like a diff history for the Google codebase or something?

▲ hintymad an hour ago | parent | prev | next [-]

The real timing is that we don't have strong enough new business needs for now and we have accumulated enough tech assets, so our work has been increasingly incremental. That means we can build reliable features on top of vast amount of past work - where AI really shines. So, with or without AI, companied would hire fewer software engineers if majority of our work is incremental: add a feature here, fix a bug there, tweak a configuration and etc, then we wouldn't need as many software engineers anyway. AI just accelerated such squeeze.

In contrast, imagine if we had the same AI 20 years or so ago. Could AI really write Jersey? I guess not as people were still trying to understand JAX-RS. Could AI really answer all the questions about React? I guess not as React was just invented. Would we use 10x fewer people to build out infra on the public cloud or the entire so-called Big Data platforms? I guess not, as they were still rapidly evolving and we'd need so many engineers to explore so many different possibilities? Could we use AI to build our ML ecosystem with 10X fewer people? I highly doubt so. Heck, 20 years ago R was all the rage and Python's ecosystem was not mature at all. Oh, and mobile computing, could AI lead to 10X fewer people to build all the mobile apps and the underlying infra?

▲ bambax 21 minutes ago | parent | prev | next [-]

> That’s $2,180.16 worth of tokens for $200

So the author claims he's getting $2000 per month worth of frontier AI free of charge. Ok. If he's been doing that for 6 months that's $12k. What has this produced concretely? For $12k you can find a used car in decent condition. Heck for $1200 (his actual out-of-pocket spend) you get a brand new ebike! (on which you could put a pelican and make a photo of both if that's your fancy). But here it's unclear what has come of it.

▲

simonw 18 minutes ago | parent | next [-]

I've written a great deal of code - code that would have taken me years of work to produce without LLMs.

(It's mostly open source, you're welcome to dig around in https://github.com/simonw and https://github.com/datasette if you like.)

My time as an experienced software engineer is worth a lot of money - a whole lot more than $12,000 for the past six months.

▲

ex-aws-dude 14 minutes ago | parent [-]

And what was your return on investment?

	▲	simonw 13 minutes ago \| parent [-]
		As I commented elsewhere, I'm still bad at making money from my open source work: https://news.ycombinator.com/item?id=48296794#48298909 (I have a feeling if I could say "and I closed $2m in sales with the software I wrote!" people would find a way to say that didn't mean anything anyway, because how can I prove I wouldn't have made those sales writing it by hand?)

▲

aspenmartin 17 minutes ago | parent | prev [-]

I would be very curious what kind of answer would satisfy you here. Simon isn't building a product, where $200 is a line item on a balance sheet. If he tells you what sort of analyses or time savings $200/mo on coding agents have enabled him, do you honestly think that would satisfy you?

▲ prepend 3 hours ago | parent | prev | next [-]

> $2,180.16 worth of tokens for $200

“Tokens” don’t have an intrisic cost or value. Saying that I used $2,180.16 worth of tokens is like relying on the salesperson to convince me I’m getting a billion dollars worth of pots and pans for $19.99.

I think it’s funny how we are throwing critical thinking out the window when it comes to evaluating biased sources of info.

▲

simonw 3 hours ago | parent | next [-]

I'm not sure what you're pushing back against here.

I spent $200. If I had been paying API pricing it would have been $2,180.16. The article is about how enterprise customers get charged API pricing, which means if I had been employed by one of those companies I would have cost them $2,180.16.

What am I missing?

▲

eqvinox 2 hours ago | parent | next [-]

Just because API pricing would've been $2180.16 doesn't mean that's the value of those tokens. For starters, you personally probably wouldn't have paid that. But also, sales price isn't value. This is like saying, oh, I saw this bar of gold somewhere for $10000 but got it here for $1000! So I got $10000 worth of gold for $1000! - no, the value of that gold is determined by its weight, which wasn't even mentioned.

We have no market convergence on tokens yet (and it'll differ between LLMs), so it's impossible to say what value you got for your $200.

	▲	aspenmartin an hour ago \| parent \| next [-]
		He's saying he's getting a great deal...a token from Opus on Claude code is the same as a token from Opus on the API. I remain as confused as Simon. He's not talking about "here's the ROI I got from my $100 subscription" it's "here's how much I saved from getting the monthly subscription instead of sending things through an API".
	▲	remus an hour ago \| parent \| prev [-]
		> Just because API pricing would've been $2180.16 doesn't mean that's the value of those tokens. You seem to be suggesting the price of tokens is entirely disconnected to the cost of providing the service? I don't see much basis for that assumption.

▲

OrangeDelonge 2 hours ago | parent | prev | next [-]

Large enterprises make deals and won’t be paying 2,180.16$ either. Just like with AWS

▲

simonw 2 hours ago | parent | next [-]

That doesn't seem to be the case. From what I've seen enterprise deals get API pricing now. Have you seen evidence that's not true?

▲

roomey 2 hours ago | parent | next [-]

Hi Simon, nice article. The parent there may be making the same assumption I am, that large enterprise _never_ pays sticker price.

Also, to just color in the picture here, as I haven't seen it mentioned elsewhere, there is a very large Saas company at the moment who has given everyone unlimited tokens on Claude. And they have a dashboard showing who spends the most. So the "budget" went from about USD500 per per person (split between Claude and cursor) in Jan to... Well a soft limit of USD100k... Per month... Per person.

People can still see the top line sticker price on their spend, but honestly I can't believe that the Saas is paying that full price when the invoice comes in.

That said, there are some finance reports which are probably dropping soon where we will find out!

▲

simonw 2 hours ago | parent [-]

> The parent there may be making the same assumption I am, that large enterprise _never_ pays sticker price.

I shared that assumption until yesterday, when I found out that it wasn't holding for LLM pricing from OpenAI and Anthropic. That's what inspired me to write this piece.

I think those token leaderboards are an obviously terrible idea and will go extinct very quickly now that people are paying attention to costs.

	▲	wongarsu an hour ago \| parent \| next [-]
		But the feature list at https://claude.com/pricing#team-&-enterprise literally lists "tiered incentives on committed spend" and "non-standard terms" as perks of the sales-assisted Enterprise plan. Maybe "non-standard terms" could mean "we dance for you if you pay", but what would "tiered incentives on committed spend" mean besides "we can negotiate on price if you bring the volume"
	▲	mvanbaak an hour ago \| parent \| prev [-]
		large enterprises dont pay openai or anthropic, they get this thing called copilot and get a nice price there. At least on this side of the pond (eu)

▲

themgt 2 hours ago | parent | prev [-]

I do know of moderate-size companies deploying OSS LLMs on their own GPU clusters, for ownership/security/maybe cost reasons. I'm somewhat surprised F500 companies are apparently just handing over all their data to the model providers.

Could be fantastic for small shops while it lasts. The big guys have to pay 10x for precious tokens.

▲

waisbrot 2 hours ago | parent | prev | next [-]

And "large" just means that AWS will assign an account manager to talk with you. I was at a start-up who spent $300k/year on AWS and that was enough to get special attention and discounts. Enterprise pricing is confusing.

▲

apsurd 2 hours ago | parent | prev | next [-]

The point is that those a real prices real people are paying for real API usage. it's not made up.

your point is large players won't pay those prices at massive volume. ok

▲

Anon1096 2 hours ago | parent | prev [-]

Claude is so in demand at the moment that there aren't really volume discounts. Anthropic sets the terms and you either accept them or get lost they have that much of a lead (mindshare/desirability wise).

▲

altruios 2 hours ago | parent | prev | next [-]

> If I had been paying API pricing it would have been $2,180.16

The point being made above is that API pricing is calculated... somehow... seemingly arbitrarily. Possibly untethered to the infrastructure costs entirely: which would be the basis of any 'value', however that holds the labor theory of value, which isn't accurate either. So how do you accurately price these tokens at all (other than through price-discovery: which is slow, messy and fuzzy)?

	▲	NitpickLawyer 2 hours ago \| parent [-]
		> So how do you accurately price these tokens at all Like anything else in the economy: at the point where enough customers can pay you, and not enough will go to the cheaper competition.

▲

pembrook 2 hours ago | parent | prev | next [-]

API pricing drops DRAMATICALLY in enterprise agreements.

As with pretty much anything priced on volume/usage.

Enterprise deals are negotiated ad-hoc, the listed pricing is simply a jumping off point for the final negotiated discount.

If you’re going to give 20,000 employees Claude code you are not going to be spending $1B per year on Anthropic tokens as if you gave everyone an individual API key. Just as Anthropic isn’t paying AWS SES $10,000,000 to send 1 email update to their massive user base when the next Claude version drops.

	▲	taude 2 hours ago \| parent \| next [-]
		This isn't true at the moment, though. So far there hasn't been the negotiating power. What happens is you end up capping usage for employees at a fixed amount. I think eventually, prices will come down and there will be discounts, but for enterprise accounts at least of our size (<5000), we're paying almost 100% retail, which kind of sucks, because it's expensive, and pretty easy to burn $50 to $100+ in a day, if you're not careful. In fact we got pushed off the former plan to the token-utility one at the last contract negotiation. Going to be interesting to determing the metrics we give to engineers for determining whether the spend on this is worth it. Measuring PRs, lines of code committed, commits fully generated by agentic workflows, etc.....
	▲	simonw 2 hours ago \| parent \| prev [-]
		> API pricing drops DRAMATICALLY in enterprise agreements Do you have any numbers or reports to back that up?

▲

xnorswap 2 hours ago | parent | prev [-]

Have you or I misunderstood the "teams" plan?

edit: I missed the "enterprise" feature matrix with the usual audit/compliance stuff to force the biggest enterprise customers onto enterprise plans. Otherwise the "teams" plan is much better value for any business.

orig-continued:

https://claude.com/pricing/team

Teams premium is "Everything in standard, plus more usage*"

And from my experience, it's a very generous usage, I've only hit the limits once or twice, and both times required multi-boxing agents.

I could single-window agentic development all day on opus-4.7 auto-mode without hitting limits.

If you're a business using claude, then that seems like the right plan, the enteprise/API plan seems more suited to where your product is built on top of the agent themselves, so seats/limits aren't really meaningful?

	▲	nr378 an hour ago \| parent [-]
		Claude Teams and Claude Enterprise are 2 distinct plans. Simon is right that Enterprise seats have no included usage (and so all usage is charged at API billing rates), whereas Teams seats do.

▲

troyastorino 3 hours ago | parent | prev | next [-]

Tokens do have a clearly calculable intrinsic cost. There's the marginal cost of production (i.e. the inference cost) and the amortized R&D cost that goes into the model producing them.

Yes, value is hard to calculate, but luckily market pricing mechanisms exist exactly for this purpose. There isn't a better number to use than what people are willing to pay for them.

So he's saying that on an enterprise plan, he'd be spending $2,180.16. He's not paying that much, but enterprises are.

▲

john_strinlai 2 hours ago | parent | prev | next [-]

a little critical thinking led me to read that sentence as $2180 worth of tokens [at current api pricing]

▲

jfrbfbreudh 2 hours ago | parent | prev | next [-]

Lol. They obviously have intrinsic cost, the floor being the cost of electricity. It’s hilarious how we are throwing critical thinking out the window when it comes to evaluating biased sources of info.

▲

dnnddidiej an hour ago | parent | prev | next [-]

His point is more he was surprised enterprises weren't getting the discount. And so indeed maybe it is not a giant ponzi after all! (Could be a bubble)

▲

FergusArgyll 2 hours ago | parent | prev [-]

I think it's funnier that you can believe some things have an intrinsic cost and others don't

▲ binary0010 3 hours ago | parent | prev | next [-]

So how do openai and anthropic plan to keep customers when GLM-5.1 is just as good and open source and a lot cheaper?

I don't see the business model working. My closest friend actually does automation software for large companies.

He does not use Claude or openai at all. He primarily uses gpt 120b on cerebras and glm-5.1 for heavy thinking work. And some other small models for various tasks. All open source.

And these systems are extremely useful for the businesses and are able to run fully automated pipelines that are very stable and fast.

We discuss this a lot, and we both think any business doing heavy agentic work on Claude and openai just aren't aware of exactly how good and cheap open source has gotten on the last year.

So... once the legacy businesses and developers catch up, won't Claude and openai be unable to recoup their costs?

▲

doug_durham 14 minutes ago | parent | next [-]

GLM-5.1 isn't just as good. It is no match for Opus running in Claude Code. Please try it yourself. Open source models are about a year behind at least.

▲

peder 2 hours ago | parent | prev | next [-]

> I don't see the business model working.

Same. It's a nightmare from a Porter's Five Forces perspective.

There will be a ton of businesses competing in this space, and there will be something of a moat due to how capital intensive the business can be, but there will still basically be infinite competitors.

Great for consumers.

	▲	ex-aws-dude 13 minutes ago \| parent [-]
		Well in reality AWS will just host one of them and most companies will use that Like how snapchat kind of fell off because the feature could just be a subset of instagram It seems like it would just become a commodity like EC2

▲

smokel 2 hours ago | parent | prev | next [-]

For coding assistance, I have tried OpenCode with several large open models through OpenRouter. All were fairly bad compared to Claude Opus. Could you provide some hints on how I should be holding these open models so that I might get more value out of them?

I agree with the common trope that open models lag behind by about a year, but something magical happened just around a year ago when the state of the art models became extremely useful. By this reasoning we're about to see open models perform well, but I'm afraid there is more to it than just waiting for another revolution around the sun.

Note, my application is coding assistance. Open models can be great for other purposes.

▲

tariky an hour ago | parent | next [-]

I tried almost all OS models on opencode, none of them is on levels as opus 4.7.

In latest experiment I used opus for implementation plan then used cursor composer 2.5 for execution.

I must say that combo is really good. Main drawback of claude code is that is super slow. So when paired with composer that is super fast it flies.

	▲	cainxinth 42 minutes ago \| parent [-]
		No one is claiming that OS is as good. They are saying it isn't that far behind SOTA commercial products. So why pay exorbitantly just to get something only a few percent better than the free option? But there have been very good open source office apps for decades and few enterprises use them, so perhaps this is just the nature of B2B purchasing committees and 'nobody getting fired for buying IBM.'

▲

slopinthebag an hour ago | parent | prev [-]

Do more planning yourself, be smart about the context, break down tasks into smaller components, give it more guidance. You can't just lazily prompt it to complete large features autonomously and expect good results.

▲

eikenberry 28 minutes ago | parent | next [-]

+1 .. just wanted to reiterate that this is the answer. The open models work great if you just do a little more of the design/architectural work up front and organize your work appropriately.

▲

amilios 40 minutes ago | parent | prev [-]

But if the closed-source models can do this without the additional effort, that's a significant gap, no?

▲

flexagoon 2 minutes ago | parent | next [-]

Is it really when they are hundreds of times more expensive?

▲

10000truths 27 minutes ago | parent | prev | next [-]

The point is that the price gap is so much larger than the capability gap, that even with the extra compute needed to make up for the lack of capability, you can still come out ahead in terms of amortized $/token.

▲

eikenberry 23 minutes ago | parent | prev | next [-]

That is the 3-6 month sota-open gap people talk about, a time-window that continues to move as new models are released on both sides.

▲

bigfishrunning 25 minutes ago | parent | prev [-]

See that's the thing, they can't. Every model needs hand holding and guidance.

	▲	amilios 10 minutes ago \| parent [-]
		some require less hand-holding than others though

▲

mesmertech 3 hours ago | parent | prev | next [-]

For coding you always want to go with the best model in the category, not something that would be the best model if we went 1 year back which GLM 5.1 is, and I'm saying that as a big fan of GLM cause I run a translation site where GLM is good enough for the price.

Most of the money right now is in coding. Openai and Anthropic just have to be 6 months ahead of SOTA open source models and they'll capture most of the enterprise and dev market

▲

binary0010 3 hours ago | parent | next [-]

Yes I'm an engineer (20 years most in games/graphics industry) and only use it for code. I've been using glm 5.1 this week a lot. I went in expecting another "decent" but not really "up to standard" open source model.

I highly doubt I'll ever use Claude again.

I think you are wrong about Claude being any significant level better

	▲	cassianoleal 2 hours ago \| parent [-]
		I've been mostly coding with GLM-5.1 as well and I agree with you. DeepSeek V4 Flash is another very good surprise. Incredibly cheap, fast and effective.

▲

eikenberry 15 minutes ago | parent | prev | next [-]

> For coding you always want to go with the best model in the category [..]

And this is why many companies go out of business. You always want the best bang for your buck, sometimes this is the "best model" and sometimes it is not.

▲

odie5533 31 minutes ago | parent | prev | next [-]

If I generate code with Claude, ChatGPT, and GLM 5.1, I can't say which model is which reliably. I exclusively use Claude more out of superstition than reason.

▲

kgwgk 3 hours ago | parent | prev | next [-]

For coding like for everything else in life cost is a factor.

▲

mesmertech 2 hours ago | parent [-]

Cost for the value delivered. Like if you offered the current SOTA open source models at $0.1/M, I still think I'd be using Opus or 5.5 at $30/M. Or say GPT 5 which was released Aug 25, I don't think I'd use it for coding for even $0.1. I'd def find other uses for it(translations, agentic workflows, prompt guards etc), but for coding I don't think I'd ever completely switch to a SOTA open model

Unless ofc there was an actual speed difference, only reason I'd be willing to go with a worse model couple of percent worse than current best model is if the speed was at least 5x higher. Looking forward to kimi k2.6 offered publicly by Cerebras

	▲	kgwgk 2 hours ago \| parent [-]
		> I still think I'd be using That's fine. Other people may not want to pay 300x more and will rather make do with last year's SOTA. > For coding you always want to go with the best model Maybe you meant "For coding I always want to go with the best model"?

▲

Andrex an hour ago | parent | prev | next [-]

> For coding you always want to go with the best model in the category

Will this always be true? There will never be an event horizon/point of diminishing returns where something not-bleeding-edge is "good enough" for 51%+ of users?

▲

blackjack_ an hour ago | parent | prev | next [-]

This is a silly take. There is a line of "good enough" for most coding (most CRUD apps and APIs are nothing special), and once we are past that, nobody will care about having the "newest, best" model except extreme outliers. And this base "good enough" model will become an ultra cheap commodity as we already see with GLM, deepseek, etc.

▲

dogleash an hour ago | parent | prev | next [-]

> For XXX you always want to go with XXX, not XXX

Oh, hey, I recognize you. Thank you for the very forward and thorough orbital sander recommendation at Home Depot. That's exactly what I wanted to deal with on my holiday weekend. You just know so much about this and the rest of us are simple passersbys.

▲

EGreg 3 hours ago | parent | prev [-]

Most work is not coding.

And also, people have it wrong… their models are not the main problem anymore. It’s the RAG

▲

tomrod an hour ago | parent | next [-]

Would love to hear more about your thought about the RAG.

	▲	simonw an hour ago \| parent [-]
		I think RAG is a mostly outdated concept now, it's been subsumed by the idea of a "agent harness" which is exactly what Claude Code and Claude Cowork and OpenAI Codex and Claude.ai and ChatGPT themselves have now become. An agent harness with access to a good search tool is a much more interesting thing than 2024-era RAG systems.

▲

obsidianbases1 3 hours ago | parent | prev [-]

Depending on RAG is a workflow problem, not an AI problem

▲

IAmGraydon 7 minutes ago | parent | prev | next [-]

The only way I see it working out for them is if some legislation is passed that eliminates the competition by making it illegal to run local models. They could claim that the models are dangerous and could be weaponized without oversight, or something along those lines.

▲

csomar 21 minutes ago | parent | prev [-]

They are both (and also spacex) sprinting for IPOs. They know that the opportunity window is closing fast and that advancement in model quality has largely plateaued in the last year. Take as much investor money as you can get away with for now.

▲ antman 2 hours ago | parent | prev | next [-]

The costs are exorbitant and most software is not produced by companies with such a huge moat. Anthropic made a profit through their recent bait amd switch pricing. There is zero useful insights online to indicate whether this might die due to commoditisation with good enough open models or fail the race to get more people subsidising unsustainable growth with other people’s money. Who knows? In any case they dont seem to be able to drop usage costs so the business model seems based on wishes

▲

j_w 2 hours ago | parent | next [-]

Continuing with your skepticism:

> Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff

> Enterprise customers are now paying API prices

How long before enterprise customers start to question the bill? Anthropic goes from not making money to doing pricing shakeup, and now they are making money and the biggest spenders are shocked at prices.

Seems like things are still very uncertain.

▲

brokencode 2 hours ago | parent | prev [-]

Usage costs will come down with better hardware. Hardware is improving rapidly each generation.

▲

eikenberry 11 minutes ago | parent | next [-]

Costs will plummet as better hardware becomes available and priced reasonable so that people can more easily run their own open models locally. But that won't help Antropic/OpenAI make more money, quite the opposite.

▲

simonw 2 hours ago | parent | prev [-]

That trend held true for the past three years, but it doesn't feel as safe to me now.

But memory costs are going way up. And both OpenAI and Anthropic bumped up the price of their frontier models in April.

	▲	brokencode an hour ago \| parent \| next [-]
		Yeah, it’s called supply and demand. Demand for memory went way up suddenly. Now supply is going up rapidly as companies try to cash in on that demand. Supply will eventually catch up with demand. Then the prices will come back down.
	▲	StrauXX 2 hours ago \| parent \| prev [-]
		Algorithms are also improving. I believe it's very unlikely for these two improvements together to not result in one to two orders of magnitude cheaper cost per "intelligence". Of course, that might just make use cases that are too expensive today viable and thereby increase usage further.

▲ realo 3 hours ago | parent | prev | next [-]

200$ per month per seat is nothing .

A single 3D CAD license pack for the guys in our R&D group costs multiple thousands of dollars per seat, per month.

It's about time software seats get some love too.

▲

smokel 2 hours ago | parent | next [-]

AutoCAD is $175 per user per month [1].

[1] https://www.autodesk.com/products/autocad/buy

▲

bigbuppo 2 hours ago | parent | next [-]

AutoCAD is still the budget-friendly CAD program it has always been. You don't build big boats in AutoCAD.

	▲	rrr_oh_man an hour ago \| parent \| next [-]
		Winch Design [0], which have built some of the world's largest superyachts [1], seem to be using AutoCad. [2] Afaik it's also the same with Lürssen (but don't quote me on that) [0] https://winchdesign.com/ [1] https://www.superyachts.com/directory/1516/winch-design/flee... [2] https://www.autodesk.com/design-make/articles/naval-architec...
	▲	so_it_be an hour ago \| parent \| prev \| next [-]
		Except LLM's even with Vision are still useless at AutoCAD let alone Revit (please dont quote SCAD LLM's at me, useless). Knowledge based approaches still win. I might agree "AutoCAD" is the current level LLM's are at, but wait until your design departments discovers "Revit", its another ballpark (in wasted cots, engineers on site still get "clashes"). Revit costs are high, and the end results are marginally better - but local LLM's tokens are cheaper 24/7 at "AutoCAD" level - "Revit" level tokens will make Ubers CTO/COO weep harder than they already do. While producing results no better than "Revit" does (engineers still face "clashes").
	▲	Our_Benefactors an hour ago \| parent \| prev [-]
		As someone completely outside the 3D design world who always thought of AutoCAD as the gold standard - really? What program would be used instead? Please enlighten me.

▲

Hasz an hour ago | parent | prev [-]

Cadence and Ansys have entered the chat. A bunch of other highly-specialized engineering software has entered the chat. Licenses are on the order of 10-100k/seat.

For a pretty funny comment about pricing.

https://www.reddit.com/r/chipdesign/comments/1ajrli2/cadence...

▲

chatmasta 2 hours ago | parent | prev | next [-]

Yeah, it’s nothing, and it’s also not the cost that enterprises are paying. As the article states, the price is $20 per seat per month, PLUS per-token API usage. Enterprises are paying consumption billing, not fixed rate oversubscribed “all you can eat per seat.”

▲

avree 2 hours ago | parent | prev | next [-]

CATIA licenses which are the most expensive I've seen are roughly $600/month per user. Where are you seeing "thousands of dollars per seat"?

	▲	mountainriver an hour ago \| parent \| next [-]
		CATIA with plugins can go up to 100k a year. That’s what we currently pay
	▲	AlotOfReading 2 hours ago \| parent \| prev [-]
		CFD might reasonably be considered part of CAD and something like ansys costs about as much as catia. Still only doubles it though.

▲

dnnddidiej an hour ago | parent | prev | next [-]

Sure. Is CAD going to be used by every working human?

▲

krupan an hour ago | parent | prev | next [-]

But when previously your software developer tools were free, that's a huge increase

▲

esafak 2 hours ago | parent | prev [-]

How many guys is that? Every single white collar worker is in the AI ICP (customer profile).

edit: typo

▲

smt88 2 hours ago | parent [-]

white collar*, not color

What does ICP mean?

▲

simonw 2 hours ago | parent [-]

Insane Clown Posse, though given the context here probably Ideal Customer Profile.

▲

everdrive an hour ago | parent [-]

The similarities are quite stunning, though, as I'm sure both sets of ICPs have no idea how LLMs work.

	▲	KyleTheDev an hour ago \| parent [-]
		Now hold on there, let's not cast doubt on ICP. I'm sure they'll surprise us, as they always have.

▲ darth_avocado 2 hours ago | parent | prev | next [-]

How is the lack of bad news declaring a victory for AI? I am yet to see any company concretely publish analysis about the ROI from AI. Most companies as far as I know are still treating AI investment as sunk cost with no expectation of returns at the moment. We could very well see a world where companies heavily scale back investment.

▲ sourcecodeplz 3 hours ago | parent | prev | next [-]

With deepseek and xiaomi mimo models slashing their prices 99%, I don't see a great future for openai / antrhopic with regards to their 1T valuations. Maybe 1T valuation will be the whole market, West + East.

	▲	skeledrew 2 hours ago \| parent [-]
		They'll still have their dedicated enterprise customers. I think the Chinese providers will pull more of the single users who're paying their own way, than those backed by company budget. And it's a pretty good split as the demand becomes better distributed, resulting in better service (I'll never forgot must how bad access to Claude became until they got access to Colossus) and less potential for lock-in (we really don't want there to be a duopoly, etc on good AI).

▲ CachedaCodes 3 hours ago | parent | prev | next [-]

Ai has become indispensable but maybe not at all cost. My company just had a company-wide meeting to talk about how they're restricting who can use which models and instructing us the "be more responsible with company's tokens". And it's not an small company by any means.

▲ mtrifonov an hour ago | parent | prev | next [-]

They certainly have, but it relies entirely on the assistant frame, which is a problem in and of itself for the trillion-dollar economics.

Anthropic and OpenAI have shown people want a tool for task offloading, driving predictable token consumption and justifying the math, so long as users stay in that dynamic.

However, knowledge workers using these tools daily are getting exhausted with them. Outputs come out polished but hollow. Talking to a frictionless, frame-completing model all day drains you.

If user behavior drifts away from assistant usage because of that, per-token math implodes. The valuations we're hearing about all the time rely on usage compounding daily. The fatigue is a timer running against that compound.

Anthropic's Constitution is the closest hedge out there, I think. Installing an identity structure into the model through training. But it's still assistant-first, so the fix there is only partial.

I've spent the last year running a product that flips the architecture so identity is primary and the assistant role is secondary. Same frontier models, completely different conversational quality. The fatigue property doesn't really show up.

Whichever labs figure out how to install real identity natively in the weights are going to be the ones with PMF in the next phase.

▲ cj an hour ago | parent | prev | next [-]

> Coding agents really did change everything. These are tools which burn vastly more tokens

The assumption here is that this is a positive thing.

But this very well could end up being a major negative long term by increasing the cost per user, reducing margins.

More usage = more cost = less profit.

It's not obvious that more usage is good. It's only good if revenue per user increases more than cost does. I'm skeptical about that.

▲

simonw an hour ago | parent [-]

> It's only good if revenue per user increases more than cost does.

That's why it's so important for these labs that they're selling API tokens for more than the compute+energy costs needed to generate them.

Every indicator I've seen is that they do have a positive margin on that. If they don't, they're screwed.

▲

mattas 44 minutes ago | parent [-]

What's an example of an indicator? Genuinely curious!

	▲	simonw 40 minutes ago \| parent [-]
		Insider tips from Google and AWS telling me that they run inference at a profit (though that was over a year ago now). Dario telling Dwarkesh three months ago that they have a margin on inference: https://www.dwarkesh.com/p/dario-amodei-2?timestamp=3528.0

▲ smokel 2 hours ago | parent | prev | next [-]

Does this analysis factor in potential caching of tokens on the server side? It seems that if they organize things well (as a model provider), they can save quite a lot on that. Looking at my Cursor statistics makes it clear that the token calculations are not at all trivial.

	▲	simonw 2 hours ago \| parent [-]
		I believe the ccusage tool I used takes cached token pricing into account.

▲ asim 2 hours ago | parent | prev | next [-]

Love how everyone boasted about replacing all the software with ChatGPT and then we end up with coding agents meaning the software engineer are STILL important. The sell is the development tool. It's classic cloud. Where did all the ops people go, many got subsumed by the cloud companies YET every company still has DevOps people to manage cloud infrastructure. The layer of abstraction went up but we still need the people to write the glue code and understand the business. OK great there's a new cash printer in the room. There's a new tool. Let's just start to ground the tooling in its new found gravity, profitability and IPO market dynamics... Reality has set in. The hype cycle is about to explode... Do you remember ride hailing and just how much cash was burned on credits pre Uber IPO. Then remember the IPO itself? These companies are not the new Google. They are a layer on top. Google was still the most efficient cash printing machine in history beyond the the US government and might still be. Will be interesting to see what the trillion dollar IPOs turn into. I'm going to say we see those prices get cut to a third in less than 5 years and scale back up over the next 15-20 years.

	▲	thewebguyd 2 hours ago \| parent [-]
		> The sell is the development tool. I've been calling that out for a couple years now. LLMs best and most viable use case is still just as a dev tool. Even for non-programming tasks, I still get better results from the LLM if I instruct it to write code to do the task...look at Claude Cowork for example, it's everything I used to do with python myself. It's not really a novel capability, it's just using python & bash for automations that any sysadmin has been doing for decades. Yeah, that's valuable for a non-techincal audience but is it $1T valuable? I don't think so. When has an IDE or other dev tool ever commanded a $1T valuation? These things get lost in discussions because people conflate "overvalued" with "not useful." LLMs are useful, particularly as dev tool, but Anthropic & OpenAI are definitely way overvalued.

▲ Szpadel 43 minutes ago | parent | prev | next [-]

> but as far as I can tell those credit costs are an exact match for the API token costs listed for those models.

it is only true for USD. for example if you pay in euro, this is actually more expensive. kind of makes no sense, because it translates to $1 = €1

▲ dnnddidiej an hour ago | parent | prev | next [-]

Is PMF enough. It is such a dynamic self-disrupting wave that it is like predicting physical chaos. These aren't early Googles in a blue ocean. Maybe a blue ocean full of pirates and dragons!

This isn't me being a doomer I just don't know. Can we look at Q2 profits and draw hockey sticks yet?

Remember people are boasting how much their expenses are. That is where we are in the bubble/new paradigm.

▲ firesteelrain 2 hours ago | parent | prev | next [-]

Anyone actually making money paying all of these monthly fees? Or just hobbyists? I have yet to see any real ROI posted anywhere.

	▲	rvz 30 minutes ago \| parent [-]
		This is the same question I said about people running OpenClaw. You don't hear about anymore. Other than the hosting providers, I am also yet to see anyone directly making money from their OpenClaw agent.

▲ rubiquity 2 hours ago | parent | prev | next [-]

I think it's fair to say they had achieved product-market fit when their revenues were growing deep triple digits month over month. What we're seeing now is that perhaps they have achieved profitability or at the least a more sustainable balance sheet.

▲ osigurdson 2 hours ago | parent | prev | next [-]

Realistically, OpenAI found product market fit with the OpenAI API playground in 2021. People were using that as ChatGPT at the time.

▲ hansmayer 2 hours ago | parent | prev | next [-]

> I currently subscribe to the $100/month Max plan from Anthropic and the $100/month Pro plan from OpenAI. If you are a heavy user of coding agents these plans are a fantastic deal.

Agreed. But its only a great deal because it is heavily subsidized, as you said yourself. Enjoy while it lasts, but in my book, product-market fit means something along the lines of "product which enjoys a loyal customer base, sold at a price perceived fair by the customers, and generating profit. How many of these does your definition of product-market fit hit here?

▲ smallerfish 2 hours ago | parent | prev | next [-]

I think the reasons for them going with API pricing will become abundantly clear when the S-1s become available. If they don't have a story covering how they can get revenue closer to expenses, then they're relying on the market to believe the pixie dust version of their profitability story, which I think people increasingly don't.

▲ NortySpock 2 hours ago | parent | prev | next [-]

"[would have spent] $1,199 with Anthropic, $980 with OpenAI"

How many tokens is that, input/output-wise?

(a) I'm curious if you feel like you got $2000 worth of value out of them in the last month?

(b) I'm also curious if you would have gotten similar quality out of a slightly lower-cost provider of an open-weight model? (e.g. Kimi K2.6 and DeepSeek v4 Pro) and what the spend would have been for that.

I myself have managed to spend not quite $4 on OpenRouter and have felt it was very worth it; I just have much smaller, or more targeted requests I guess. (Lately, adding features to a static site generator in Python, or setting up log forwarding via a docker compose file)

▲ simonw 2 hours ago | parent | next [-]

Claude Code:

  Input tokens:        52,545,485
  Output tokens:        5,767,253
  Cache create tokens:  5,112,029
  Cache read tokens: 1,475,069,465
  Total tokens:      1,538,494,232
  Total cost:        $1,199.79

OpenAI Codex:

  Input tokens:          52,598,013
  Output tokens:          4,681,867
  Reasoning output:       2,091,063
  Cached input tokens: 1,153,844,864
  Total tokens:        1,211,124,744
  Total cost:          $980.37

I'm confident I got value out of OpenAI - I've been mainly on Codex for the last few weeks.

Not so sure I got that value from Claude, just because I've been using it a lot less and somehow the price came to about the same as OpenAI.

Given the code I've been able to build in the past month I genuinely do think I got value for the API price version, and (don't tell OpenAI or Anthropic) I think I'd have paid full price.

I've not spent nearly enough time with GLM-5.1 and co to compare, but I do know that the prompts I'm using with the agents are not prompts I would have expected to work just three months ago.

▲

krupan an hour ago | parent | next [-]

Are you saying that the software you wrote using those tools generated enough revenue to cover the $2000?

	▲	simonw an hour ago \| parent [-]
		Not yet, but that's because it was almost all open source and I'm really bad at generating revenue from that. When I account for the amount of time it saved me there's no question $2,000 was worth it.

▲

NortySpock 2 hours ago | parent | prev [-]

Cool! Thanks for the details, and your blog posts are usually interesting food for thought, so thank you for them too!

▲ regularfry 2 hours ago | parent | prev [-]

If it were me I'd be asking "How long would it have taken me to do that, and what's the rate I'd have been charging for the work I would have been doing otherwise?"

Personally, I've probably spent $60 or so on OpenRouter in the last month or so and got a working project out of it that it would probably have taken me a fortnight to knock together (which is inevitably an under-estimate because it covered things I'd have to learn but K2.5/6 already knew). There's an orders-of-magnitude gap there.

▲ atleastoptimal 30 minutes ago | parent | prev | next [-]

I think this was obvious since the birth of ChatGPT

Intelligence is a universal good, it can apply to anything, and no, "human intelligence" is not the only form that is useful nor special. There are limitations to AI but also huge advantages, and its obvious that the advantages are worth paying for, given their revenue.

▲ mbesto an hour ago | parent | prev | next [-]

> but I suspect there’s a more important factor here: I think they’ve finally found product-market fit

Ahhh the classic startup term that's definition is nebulous. But also, since when does any definition of product/market fit mean a product is profitable? And profitable in what sense? Unit economics? Overall company?

	▲	simonw an hour ago \| parent [-]
		Oh I'm absolutely taking advantage of the fact that "product-market fit" has a bit of a nebulous meaning here. It's a great hook to build an article around. My core point is more that April 2026 was the point when Anthropic and OpenAI finally appeared to have figured out a credible business model.

▲ Hasz an hour ago | parent | prev | next [-]

Mentioned in the article, but it cracks me up that both openai and anthropic are utilizing fairly traditional enterprise GTM plans segmented by verticals.

So many startups trying to automate sales, but somehow the two biggest frontier labs have decided that the best GTM strategy is firmly human-in-the-loop.

▲ Havoc 2 hours ago | parent | prev | next [-]

What baffles me is the range of estimates.

Operating profit is both post depreciation and fees paid to third parties for hire. So aside from shenanigans like RSUs and financing interest that's already somewhat close to actual economics.

Meanwhile we've got commenters here talking of 5-10 trillion with a T revenue shortfall.

Those are very different takes on reality

▲ spprashant 3 hours ago | parent | prev | next [-]

So it largely sounds like many more people will be able to write software - and will use AI to do it. Existing software engineers will continue to automate their tasks away like they always did, but perhaps at a faster rate.

The impact of AI in other fields seems to be muted.

▲

simonw 3 hours ago | parent | next [-]

I think it is applicable to a much wider range of knowledge work, but it's also harder to apply there.

Software development has the huge advantage that mistakes and hallucinations are very easy to spot: the software works or it doesn't.

Spotting errors in a research report or legal brief is a whole lot harder!

But... non-software professionals spend a huge amount of their time on tasks that can be safely automated - reformatting documents, extracting numbers from PDFs, all kinds of flavor of data entry.

Learning how to use a tool like Claude Cowork can take a big dent out of those.

▲

slopinthebag an hour ago | parent [-]

> Software development has the huge advantage that mistakes and hallucinations are very easy to spot: the software works or it doesn't.

Do we not care about code quality, maintainability, performance, extensibility, or understandability anymore? Honest question, not a gotcha, it's just previously getting software to pass all the tests was a small part of what we would consider "working" or perhaps "good" software. Maybe that's different now with LLMs, idk. Maybe we need automated checks for these things as well, like not compiling until the code quality is good enough to let the agent finish it's loop.

	▲	simonw an hour ago \| parent [-]
		> Do we not care about code quality, maintainability, performance, extensibility, or understandability anymore? Yes, we should care. I've been writing a whole book about that: https://simonwillison.net/guides/agentic-engineering-pattern...

▲

pianopatrick 2 hours ago | parent | prev [-]

If the AI can write code for robots the impact in other fields may be pretty large. Seems to me a lot of jobs can be automated with software and robots combined. The limit in the past was writing the software to get the robots to work. But if AI can remove that limit...

▲ mesmertech 3 hours ago | parent | prev | next [-]

If nothing else this blog did give me the idea that I should split my $200 claude max plan into two $100 CC max and $100 codex plan, esp because Claude is now offering 1.5x weekly limits so its the 5x usage is now more like 7.5x usage.

	▲	Havoc 2 hours ago \| parent [-]
		>I should split my $200 claude max plan into two $100 CC max and $100 codex plan You may want to get one of them to check the math on that :p

▲ x187463 3 hours ago | parent | prev | next [-]

I wonder how a focus on per-token API profits will impact the incentives to improve token efficiency and drive down costs through optimized compute. I suppose as long as a few leading labs are competing, we'll see progress in this regard, but it's certainly less in their interest than it is with a flat subscription pricing model.

▲ pzo an hour ago | parent | prev | next [-]

> If you are a heavy user of coding agents these plans are a fantastic deal. I just ran the ccusage tool on my laptop to get an estimate of how much I would have spent if I were to pay for API tokens in the past 30 days and got

You think this is fantastic deal only because they use similar like tricks where they inflate the price and tell you something supposed to cost $1000 but they have this today promo for $100.

I was there too and paying for a while. Few weeks ago I tried DeepSeek V4 Pro - expected its gonna be shit but its actually pretty good.

The deal is I pay daily ~$1 for DSV4-pro for ~100M API token usage. And they probably not getting broke because >90% of those token in practice is cache read and they very well optimized for that.

	▲	sourcecodeplz 17 minutes ago \| parent [-]
		Yep, exactly this. And I have so much less anxiety that I have to use my 5-hour/weekly usage or I lose it... with deepseek api the credits never expire, I can use them when I want, how much I want and the prices are ridiculously low for the quality/intelligence/performance.

▲ _verandaguy an hour ago | parent | prev | next [-]

With respect to Simon, whose writing I've usually agreed with in the past and whose insights I've liked: this is a bad take that overlooks the extent to which corporations are imposing the use of AI on employees, and in particular ICs, who make up a majority of the AI-using workforce by headcount.

Many of us are either openly having our performance reviews tied to AI use, especially at larger enterprises. Whether that's measured by sheer token count or just "how many of your tasks are you using AI for these days" (combined with the implication that question carries at many orgs which are heavily invested in AI).

▲

simonw an hour ago | parent [-]

Are you saying that Anthropic's huge leaps in revenue are caused by stupid company policies and token leaderboards, and the moment companies stop imposing AI on their employees revenue will drop to a point where Anthropic are unlikely to be profitable?

I don't think that's the case. I think the token leaderboard thing (which is clearly ridiculous) affects a tiny portion of companies and is already going out of fashion.

	▲	_verandaguy an hour ago \| parent [-]
		I'm saying that the truth lies somewhere in between, and that Anthropic's current revenue is being, in part, propped up artificially. We're also in a place where a lot of the usage guidance around these tools is still nascent. People are cowboying a lot of stuff, even as larger companies start to organize AI policy/safety/responsible use working groups to try and policy around the shortfalls of the technology. IMO: if this technology persists, and if we figure out a way to use it in a broadly safe way, the value proposition will probably trend down rather than up, at least on the code generation front. As a research tool, it shows some promise, though I still find the ethics of the technology disgusting.

▲ zuzululu 3 hours ago | parent | prev | next [-]

Great article I know this upsets a lot of people who are used to thinking Anthropic/OpenAI are just lighting cash on fire but they've cornered the market on enterprise who cannot walk away from these $200/month plans

However the valuations are still far far away from actual sanity

▲

binary0010 3 hours ago | parent | next [-]

Have you tried the large open source code models?

I use glm-5.1 and occasionally deep seek v4.

They are as good or better than Claude's latest models.

And significantly cheaper. I've converted 3 of my engineer friends as well. All three have dropped their $200 month plans they had with anthropic.

We've all been a bit shocked at just how good these models are now.

If you "have" tried GLM (I specifically find it shockingly good for code). Did you not think it's not competitive to Claude, and why?

▲

BeetleB 2 hours ago | parent | next [-]

I use GLM-5.1.

It's good enough for personal stuff. It doesn't compare to the latest Opus I use at work. You can certainly argue I don't need Opus for work, but there is clearly a difference.

Also, at least with z.ai, GLM-5.1 is s l o w! After using Claude at work, I get really impatient with GLM-5.1 at home. When doing "true" vibe coding (i.e. not really examining the code), Opus is a ton faster (easily 5x).

But yeah, I'm not willing to personally pay for the frontier models. I won't even renew my annual Z.ai plan - it's become too expensive.

▲

binary0010 2 hours ago | parent | next [-]

Hmm, I use opencode subscription, and glm seems just as fast from the tests I've tried to compare between the two. Tbh it mostly took Claude longer (mostly significantly longer) for the same tests.

Also, and I know you may not want to answer. But could you give me an idea of the type of thing you found glm to be worse with?

I think I've been fairly unbiased in testing a bunch of different development tasks. But am curious if maybe it performs well for some stuff and not others. So if you could share what you feel it's worse at.

Also are you an experienced developer or less experience?

	▲	BeetleB 2 hours ago \| parent [-]
		Perhaps opencode zen isn't using z.ai as a provider?

▲

cassianoleal 2 hours ago | parent | prev | next [-]

I'll repeat something I wrote on an entirely separate HN submission.

When DeepSeek V4 Pro came out, I had been mostly coding with GLM-5.1 on a Z.ai coding plan.

I had a large analysis task on a relatively complex codebase. I decided to try the models out.

GLM-5.1 did acceptably but got a few things wrong (easily corrected) and took quite a while to get there.

Opus 4.6 burnt through the US$10 budget I had given it in about 10-15 min, without ever returning from the first prompt.

DeepSeek V4 returned a full analysis within 2-3 min, and I carried on all the way to implementing the feature I was after. Total cost less than US$1.00.

I now mostly alternate between GLM-5.1 and DeepSeek V4 Flash, with an occasional dip into V4 Pro for more complex analyses.

▲

dominotw 2 hours ago | parent | prev [-]

task i am working on right now at work is comparing two verisions of apis and documenting responses in their outputs. i suspect a vast majority of work at entrprise is of similar complexity.

right now everyone is using latest and greatest to do dumb stuff like that. that would change fast if companies start caring about costs.

▲

therealdrag0 an hour ago | parent | prev [-]

What is the best IDE UI to use them? I don’t like CLIs.

▲

thewebguyd 2 hours ago | parent | prev | next [-]

> enterprise who cannot walk away from these $200/month plans

Any org with more than 150 users aren't on $200/month plans, they are forced into API pricing + $20/month/user

For individuals and orgs small enough to get to use the subscription plans, that's all well and good until usage limits keep going down, or cost goes up. If you compare the usage you get on $200/month maxed out vs. what that would cost at API pricing, the $200/mont plan is an absolute steal. I doubt it will last long.

▲

bigbuppo an hour ago | parent [-]

Not to mention the API plans are also still in their "lose money, just get the suckers hooked like addicts" phase. Once the reality-based pricing comes into play, it's a coin flip of whether the bulk of the companies fail, or they get to live off government subsidies for a few decades.

On the plus side, I'm happy I'll have a nice hay barn when the local half-built AI data center is abandoned.

▲

simonw an hour ago | parent [-]

I believe that API pricing runs at a healthy margin, at least compared to the server and energy costs used to serve the tokens.

Recent conversation here on that topic: https://news.ycombinator.com/item?id=47062534#47063134

▲

bigbuppo 38 minutes ago | parent [-]

There isn't a single thing about how the AI companies are operating that looks like a normal business. I know people who were in the room when Scott Sullivan, CFO of Worldcom, assured everyone that the future was bright at Worldcom days before they collapsed. So you'll have to excuse me if I don't believe the words of someone whose sole job is to justify hundreds of billions of dollars being thrown at Anthropic when he says their future is bright.

	▲	simonw 34 minutes ago \| parent [-]
		I agree that the amount of investment thrown at these companies is absurd. But I also think that their API token pricing represents a real margin over the inference costs for serving those tokens. Both things can be true at once.

▲

smallerfish 3 hours ago | parent | prev [-]

> enterprise who cannot walk away from these $200/month plans

But that's the point of the article. Enterprise plans are starting to get API pricing, not the subsidized subscription pricing.

▲ vb-8448 an hour ago | parent | prev | next [-]

> That’s $2,180.16 worth of tokens for $200—not bad at all!

Just imagine how funny it will be if it comes out that big labs were doing some fancy maths to count the 2k$/month in their forecasts ...

▲ airstrike 2 hours ago | parent | prev | next [-]

Who's to say those enterprises won't churn after XYZ comes out with a decent enough model that costs 10x less to use?

There's a whole bag of clever tricks you can play to juice short term results leading to an IPO that may not work longer term.

I'll believe they've found product-market fit when they have a product. Right now they're selling the infrastructure, in a highly subsidized and undifferentiated way (at least over a sufficient long period of time of, say, a couple of years).

▲ wewewedxfgdf an hour ago | parent | prev | next [-]

Simon Willison just hit the "Publish to top of HN" button.

	▲	simonw 39 minutes ago \| parent [-]
		Wish I'd hit that one the other day on this one, which I cared a lot more about: https://news.ycombinator.com/item?id=48228321

▲ vb-8448 an hour ago | parent | prev | next [-]

I'm a huge fan of agent coding but kinda dislike this "llm evangelism".

There are still several open points (eg.: code churn, maintainability, subtle bugs human will never do) that everyone with a minimal programming knowledge that seriously used a LLM agent knows about but somehow none of these "big influencers" never mention (or just saying "it's your fault").

▲ CuriouslyC 2 hours ago | parent | prev | next [-]

Companies are kool-aid drinking now due to hype, but given how much they're spending, if they don't see REAL, BIG wins from it soon, they're going to scale it back quickly and switch to Chinese models. Claude isn't worth the API cost for a lot of development work, and once companies have had time to collect and crunch data they'll see this.

	▲	grttq 30 minutes ago \| parent [-]
		Swear people like you were hyping the frontier labs so hard not long ago. Funny to see the change of tone - a lesson for people not to get too ahead of themselves.

▲ dude250711 3 hours ago | parent | prev | next [-]

> Anthropic are strongly rumored to be about to have their first profitable quarter.

Is that quarter same as any other quarter in terms of infrastructure costs (e.g. are there any temporary discounts happening coincidentally)?

▲

MadxX79 2 hours ago | parent | next [-]

Didn't xAI basically donate the compute for that quarter so Anthropic could get to say they turned a profit?

▲

simonw 2 hours ago | parent [-]

The SpaceX S-1 says they're charging Anthropic $1.25b a month.

	▲	travelalberta 2 hours ago \| parent [-]
		It also states that the first few months (this current quarter where Anthropic are reporting profit) are discounted.

▲

travelalberta 2 hours ago | parent | prev [-]

Hey man, that discounted rate on Colossus 1 inference is purely coincidental...

▲ Legend2440 3 hours ago | parent | prev | next [-]

>Somehow this fragment turned into headlines like Uber’s COO says it’s getting harder to justify the money spent on AI tokenmaxxing, because the market for stories about AI failures remains enormous.

I notice this all over the place. Many people hate AI and want it to fail, and they're willing to invent misinformation if it supports that idea.

▲

hansmayer an hour ago | parent [-]

Well, it is a big news when the COO of Uber says it no? Not quite some small consultancy shop here.

▲

Legend2440 an hour ago | parent [-]

But the COO did not say that. The headline was deliberately misrepresenting what he said.

▲

hansmayer an hour ago | parent | next [-]

No, he said exactly that, if you remove the corporate sanitised language designed to not offend the Uber CTO.

▲

simonw an hour ago | parent [-]

I think you're putting way too much weight into what one person said in unprepared remarks at the 27 minute mark in a 32 minute podcast conversation.

▲

hansmayer 35 minutes ago | parent [-]

That "one" person is the COO of Uber. And the other one - the one based on whose statement about burning through yearly AI budget in the first few months - the whole discussion sprung up internally at Uber in the first place is the bloody CTO of that huge company. So yes, their words do have A TON OF WEIGHT. Thats why they are in such important positions, arent they? They're not quite the Derek from the pub, casually commenting on how Liverpool will fare this season.

▲

simonw 29 minutes ago | parent [-]

I think the way people reacted to those statements was entirely out of proportion to what was said.

I repeat: a CTO saying that they spent their entire AI budget for 2026 when that budget was clearly set in 2025 before anyone knew what those November models + harnesses were capable of is entirely unsurprising. Any analysis that doesn't also point out the difference between 2025 and 2026 era coding agents is either ignorant or deliberately misleading.

▲

hansmayer 20 minutes ago | parent [-]

Yes, but that's irrelevant, because the COO uses that to base his core argument - that all that jackshit 1800 code changes per week that the CTO boasts about, mean absolutely nothing in terms of value. It means they are spending a lot on it, to gain as he diplomatically said "perhaps 20% more" - and I wonder 20% of fucking what - it's a ride-sharing app, what could they be possibly building on top of it with all that token crap?

	▲	simonw 16 minutes ago \| parent [-]
		You have to try pretty hard to get to "all that jackshit 1800 code changes per week that the CTO boasts about, mean absolutely nothing in terms of value" from what he said on that podcast. (We still don't even know what Uber's planned AI budget for 2026 was. They didn't reveal that when asked - in https://www.theinformation.com/newsletters/applied-ai/uber-c... it says "He wouldn’t disclose exact figures of the company’s software budget or what it spends on AI coding tools").

▲

uncivilized 19 minutes ago | parent | prev [-]

The article was posted on HN and discussed a day or two ago.

https://news.ycombinator.com/item?id=48268871

▲ mschuller an hour ago | parent | prev | next [-]

yep, and the issue is, they took investment

▲ stego-tech an hour ago | parent | prev | next [-]

The big assumption with all of these sorts of analyses is that things will continue as they are for the foreseeable future.

In hype-driven markets, you cannot be certain of that.

Let's take a view that the author is right: coding agents and their associated harnesses were the inflection point for some degree of profitability and widespread consumption, and that these tools are now yet another SaaS subscription or API bucket expense to bake into every single developer (or developer-adjacent) in the organization alongside your collab suite, HR seat, CRM seat, design seat, etc. To be fair I honestly think that's a safe assumption to make for highly technical firms whose image is derived from remaining on the cutting edge of things.

That begs the following questions, which we won't know until IPOs start happening:

* Are subscriptions profitable, or just API consumption?

* What's the run rate when we just consider subscription-based usage like Claude Code and Codex? What about API calls?

* Is there any profitable pathway forward at which enterprises can get unlimited usage but at fixed rates via subscription?

* What does customer churn look like for subscription users versus API users?

We also have a number of questions for customers that I suspect we'll start seeing receipts for in the coming months, at least from the early adopters:

* What was the net gain (loss) from leveraging coding agents?

* What's the cost of a developer with or without access to a coding agent + harness? Is it cheaper to hire an outsourced worker with a coding agent subscription, or a domestic worker without one?

* At what point does further AI spend result in diminishing returns, i.e. where's the 'sweet spot' for spend?

* Did AI boost actual revenue and outcomes, or did it just gamify KPIs?

* What roles or work did AI actually replace, versus merely displace during the hype cycle?

Not to mention the questions regarding the technology itself:

* Will we develop the means to run foundational/frontier models at edge using less resources through some existing (e.g. distillation) or new technology, thus cutting off the profit centers of these firms?

* When the market mismatch between supply and demand is resolved, won't it be more affordable for consumers and companies to operate their own AI infrastructure rather than support further centralized buildouts?

* Will coding agents improve to the point of being able to bootstrap and self-orchestrate on edge/consumer hardware without substantial technical expertise, or at least improve to the point that traditional IT teams can securely operate them internally without an expensive subscription or API token bucket?

All of these will influence the long tail of this bubble, because it is a bubble at this point. Even if these companies are indeed profitable thanks to the coding agent inflection point, there's still so many unanswered questions about utility beyond coding that it's impossible to extrapolate a future. If coding agents are indeed the extent of utility for profitability, then there's no possible way these entities will recoup the investment already sunk into their infrastructure buildouts. Even if more profitable uses are discovered, does this offset or replace the firms disappearing due to AI speculation and their associated contributions to the economy as a whole (RE: the consumer compute industry at present, higher energy costs due to datacenter builds, opportunity cost from harms to local infrastructure from haphazard builds, etc)? Should these firms indeed be runaway successes and immensely profitable to the point of paying off their investors and growing the larger economy, does this end up stifling innovation in a world where most new ideas are fed into LLMs for R&D that are then controlled by only a handful of companies and immensely wealthy people, via systems that are easily surveilled and stolen from without recourse?

So many, many questions yet to be answered. Betting the farm because of coding agents is one hell of a gamble.

▲ bellowsgulch 2 hours ago | parent | prev | next [-]

How will they stay profitable if every business lays off engineers because of AI and there are no engineers to use it? /s

▲ enraged_camel 2 hours ago | parent | prev [-]

I wonder how Ed Zitron will shift goal posts this time, and how long it will take for that article, when published, to reach HN front page.