| ▲ | returnInfinity 6 hours ago |
| Brad Gerstner confirmed that tokens aren't being sold at a loss. Whatever the formula, API + Subscription split, the companies are making a profit on net token sale. They maybe running at loss after all the salaries and stock comp, but tokens are in profit now. |
|
| ▲ | utopiah 5 hours ago | parent | next [-] |
| It's like witnessing a rocket using the most powerful engine on Earth then once it escaped orbit turn off the engine and said "It is flying without power!". Yes, sure, right now it is ... but that's NOT how it got here. There are trillions invested to recoup and at most billions in sales. It doesn't add up to tokens making a profit any time soon. |
| |
| ▲ | StevenWaterman 5 hours ago | parent | next [-] | | The problem is, people see "they're not profitable once you account for training" and equate that to "AI will go away soon" But if all the AI companies stopped training new models, they would all instantly become profitable (and stick around) The thing that makes them unprofitable, is having to compete (which means training models). If / when enough companies exit the market, the cost to compete goes down and you end up in an equilibrium | | |
| ▲ | notahacker 4 hours ago | parent | next [-] | | Sure, but if companies don't exit the market and FOSS alternatives don't end up being unable to get near them in quality, they have to keep spending on training. And conversely, if the market becomes uncompetitive and FOSS sucks, the winners of the AI arms race are very strongly incentivised to stick their prices up anyway... | | |
| ▲ | JumpCrisscross 4 hours ago | parent [-] | | > if companies don't exit the market and FOSS alternatives don't end up being unable to get near them in quality, they have to keep spending on training Eh, the AI companies still have lots of datacentres. For the guys who funded with equity, they could collapse down to just running those as utilities. (For the guys who funded with debt, they'd have to restructure.) From the customer's perspective, this situation shouldn't result in a cost spike. (Consolidation, on the other hand, would. But that's a separate argument from the one the article attemptes to make.) | | |
| ▲ | notahacker an hour ago | parent [-] | | How often do VC funded unicorns collectively decide to stop scaling up, shut down all their departments targeting growth and reach breakeven point by becoming low margin utilities that will never justify their valuation? |
|
| |
| ▲ | InsideOutSanta 4 hours ago | parent | prev [-] | | That's all true, but that ends badly for us either way. If there's competition, training must continue, which must eventually be reflected in pricing. But if there's no more competition, there's no more incentive to keep prices low, which will also be reflected in pricing. |
| |
| ▲ | JumpCrisscross 4 hours ago | parent | prev | next [-] | | > There are trillions invested to recoup and at most billions in sales. It doesn't add up to tokens making a profit any time soon But this isn't "a ticking time bomb for enterprise." It's an issue for the AI companies' investors. | | |
| ▲ | shimman 4 hours ago | parent | next [-] | | Good thing the entire nation's economic growth outlook isn't tied to these companies then. For a second I thought we had a potentially dangerous situation on how we misappropriated trillions of capital. | | |
| ▲ | airstrike 4 hours ago | parent [-] | | For anyone doubting this, total private investments in the US grew 2% in 2025 relative to the prior year, adjusted for inflation. But within that big pie, the "IT-related" investments grew 15.7% whereas non-IT actually shrank 2.0%. |
| |
| ▲ | alkyon 4 hours ago | parent | prev [-] | | Not really, because investors will sooner or later want to see real returns on what they invested. Tokens are suddenly not dirt cheap and enterprises are screwed. It's like selling dope, once they're addicted, a dealer could turn the screw on them | | |
| ▲ | mpyne 4 hours ago | parent | next [-] | | That's why it's an issue for investors. Their investment may not payout. But the things that were built will still have been built and available to sell for related purposes, the models that were trained will still be trained, and so on. If things don't end up working out a lot of people have already been (and in the future will be) paid. It's the investors that will lose out, not the subscriber. | |
| ▲ | airstrike 4 hours ago | parent | prev [-] | | Not if they IPO and some other sucker buys the stock. |
|
| |
| ▲ | manmal 3 hours ago | parent | prev [-] | | Certainly not trillions. The models costing tens of billions to train are a very new development. |
|
|
| ▲ | ainch 6 hours ago | parent | prev | next [-] |
| Tokens can be sold at profit, but 70% of compute expenditure goes to R&D and model training[0]. Inference needs to cover all of that as well as being profitable in a vacuum. [0] https://epoch.ai/data-insights/openai-compute-spend |
| |
| ▲ | ml_basics 4 hours ago | parent | next [-] | | this will change as inference demand increases (which is happening right now faster than many people expected) | | |
| ▲ | ainch 37 minutes ago | parent | next [-] | | At the same time, the training paradigm being scaled, Reinforcement Learning, is significantly less data-efficient than next-token prediction. You basically need to run an agent for minutes (or longer if you want good long-horizon performance), only to give it a binary pass/fail - one bit of information. Inference compute is definitely scaling fast, but to scale RL, training and R&D compute also needs to scale hard. I don't think it's obvious that inference will overtake R&D/training, unless there's a reputable source that states that. | |
| ▲ | vb-8448 3 hours ago | parent | prev [-] | | do you have some ref? |
| |
| ▲ | benjiro3000 5 hours ago | parent | prev [-] | | [dead] |
|
|
| ▲ | cryo32 5 hours ago | parent | prev | next [-] |
| They aren't being sold at a loss but they aren't being sold at enough to cover the current losses and the costs. The losses are being passed around in some fucked up circular funding mess which will inevitably collapse into a debt crisis at some point. |
|
| ▲ | malshe 4 hours ago | parent | prev | next [-] |
| In other words, AI companies have positive earnings before expenses |
| |
| ▲ | mrweasel 4 hours ago | parent [-] | | "We'd be making money if we didn't have to manufacture the product", something like that? |
|
|
| ▲ | gobdovan 4 hours ago | parent | prev | next [-] |
| Do you think it will be the case for the Claude Code/Codex tokens as well? I think those are heavily subsidized, but they're the only ones I find real value in. |
|
| ▲ | m0llusk 6 hours ago | parent | prev | next [-] |
| That isn't enough. Over time the need for growth and increasing profits will squeeze existing margins. |
| |
| ▲ | hypercube33 5 hours ago | parent | next [-] | | I think for a while this is possible - the models definitely aren't as efficient as they can be as we've seen a lot of promising papers over the last year about how people are changing pieces and parts to do more with less. None of it has come to market yet that I'm aware of so for now it's just a hope I suppose but things like Opus definitely burn a ton of compute to be the leader in benchmarks but the gaps are closing. | |
| ▲ | riddlemethat 6 hours ago | parent | prev [-] | | Open source models apply pressures on the low end of the market. The paid models are so much better that they can charge based on value for enterprises. | | |
| ▲ | InsideOutSanta 4 hours ago | parent | next [-] | | I wouldn't call Kimi K2.6, GLM5.1, DS4 or newer Qwen models "low end". I prefer GPT5.5, but if it disappeared tomorrow, I'd be perfectly fine with any of these chinese models. | |
| ▲ | rglullis 5 hours ago | parent | prev [-] | | Have you used any of the recent models? My experience with GLM 5.1 does not make me miss Opus at all. |
|
|
|
| ▲ | vb-8448 3 hours ago | parent | prev | next [-] |
| Is sounds very much like "trust me bro" ... Obviously I, like basically everyone else here, don't have access to Open AI or Anthropic books so it's just guessing based on public available evidences, but "tokens aren't being sold at a loss" does not imply there is any profit. And, even if there is some profit, it needs to be big enough to at least pay back the capex spendings and finance the next model iteration. |
|
| ▲ | jknoepfler 4 hours ago | parent | prev | next [-] |
| Brad Gerstner might need a primer in asset depreciation. |
|
| ▲ | mpalmer 6 hours ago | parent | prev | next [-] |
| This is the sort of uncritical thinking that inflates bubbles in the aggregate. |
| |
| ▲ | wqaatwt 6 hours ago | parent [-] | | Compared to the inference prices for open models it’s highly unlikely OpenAI/Anthropic are not making decent amounts of money from inference. How many times bigger could Opus be than GLM or Kimi, it’s certainly not proportional to the price | | |
| ▲ | mpalmer 5 hours ago | parent [-] | | it’s highly unlikely OpenAI/Anthropic are not making decent amounts of money from inference.
Based on what? Why are we all whispering about how profitable all this is? It is the absolute last thing these firms would keep secret. | | |
| ▲ | JumpCrisscross 4 hours ago | parent | next [-] | | > Why are we all whispering about how profitable all this is? Nobody is whispering about anything. Everyone is loudly assuming what's convenient for their thesis. Even if you have access to the books, the accounting isn't straightforward–there are yet insufficient data for a meaningful answer. > It is the absolute last thing these firms would keep secret If you find an optimisation strategy that you don't think your competitors have, you absolutely keep your margins secret for as long as possible. Knowing something is possible is the first step to making it so. | | | |
| ▲ | wqaatwt 3 hours ago | parent | prev [-] | | Based on what I said. If e.g. Sonnet (assuming it’s significantly smaller than Opus) is unprofitable why are there a bunch of inference providers on OpenRouter serving very large models way cheaper? They don’t have a pile of money to burn for no reason. |
|
|
|
|
| ▲ | iLoveOncall 6 hours ago | parent | prev | next [-] |
| He's an interested party. His investments are worth a lot more if he says that tokens are sold at a profit. I don't understand how anyone would trust him? |
| |
| ▲ | wqaatwt 6 hours ago | parent [-] | | There are plenty of various providers on OpenRouter serving very large Chinese models like GLM for a fraction of what OpenAI/Anthropic. Presumably they are making a profit. It’s unlikely that Claude is proportionally that bigger and more expensive to serve so profit margins on inference must be pretty decent | | |
| ▲ | sarchertech 5 hours ago | parent [-] | | Do we know they are making a profit though? They could be subsidizing use to build market share the same way. They might not have billions, but at the volumes they are selling maybe they’ve got the cash to do it. Even if they are “profitable” how many Uber drivers are “profitable” because they aren’t correctly calculating asset depreciation. Maybe these guys are doing the same thing. Maybe it’s a lot of people who already had GPUs for crypto mining, and they’ve moved over to this, so that if they need to grow and buy new GPUs the costs would dramatically grow. | | |
| ▲ | mvanbaak 5 hours ago | parent | next [-] | | also, it's very much possible that the chinese companies get heavy investments from the state. Since it's very hard to get this info we have no idea wether they really make a profit or not. | | |
| ▲ | throwaway-away 4 hours ago | parent | next [-] | | I agree, and find that very plausible. I mean, for the CCP a few billions to subsidize domestic AI companies is a tiny investment with a potential huge payoff. It prevents (or at least make it harder for) US companies to build a monopoly on LLM tech and it could help popping the bubble which would weaken the US economy. In fact, if I remember correctly, the AI infrastructure build-out is what is keeping the US from a technical recession. | |
| ▲ | wqaatwt 2 hours ago | parent | prev [-] | | The R&D is of course subsidized but a lot/most(?) of these inference providers are not Chinese | | |
| ▲ | sarchertech an hour ago | parent [-] | | If the Chinese companies are subsidized anyone who wants to compete with them has to match their price. |
|
| |
| ▲ | wqaatwt 2 hours ago | parent | prev [-] | | > subsidizing use to build market share the same way To an extent maybe, but that market is almost entirely commoditized already. Besides Cerebras and maybe Groq (which already charge a slight premium) all the other providers are more less interchangeable. > Maybe it’s a lot of people who already had GPUs for crypto mining I’m not sure the type of GPUs that were most popular for crypto are at all useful for LLMs? | | |
| ▲ | sarchertech an hour ago | parent [-] | | >interchangeable If there’s a few providers subsidizing, that’s the price ceiling. Everyone who wants to compete has to subsidize. Now if this market had been operating for years, I’d say that it’s likely all these companies are profitable or close to it. But the market is so new and there’s so much hype, I find it very plausible that none of these guys are making a profit and they all hope to just hang in until all the subsidies go away. > I’m not sure the type of GPUs that were most popular for crypto are at all useful for LLMs? There’s some overlap. I’ve definitely read about people repurposing. |
|
|
|
|
|
| ▲ | parliament32 3 hours ago | parent | prev [-] |
| If tokens weren't being sold at a loss, Anthropic would be screaming about it from the rooftops. They've been desparately trying to make themselves not look like a money furnace lately, but it's not really working. They might be sold at-compute-cost, but that of course ignores training, salaries, and everything else. |