| ▲ | trjordan 4 hours ago |
| They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down. This means we're going to need $1t+ per year in spending, per year, on tokens. 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer. That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending. We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well. |
|
| ▲ | whatshisface 2 hours ago | parent | next [-] |
| Here are a few thoughts: - The publicly available information about how inference costs compare to training costs is conflicted. EEs involved in datacenters talk about power usage spikes during training runs as if they were a major factor in the designs, but academic papers discussing cost-optimal scaling confidently treat inference-time compute as a major factor. - On the side of the balance indicating that training is more compute-intensive after amortization than inference is that Chinese providers, constrained primarily by access to compute, have nearly unlimited token availability at a lower price than US providers (inference), but poorer model capabilities (training). That would make sense only if US providers are inflating inference costs by 20-30x due to amortized training costs that overseas providers were not able to take on (there are other factors too). - If training >> inference, they're in a prisoner's dilemma that far exceeds the ordinary zero-marginals model of competition between firms (due to its huge discrete stepwise nature). On the other hand, if inference>>training, the high-level analysis popularized by certain thought leaders, that it's like a utility, would be true. You'd tend to count this as a vote for inference>>training, but the CEOs saying it at least have a huge incentive to agree because the alternative, the prisoner's dilemma, would stop investment very fast. - The only voice in the story that I just told you to have anything to do with fact (as opposed to high-level analysis and ivory tower armchair management of a secretive business) were the rumors from facilities engineers. That shows you the state of our understanding... - If we don't even know the ratio between amortized capital expenses and operational costs, outside investor analysis is impossible. It doesn't matter how finely they divide the accounting buckets for office ferns and indoor ferns if the single biggest part of their business is obscured for trade secret reasons. |
| |
| ▲ | materielle 2 hours ago | parent | next [-] | | I'm about to leave a shallow comment, but I am a bit skeptical of the supposed drop in inference costs. If AI labs saw a lot of potential there, they'd surely be bragging about it non-stop? So the fact that publicly available information is conflicted is probably a sign that at the very least, the numbers aren't amazing. Yes I know there's no evidence and this is lazy reasoning. But there's probably a bit of truth to this line of thought. | | |
| ▲ | Tuna-Fish 2 hours ago | parent | next [-] | | Why on earth would AI labs be bragging about how little the product they sell actually costs them to make? You don't want to do anything that reduces it's perceived value to the user, that might make them less willing to pay for it. Also, inference costs are bound to go way down with more optimized architectures. GPUs are fundamentally not great at inference. No platform where the weights are streamed from a large pool of memory is. If the models ever quiet down, there will be massive step changes in cost/token, energy/token and tokens/second, as models are etched into silicon ala https://chatjimmy.ai/ | | |
| ▲ | golem14 2 hours ago | parent [-] | | Why would any company brag about their margins ? Yet they do, to attract investors. | | |
| ▲ | Tuna-Fish an hour ago | parent [-] | | The key AI labs are not public companies, they are at liberty to brag about their margins to potential investors in private. | | |
|
| |
| ▲ | whatshisface 2 hours ago | parent | prev [-] | | Inference has traditionally been far less expensive than training. One public example is the fact that hobbyists can run StableDiffusion ($600k training costs[1]) on their personal computers. Speaking to your point, inference being dramatically less costly than training would not be seen as a delta from the norm. The model of providing inference for anything near the operational costs (like a utility would), would the delta from the norm if it were true. [1] https://x.com/emostaque/status/1563870674111832066 | | |
| ▲ | thesz 41 minutes ago | parent [-] | | The difference between training and inference is 1) one have to keep intermediate results for backward pass in training and 2) computation for training double because of the backward pass. Training is also done over batches, which increase memory requirements by several orders of magnitude. This is why training needs costly compute. One of the ways out of this unfortunate situation is to use something like Stochastic Average Gradient Descent [1]. Examples there are mostly concerned with regularized logistic regression, which makes problem more or less convex. Neural networks are inherently non-convex. Still, maybe some ideas from there can be utilized in the context of neural networks, like use of estimated Lipshitz constant to derive curvature and appropriate learning step. [1] https://www.cs.ubc.ca/~schmidtm/Courses/540-W19/L12.pdf
| | |
| ▲ | janalsncm 17 minutes ago | parent [-] | | So one way to think about it is roughly, Training is inference + backwards pass (~2x inference cost) + activations (vram overhead) + optimizer (vram overhead) + gradients (vram overhead). |
|
|
| |
| ▲ | vlovich123 an hour ago | parent | prev [-] | | Small alternative potential future changes that alter this analysis: * At some point model capability reaches diminishing returns. Then inference >> training in the future but training >> inference now. It’s not a prisoner’s dilemma but a land grab to solidify market position and be one of the 2-3 firms left standing as dominant in the space. The model companies aren’t super sticky yet but they’re working on it. * even if training remains >> inference, it’s possible to have multiple price points like they do today. If you need the most capable model you’ll be paying exponentially more per token to supplement the training cost even though the serving cost is marginal because most people will be satisfied with cheaper / less capable models for most tasks. I buy that inference is a dropping line item while training is a growing one. There’s all sorts of things on the horizon that’ll be order of magnitudes improvements, from startups burning models into ASICs to get order of magnitudes more performance to alternate architectures like diffusion transformers that have orders of magnitude structural optimizations. It’s inevitable that it’ll come down even further from where we are. It’s possible model training also will go down but I’ve not seen any compelling research suggesting major “easy” reductions here. | | |
| ▲ | janalsncm 11 minutes ago | parent [-] | | The issue is that most tasks do not require frontier-level intelligence, but companies like OAI can really only profit off of the frontier. Capabilities from a year or two ago are so outdated that even OpenAI gives it away for free and there are many other models biting at their heels. In other words they are spending huge amounts of money to cash in on a depreciating asset. So one possible future is that frontier-level training becomes so expensive and the use cases so sparse that it simply isn’t viable to keep going bigger. |
|
|
|
| ▲ | FuriouslyAdrift 2 hours ago | parent | prev | next [-] |
| I work for a tiny little company ($150MM annual rev with 9% net) and we are already looking at dropping $100k on hardware to run local models because, for us, they're "good enough." Our estimated spend for AIaaS would exceed that cost in less than a year. In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies. |
| |
| ▲ | simplyluke 2 hours ago | parent | next [-] | | Yeah, that's the part that just seems to be wildly under-discussed to me. If open source models are ~3-6 months behind SOTA, and ~opus4.6 capabilities are good-enough for product market fit, do the frontier labs have half a decade to catch up on their prior burn? AI cost ballooning faster than companies can afford is becoming a very common topic in my circles right now. The era of "I'll pay infinitely more for marginal gains" is over from what I can tell. | | |
| ▲ | swalsh 12 minutes ago | parent | next [-] | | Open source models, especially qwen are pretty dang good. But its not opus 4.6, the evals dont tell the full story. I question the assumption open source models are 3-6 months out. | |
| ▲ | doug_durham an hour ago | parent | prev | next [-] | | Open source models that you can run locally are much more than 3 to 6 months behind. 6 months was the November inflection for Claude. No open source model is as good as Claude Opus 4.6. | | |
| ▲ | jobs_throwaway an hour ago | parent | next [-] | | It depends what you mean by locally. I don't foresee running a model on my laptop anytime soon to power a coding agent. Far more likely is an infra team at my company operating an open source model on cloud infrastructure. When they're already paying $1000 / month / dev, it starts to pencil pretty quickly. | |
| ▲ | simplyluke an hour ago | parent | prev | next [-] | | > that you can run locally That's doing a lot of work here. The future I see isn't most companies buying hundreds of thousands in hardware to run models, it's them adding a line item to their AWS bill. Inference costs on the larger hosted open source models are dramatically lower than the frontier labs API pricing. | | |
| ▲ | apocalyptic0n3 an hour ago | parent [-] | | > it's them adding a line item to their AWS bill That's the future Amazon sees too. We just had a week long session with the AWS team and they pushed that to us multiple times. |
| |
| ▲ | PeterStuer an hour ago | parent | prev | next [-] | | Many business tasks do not need the latest frontier models. I have a production system running since early GPT-4o. It now runs with GPT-5.2, not for improvements, but because it is cheaper. I could invest in switching to a local model, I tried and it works well enough, but api costs for this task are so low, it barely scratches $30/month. So I am using the local machine for other things and leave the inference on OpenAI, for now. | |
| ▲ | applfanboysbgon 28 minutes ago | parent | prev | next [-] | | Opus 4.6 is a February model. Every time this subject comes up it seems like people post intentionally misleading things and move the goalposts. The goalpost we've been bludgeoned with over and over again is that, in particular, Everything Changed in November 2025. That GPT 5.2 and Claude 4.5 were the inflection point. That is actually 6 months ago. And DeepSeek 4 is already there. > run locally You can't run DeepSeek locally on consumer hardware[1], but you can on enterprise hardware, and enterprise spend is the subject of this conversation -- and even if you aren't self-hosting, it doesn't matter, because you can just get your inference from one of the the many companies serving DeepSeek, who trivially undercut the pricing of OpenAI/Anthropic because they didn't have to spend hundreds of billions on training frontier from scratch but instead only invest in supporting inference, which is already profitable. [1] Since this misconception comes up all the time, I'll go ahead and pre-empt it: no, training a 32b parameter model on outputs from DeepSeek and running that locally is not "running DeepSeek", despite the hundreds of stupid articles and Youtube videos making that idiotic claim that they're running it on a 5090. | | |
| ▲ | simonw 23 minutes ago | parent [-] | | > You can't run DeepSeek locally on consumer hardware Maybe not DeepSeek v4 Pro, but I've run DeepSeek v4 Flash on my 128GB MacBook Pro using antirez's carefully quantized https://github.com/antirez/ds4 and it's impressive. | | |
| ▲ | applfanboysbgon 2 minutes ago | parent [-] | | Oh sure, yeah, that's nothing to sneeze at either. I think unqualified "DeepSeek" should generally refer to the main model, though, especially in the context of GPT5.2-grade quality. |
|
| |
| ▲ | PunchyHamster an hour ago | parent | prev [-] | | But one will be in few months. And then you have choice of paying say $100k for hardware and pay just power cost (or pay someone to do that for you), or pay way, way more for your team to have access to marginal improvement. And 5% worse model for 10% of the price of the bleeding edge will be worth it for majority of people |
| |
| ▲ | w29UiIm2Xz an hour ago | parent | prev | next [-] | | If only the AI era was born in ZIRP. | | | |
| ▲ | svara an hour ago | parent | prev [-] | | There's still a lot of room for the best models to get better at coding . Your argument rests on the "for marginal gains" part but it's really not clear that the gains are marginal in the foreseeable future. |
| |
| ▲ | stopachka 5 minutes ago | parent | prev | next [-] | | I don't quite understand, what would 100K buy you? AFAIK you would get about ~5 concurrent users, with a max context window of ~128K tokens on the larger models. This wouldn't be good enough for coding -- are you guys thinking of using it for something else? | |
| ▲ | EvanAnderson 2 hours ago | parent | prev | next [-] | | > ...we are already looking at dropping $100k on hardware to run local models... Just think how much further that $100K would have gone if the hardware market wasn't so screwed-up. Anecdote: I priced-out adding 1TB of RAM to a four node cluster a couple months ago. The cluster was purchased in fall of 2024 w/ 4 nodes, each with 256GB RAM. The nodes cost just over $14K apiece back in 2024 (entire box, not just the RAM). Dell wanted >$90K a couple months ago to add 256GB to each node. | | |
| ▲ | cyberax an hour ago | parent [-] | | > Dell wanted >$90K a couple months ago to add 256GB to each node. RAM is expensive, but not THAT expensive. I just bought 128Gb for about $5k for our build cluster (it's not even for AI, sigh). Even if you need larger-sized DIMM sticks, it's still going to be in the vicinity of ~15k tops. | | |
| ▲ | EvanAnderson 16 minutes ago | parent | next [-] | | It was crazy. I found the part on the open market for a lot less but the edict from the Customer was to buy from Dell to keep the support entitlement intact. That inflated the price to an astronomical level to be sure. I haven't had problems w/ Dell support and 3rd party memory, personally, but given the machines' application I understood the concern. | |
| ▲ | an hour ago | parent | prev [-] | | [deleted] |
|
| |
| ▲ | MASNeo an hour ago | parent | prev | next [-] | | On prem AI makes sense for more than just the cost. More control, IP, model improvements you can keep, data privacy to name a few. People will realize that AI is not like compute the moment they get their own knowledge sold back at a premium. | |
| ▲ | arbuge 2 hours ago | parent | prev | next [-] | | > In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies. What makes you so confident about this prediction? Hardware costs haven't exactly been cratering recently. | |
| ▲ | cmdrk an hour ago | parent | prev | next [-] | | Do you think this will be a trend for larger companies as well? The decadal move to all-cloud-all-the-time killed off in-house hardware teams while the C-suite chased their OpEx dreams. It would be interesting if we come full circle on this. | |
| ▲ | disiplus 41 minutes ago | parent | prev | next [-] | | same, but you need more then 100k of hw to run something like kimi k2.6 for a bigger team. on the other hand there is a ds4 flash that you can run on a macbook with 128gb ram. an that one is perfectly usable for a lot of tasks. https://github.com/antirez/ds4 | |
| ▲ | 33 minutes ago | parent | prev | next [-] | | [deleted] | |
| ▲ | mv4 an hour ago | parent | prev | next [-] | | I configured a dual DGX Spark cluster, and it's certainly "good enough" for my agentic and coding needs. | | |
| ▲ | datadrivenangel an hour ago | parent [-] | | what models are you using on that? My experiences with apple hardware have convinced me that it is not really good enough for coding locally. | | |
| ▲ | irishcoffee an hour ago | parent [-] | | It isn’t the models, it’s the closed api and the tooling associated with it. It’s driving me crazy how not-talked-about this is. | | |
|
| |
| ▲ | alex_suzuki 2 hours ago | parent | prev | next [-] | | I’m curious: are you spending on beefy developer machines, or some kind of shared local inference server? Would be interested to know more if it’s the latter. | | |
| ▲ | irishcoffee 2 hours ago | parent [-] | | I am aware of at least a handful of companies doing the latter. I don’t work for them and cannot speak to their setup. |
| |
| ▲ | nonethewiser 2 hours ago | parent | prev | next [-] | | What models? Last I tried different local modals there was a pretty big difference from frontier. | |
| ▲ | awesome_dude 2 hours ago | parent | prev [-] | | > In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies. I was going to say - the models are just going to keep growing at a pace exceeding the pace of hardware pricing/availability But then I realised that, far more likely, there will be a plateau reached (again) where nobody is seeing gain, and at that point hardware will catch up |
|
|
| ▲ | alexpotato an hour ago | parent | prev | next [-] |
| I was in college in the late 1990s/early 2000s and I distinctly remember an econometrics professor state the following: "As cable TV and Pay Per View came out, there were studies done about how many movies people would watch if given unlimited access to films. The results were bandied about as proof that we should build out all this infrastructure to support this line of business. When the data was further analyzed by statisticians etc, it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible." I feel like we are in a similar boat here where some people are assuming: - EVERYONE is going to be using max tokens - tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc |
| |
| ▲ | j-bos 41 minutes ago | parent | next [-] | | But isn't it wonderful that they did? | | |
| ▲ | wizzwizz4 11 minutes ago | parent [-] | | It's vaguely disturbing that people "watch" films 10-12 hours a day. Many of them are using it as a radio, for background noise, without really caring what the program is beyond vague genre, tuning in and out without particular regard to the plot… and yet we have all the cost of transmitting high-resolution video point-to-point. Surely we could just put better stuff on the radio, and accomplish most of the same goals for a far lower price? |
| |
| ▲ | PunchyHamster an hour ago | parent | prev [-] | | > - EVERYONE is going to be using max tokens anthropic already hunts down OpenClaw users for using too much on their plan. I'll give different example: When LED lights started to be more popular, the power usage didn't drop by the amount of power saved >- tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc Well, first, improvements in computing stalled or even rolled back just purely because price of everything compute shot up cos of AI and that will NOT be fixed for a while and ESPECIALLY if AI usage will continue to increase Second, the token per model might go down in time but better models have more expensive tokens, so we quickly get into spot when: * price increase in token might not be worth marginal improvement next, better model brings * more and more models are passing "good enough for the task" threshold so for less and less companies there is any economic sense to pay for the "best" instead of paying deepseek or some other company to run "previous gen" models |
|
|
| ▲ | regularfry 3 hours ago | parent | prev | next [-] |
| The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build. The more of the latter they can take on, the fewer knowledge workers are needed at all. So rather than 5% of every knowledge worker's salary going into tokens, 100% of the knowledge worker's total employment cost goes into tokens and you get a 20x productivity boost as a theoretical minimum across those tasks. That's the game. There's a view you could take of this that this is just a growing of the pie: with those cost dynamics a lot more "small businesses" get a vast amount of leverage, so the overall economy grows without replacing the knowledge workers. I'm not sure I trust the MBA class to have that view. |
| |
| ▲ | seanp2k2 3 hours ago | parent | next [-] | | >The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build I would argue that that's been the case for quite some time before AI. As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade with their very high numbers of very talented and highly-compensated engineers? The issue with most big tech companies are leadership, strategy, and product direction. I'm not saying that they don't make any profits, just that they probably aren't "building [the right thing]". AI for product development and management would be far more impactful than automating rote coding tasks / building React UIs that mirror API structures IMO. | | |
| ▲ | Figs 2 hours ago | parent | next [-] | | > AI for product development and management would be far more impactful than automating rote coding tasks [...] Yeah, if this stuff actually worked that well already, OpenAI et al. would just run AI CEOs and engineers. Why get some other company to pay you at all when you can automate every other company out of existence and take all the money they make? The fact of the matter is that while the tech has some uses, it sure as hell isn't a full scale replacement and you almost always actually have to massage the input into LLMs to get anything decent back out in practice. Some CEOs and managers can learn to do this, of course, and some already are... but that quickly turns into a second full time job. A "programmer" is still needed. The job might change from mostly hand-writing C++/JS/Python to prompt engineering + some manual coding to fix all the stupid fuck-ups that the bots can't solve themselves, but you still need someone to actually prompt the bot. When that changes, it won't just be engineers losing work; there will be no reason to even have a human CEO any more. | |
| ▲ | aspenmartin 2 hours ago | parent | prev | next [-] | | I don't know, if you've ever tried to build something at companies of that scale you run into incredibly boring problems "what data table do I need for X" and "who is the right person to reach out to for Y" and "they aren't answering me I guess I'll have to escalate" I don't think there is any shortage of great ideas at these companies, they are just extremely bloated. And I don't think its something like indecision or bad PMs, it's "we have a finite amount of time and resources so we need to be conservative but also not too conservative" If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick. It changes the cost/benefit calculus of the entire business. I think you are exactly right in that: PMs/leadership are by their nature orchestration machines. Other roles are as well, but I think PM's are at a particular advantage here in that it will be quite awhile I would expect before core product decisions and creativity can be delegated to an AI, but not quite awhile until virtually everything that they're blocked on (legal approvals, POCs, wire frames, etc etc etc) will become less and less of a blocker | | |
| ▲ | supern0va 2 hours ago | parent | next [-] | | >If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick. I'll also add this: within a large organization, you often need to interact with many different codebases owned by many different teams. Agents have made it much easier to wrangle by having the ability to deploy one to scope out your web of dependencies to learn about what would be needed for feature X, and how that integration can happen. We've been doing far more away team work simply because it makes things move faster. It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work. It genuinely is helping things move faster inside large organizations. Or at least, it is for us, particularly since we're getting organizational prioritization to actually build the scaffolding to make those agents more effective at search. | | |
| ▲ | aspenmartin 2 hours ago | parent [-] | | > It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work. 1000x yes: you have touched on what I think is the single biggest factor here, that is the humongous value of POCs. they are gnarly to build without agents, and so we used to have to get everyone on board so we didn't get screwed in performance reviews, which was monumental task because that means convincing very busy PMs who have a lot on their plate and dont want to take risks on things they don't understand, and now it's like "can we scale this out" and you have a very nicely formatted proposal and POC. It de-risks things very quickly |
| |
| ▲ | skydhash 2 hours ago | parent | prev [-] | | Pieces of concept and other prototypes have always been cheap (see hackatons). The main issue is that as soon as you’re touching customer data or modifying process they’ve paid you for, you have to be really careful. No one wants to be responsible for an outage that cost the company its biggest customer. | | |
| ▲ | aspenmartin 2 hours ago | parent [-] | | Yes, but at scaled companies, where building a simple POC hooked into real systems is nowhere close to easy. To the point where it means that you might as well just do the full thing. That's where the enterprise spend and the impact is. | | |
| ▲ | skydhash 2 hours ago | parent [-] | | Isn’t that a matter of configuration management? Or do you want to alter the real systems as well? | | |
| ▲ | aspenmartin an hour ago | parent [-] | | historically it's been a matter of an absolutely horrific amount of Kafka-esque problems. Say I want to build a feature in a product. - DS has to do a deep dive (need buy in) to opportunity size and derisk with data. That DS has to work with other DS (people may have left or moved teams) to figure out how to get the right data and figure out what the difference is between 10 different tables that have overlapping but inconsistent data.
- Eng has to build up an actual simple demo (need buy in)
- Design has to make it not hideous (need buy in)
- Legal has to review what you're doing; POCs should involve real data where possible because otherwise no one will trust it, even if its just for user analysis on existing products This plus about 6 internal system bugs for custom tools that are flaky and who's team has long been re-orged or laid off, 8 people who won't answer you, 2 PTO's for the stakeholders, 6 weekly meetings no one did POCs, they just had ideas and tried to get PM's to put it on the roadmap so if it fell through at least it was bought into |
|
|
|
| |
| ▲ | regularfry an hour ago | parent | prev | next [-] | | Yes, that exists at the wider business level. No question. I think what needs to get asked is are we talking about a bottleneck within the business as a whole, or a bottleneck within the scope of the knowledge work in question. Within software delivery there's a very clear shift when it's suddenly trivial to drop a 100kLoC plausible-looking PR into code review within an afternoon. Producing working code with a whole bunch of tests which make a very clear assertion that it does, in fact, work has had (if you're going that way) all the human-scale thinking time taken out of it, down to a rounding error. It still needs to be checked by a human, which was previously assumed to be a comparatively quick task in comparison to producing the thing. At least, it does where I am, and I don't think that's a silly position today at all. If they can crack that latter review/spec-check/assurance step, checking that what was built was what was demanded of the problem such that we don't have humans in the loop at that step either, then the bottleneck moves again. Then I think it moves to requirements capture and to product development, but that might depend on the industry. | |
| ▲ | nilamo 28 minutes ago | parent | prev | next [-] | | > As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade Kubernetes is at 11 years ago, and is huge enough to be included there. The Google Pixel was just under 10 years ago. So... not nothing haha | |
| ▲ | nostrademons an hour ago | parent | prev | next [-] | | Google's internally developed and sometimes even launched plenty of innovative new products in the past decade. Stadia, Fuchsia, federated learning, and the whole transformer architecture that underlies this AI boom are good examples. The problem is they get killed by some other executive who is afraid of their department looking bad by comparison. I think this is fairly illustrative of the challenges in AI becoming as impactful as the Internet. The bottleneck is not making things. There are plenty of people who are really good at making things and can easily be 10x or 100x as productive as the average corporate worker. YCombinator was founded on that premise - small teams of founders and early employees could be orders of magnitudes more productive than the 1000s of corporate employees at their competitors. The bottleneck is on bringing your product to market. If your innovative new product is built within a corporate environment, it'll get killed unless the executive you work under can get a promotion out of it, and you'll be denied all sorts of help with approvals, launch process, PR, marketing, branding, etc. If it's a startup, they'll try to shut you out with exclusive distribution deals, legal threats, lobbying efforts to change the legal environment, PR campaigns, FUD, etc. The Internet was revolutionary because it let millions of people bring products to market without asking permission. Instead of having to bid for retail shelf space among dozens of entrenched competitors that all had sweetheart deals with the retailer, you could just put up a website and sell it to anyone across the globe. Instead of following hundreds of regulations that governed existing commerce, you could just launch something and sort it out later. AI doesn't really have that property - if anything, it makes things more centralized, with more gatekeepers, and so seems more likely to destroy economic value than add to it. | | |
| ▲ | regularfry 24 minutes ago | parent [-] | | What I think is happening is that the scale of thing you can hope to build at a below-corporate scale should radically grow. Corporate environments should suffer for this, being that inefficient. > YCombinator was founded on that premise - small teams of founders and early employees could be orders of magnitudes more productive than the 1000s of corporate employees at their competitors. I think this is still true, but the theory is: 1. You don't need YC-type funding to do YC-type business any more;
2. You don't need to scale the business past those small teams any more, you just buy more tokens. For clarity YC still obviously has a place as an incubator, mentoring, and networking function. I just think that what was previously the inevitable conclusion that you have to hire all the people the second you hit PMF to keep up with scaling the business as you scale sales is no longer inevitable. If you didn't want to go that way before AI, you were a "lifestyle business" and not worth investing in. As more and more knowledge functions get capably implemented by AI, it's the preferred position: humans are vastly more expensive than tokens, so you want them doing the stuff the AI still can't do. I don't think this necessarily translates to mass unemployment. I think it translates to masses of smaller businesses that are radically more efficient because the handoffs between business functions are tool calls, not emails to someone who doesn't want to help. > The Internet was revolutionary because it let millions of people bring products to market without asking permission. Think about it this way: if I am a small business owner but I think it makes sense to do something that previously only a team in a corporate environment could do but is now within the reach of AI, not only can I do it now, but I also don't have to ask anyone for permission! Who wins between the corporation and the small business in that scenario? > AI doesn't really have that property - if anything, it makes things more centralized, with more gatekeepers, and so seems more likely to destroy economic value than add to it. I think this will turn out to be backwards. I can see a version of this where the number of things you can do without needing to turn to a gatekeeper for help increases to the extent that the balance completely inverts. The vast majority of businesses are small, and AI can give them tools which previously required corporate scale to make sense, without the inefficient hand-offs between busy, political humans. Which is also something that the internet did! Getting an advert in front of a national market pre-internet was Hard but sometimes you had to do it because your target market was "all Canadians who buy toothpaste" or whatever and that meant saturation-bombing the physical environment with physical billboard ads, posters, flyers, and so on. So you only did it if you were P&G-scale. Now you, personally, can do it, trivially, for better or worse. | | |
| ▲ | nostrademons 2 minutes ago | parent [-] | | I dunno if the employees were ever really needed for scale. WhatsApp famously had 300M users and 13 employees at the time of acquisition; Instagram was something like 50M users and 55 employees. If you know what you're doing software scales basically infinitely, and the employees are there to make the software just slightly more tailored to specific user populations (and because upward career mobility for managers involves having more headcount). Yeah, building a revenue model takes people, but Valve employs only about 400 people and makes billions, as do various quant hedge funds like DE Shaw or RenTech. |
|
| |
| ▲ | nonethewiser 2 hours ago | parent | prev | next [-] | | >I would argue that that's been the case for quite some time before AI. I would agree but it's really minimized the building. More and more time is being spent on pre-coding work. | |
| ▲ | beambot 2 hours ago | parent | prev [-] | | Google & Meta are illustrative of late-stage capitalism -- it's all about distribution, not innovation. Their job is (mostly) to just acquire the products that have passed the gauntlet, then scale up their monetization through their distribution-focused machine. The same dynamic plays out in virtually every industry (not just tech). You'll find that most internal "innovation" teams are just lip service. In most cases, the "mothership" will be incapable of reproducing true innovation -- from a statistical perspective, culture perspective (mega corps are anti-scrappy; internal politics), and motivation perspective (startups aren't 9-to-5). It's much easier to have big M&A budgets, a VC arm, and some handwavvy internal innovation group. Every now and again, you'll get real innovations (Waymo, transistors, GUIs), but even those have a spotty track record of commercialization when created internally. |
| |
| ▲ | cogman10 2 hours ago | parent | prev | next [-] | | This is the same argument that has been historically made for outsourcing developers. Get 20 more devs for the cost of 1 dev in the US. I suspect that AI will fail to pan out to the same extent for the same reason why outsourcing hasn't fully panned out (even though every company tries it after getting big enough). The problems that will come up will be and always have been ongoing maintenance. AI is great at writing new code without a brain behind it, but once you get to the point where you need to refactor code, you start really needing someone with coding experience to guide the AI or veto it's mistakes. I don't think that's really fixable even with a lot better AI. It's not something that ultimately comes out of the likes of github data. I'm not saying that AI isn't going to make things better, btw, I just don't think we'll see a 20x improvement. Probably more like 1.5 or 2x. | | |
| ▲ | roncesvalles 2 hours ago | parent [-] | | Outsourcing of knowledge workers didn't work out because at large enough scales, the geographic arbitrage disappeared. Companies mostly always got what they paid for. The determinant of success was only whether the task needed American-tier labor or could make do with sub-American quality labor. | | |
| ▲ | m1coti 2 hours ago | parent | next [-] | | I am not sure this feels right. I agree that the US currently has leading minds in terms of tech, but I am not sure it is too big of difference with the EU knowledge workers. EU is still a lot cheaper then US in terms of wages you would need to pay. | | |
| ▲ | irishcoffee an hour ago | parent [-] | | Sure is an interesting thought. None of this is sarcasm: why do US companies deal with the time zone differences and language barriers they won’t need to bother with so much by outsourcing to say, Ireland? | | |
| ▲ | regularfry 17 minutes ago | parent | next [-] | | The mechanism is often that they'll actually outsource to someone like Accenture, who have teams everywhere, and whose contract managers will try to get their cheapest viable team onto the contract to maximise their margin. If the buyer can't judge the quality of what they're buying, or doesn't know why the resulting hand-offs, delays, mistakes and rework will cost them more than keeping everything in-house ever would have, they're going to have a bad time. | |
| ▲ | surgical_fire 19 minutes ago | parent | prev [-] | | Er, US companies do outsource to Ireland. Basically every big tech has large offices and employ a lot of people there. The limitation is that Ireland is a relatively small country, and most Irish developers are already employed (which is why Ireland end up being one of the main destinations for tech workers being hired from abroad). |
|
| |
| ▲ | asdff 33 minutes ago | parent | prev | next [-] | | Outsourcing of knowledge workers is still ramping up. The issue in the past was the skills were few and far between internationally. Facilities were also not built. That has changed now in a lot of fields. New campuses have been built in places like Bangalore and Hyderabad, even Singapore. The skills are there now, the training is decent, and you can see that the hiring is going on for very skilled titles in these cities. It is a different animal than just 10 years ago in this. | |
| ▲ | cogman10 2 hours ago | parent | prev [-] | | That's certainly part of it. But the other part that I've heard time and time again is that in order for outsourcing to be successful you basically needed an american engineer in the mix hand holding everything, clarifying requirements, and vetoing bad code. That part of dev work, the requirements gathering, attention to details, clarifying requirements, is something AI also struggles with. A lot of companies basically waste time and money on outsourced devs because without a clear path forward they effectively will sit and do nothing, waiting for a prompt. | | |
| ▲ | regularfry 8 minutes ago | parent | next [-] | | I think the mechanism here isn't that American engineers are magic. It's that you need that contextual knowledge really close to where the work is actually being done, so that the turnaround for questions, blockages, clarifications, "we've got no work to do", quality level-setting and so on is on the scale of minutes, not time-zones. | |
| ▲ | m1coti an hour ago | parent | prev [-] | | I would not agree on that point. It really depends on company's structure. I mean it also depends with people that makes the team. I would say there are a lot of unknowns but I would certainly not generalize. How I find your argument is that one distinguished engineer from US could do the same with the use of AI. I worked with both and I know great and bad engineers from both sides. Only thing is that US has a bigger pool of great engineers. |
|
|
| |
| ▲ | layer8 3 hours ago | parent | prev | next [-] | | Who pays for that value, and from what, if all knowledge workers lose their jobs? It sounds like the economy would largely reduce to the small minority class of independently wealthy people. | | |
| ▲ | simonw 3 hours ago | parent | next [-] | | The more time I spend using agent tools the less I worry about knowledge worker job loss. It takes a skilled knowledge worker to use these things. | | |
| ▲ | keeda an hour ago | parent | next [-] | | Yes, but I do worry about junior knowledge worker job loss. These models are very good (and getting better) at the vast dark matter of "donkey work" that happens in knowledge-based industries -- work typically done by junior devs / analysts / lawyers / consultants, paralegals, admin assistants, customer success / support, etc. -- and those roles comprise the bulk of the workforce. And worse, these are the tasks that help the junior people eventually grow into the skilled knowledge workers required to operate models, so there's a pipeline problem too. | |
| ▲ | kansface 2 hours ago | parent | prev | next [-] | | We'll get around to training job specific models or the equivalent. Thats just lower on the value chain for now. | |
| ▲ | layer8 2 hours ago | parent | prev [-] | | Sure. I was challenging the parent on how the “game” they are positing would play out. |
| |
| ▲ | whatshisface 3 hours ago | parent | prev | next [-] | | There were no knowledge workers in the middle ages. | | |
| ▲ | wongarsu 2 hours ago | parent | next [-] | | Back then people were mostly farmers, but we already automated that job away. Not completely, but compared to the middle ages we 50x'd their output. Which is a great illustration what it means to make a job 50 times more productive. We went from 80-90% of the population being required to barely make enough food for everyone to survive, to 4% of the population producing such an abundance that consuming too much food has become a systemic health issue | | |
| ▲ | fodkodrasz 2 hours ago | parent [-] | | At the mere cost of destroying soil, and polluting water and the atmosphere in only 200 years! Possibly this will also play out well, and there is a huge market of... maybe social media influencer economy to pick up those being automated out of their previous work... or rather identity, as actually much like in the middle ages, the modern world also makes the profession largely the identity of the individual. I'm pretty skeptical on the outcomes and the costs also (natural and social as well), but possibly we can have 50x or even more software in the end! The phrase will be truer than ever: > Software is eating the world! | | |
| ▲ | coryrc 34 minutes ago | parent | next [-] | | Maybe ironically, but software and robotics should allow us to scale regenerative agriculture in a way that doesn't leave everyone in poverty. We already have lasers mounted to trailers doing precise weeding instead of broad herbicide usage. https://www.agtechmarket.net/news/laserweeding (random web search, I don't vouch for this site, it just looks legit at a glance) Next innovation could be to scale succession planting, which keeps the ground from being exposed in between crops and lets you transition from nitrogen fixers to users quicker, getting more food out per acre while reducing fertilizer usage. But you can't do that with current harvesters and human labor is too valuable to spend on this. Also take broccoli harvesting, typically you get a few big heads, then it keeps producing smaller heads, but it's not economical to harvest the smaller heads with human labor. Robotic harvesting lets the same plant produce more food per acre and uses the energy needed for new plants instead to keep producing food. | | |
| ▲ | fodkodrasz 26 minutes ago | parent [-] | | Masses will be unemployed, due to robots displacing them, but human labor will also be too costly. We won't be able to afford a person shepherding, but we will need to produce "meat" (substitutes) in plants, or in inhumane animal-jail, and we'll need robot-weedkiller lasers to produce the feedstock instead of letting animals graze... and we'll give the food produced this way to people on UBI... This is where this is going, the whole industrialism is totally self-serving, and for every problem its answer is digging deeper in the rabbit hole, and creating 2 more problems in addition to solving the initial problem only half-way. I don't want to say what you are suggesting is not possibly useful, I just want to emphasize how stuff works out in reality, in addition to doing some nice stuff like what you called out (the halfway solution to the problems). All we get is more alienation and humans getting depressed and feeling a lack of purpose... but somehow we cannot afford to pay fair prices for the agricultural work, and pay fair prices for the food, and not overproduce and overpollute... and the same thing is happening in every aspect of the human condition, not only food production, which is the most basic and ancient activity we have been doing. |
| |
| ▲ | bryanlarsen 15 minutes ago | parent | prev [-] | | [dead] |
|
| |
| ▲ | thewebguyd 2 hours ago | parent | prev | next [-] | | There definitely were what could be considered knowledge workers in the (high) middle ages, it just wasn't the majority of work like today. The knowledge workers then were just a tiny, elite faction, mostly employed by the church or directly by nobility. Kindgoms were still big bureaucracies and needed scribes, theologians, academics, lawyers. | |
| ▲ | jrochkind1 2 hours ago | parent | prev | next [-] | | Relatively few anyway. Monks (who wrote and edited books and managed libraries, and also taught), artists and musicians, bookkeepers/treasury/exchequer, scribes/chancery (who were the administrators of the kingdoms), and bankers all existed in European "middle ages". But a significantly smaller part of economy/society compared to "western world" now, yes. | |
| ▲ | layer8 3 hours ago | parent | prev | next [-] | | There wasn’t 20x value to pay for in the middle ages either. | |
| ▲ | skydhash 2 hours ago | parent | prev [-] | | Are you sure? Any functional organization requires keepers to oil the machine. First the government. The best examples were the chinese empire, the catholic church, and the various kingdoms. Or do you think that everyone was either fighting or farming? Stewardship is knowledge work. Bookkeeping is another. |
| |
| ▲ | rvz 3 hours ago | parent | prev [-] | | > Who pays for that value, and from what, if all knowledge workers lose their jobs? They do not care unless these companies can get a bailout. UBI only exists for companies that are too big to fail. Case in point, 2008 and SVB when there was too much money on the line. One of the AI companies attempted to guarantee themselves a way for the government to bail them out if they were close to defaulting on the debt from the data center build out. | | |
| ▲ | mikeocool 2 hours ago | parent | next [-] | | SVB didn't get bailed out, their investors and creditors were wiped out. You could argue depositors were bailed out -- as they took the undue risk of keeping more than $250k in a single bank (though as part of a requirement for getting a loan from SVB, you had to have your operating accounts with them. So lots of companies had no choice, as SVB was one of the few banks that would lend to them). Arguably, the main impact of securing SVB depositors above the $250k limit is that it prevented thousands of people from being laid off that week, as their employers wouldn't have had the money to make payroll the following Wednesday. | | |
| ▲ | matwood 2 hours ago | parent [-] | | Thank you for saying this. Continuing to point at SVB as a bailout is annoying. They were not bailed out. Anyone with deposits in an accredited bank should be made whole - always. Without trusted banking we have no economy. | | |
| ▲ | anonymars 44 minutes ago | parent [-] | | > Anyone with deposits in an accredited bank should be made whole - always Sure, but is that the case now? Is everyone made whole when a bank fails and they have more deposits than the insurance limits? Or only when it's the well-connected / too-big-to-fail? Looks like the answer is no: https://www.wsj.com/finance/banking/a-small-banks-failure-le... So I don't think it's unreasonable to describe SVB as a bailout. Not for the investors, but for the depositors. Has anything changed to reduce the moral hazard / make it less likely to recur? |
|
| |
| ▲ | fragmede 23 minutes ago | parent | prev [-] | | > UBI only exists for companies What's the U stand for in UBI? |
|
| |
| ▲ | kmac_ 2 hours ago | parent | prev | next [-] | | Producing a thing has always been cheap since personal computers existed. From mail-order software companies' times to SaaS times, producing a sellable MVP was an initial cost that is relatively small compared to the later cost of expansion and maintenance. Marketing and selling was and still is the hardest part. | |
| ▲ | roncesvalles 2 hours ago | parent | prev | next [-] | | Why do you think of knowledge workers as a fungible commodity? What makes you think the people who used to build (or would have built) software will switch into the industry of "knowing that the thing was the right thing to build", as opposed to something cooler like surgery, city planning or experimental physics? The roles within a tech company are not the only jobs in the world. | | | |
| ▲ | OtherShrezzing 2 hours ago | parent | prev | next [-] | | > The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build “There’s more capital than good ideas to fund” has been a complaint from the likes of A16z & other VCs for a long time now. It’s why we ended up with stuff like NFTs getting funded. | |
| ▲ | 2 hours ago | parent | prev | next [-] | | [deleted] | |
| ▲ | radicaldreamer 3 hours ago | parent | prev | next [-] | | If knowledge workers get laid off in mass, you can expect political curbs on AI adoption. | |
| ▲ | KaiShips 2 hours ago | parent | prev [-] | | [flagged] |
|
|
| ▲ | spamizbad 2 hours ago | parent | prev | next [-] |
| I will also tell you, as someone who works at a company that's trying to remain profitable, that token spend has caught the eyes of finance and much like cloud spend they've already started applying pressure to control costs. This May my team is protected to use 30% fewer tokens than we did in April - this was by intention. I suspect we'll drop more in June. |
|
| ▲ | BadBadJellyBean 16 minutes ago | parent | prev | next [-] |
| This assumes that we won't need new hardware in ~2 years. I find that unlikely. So they have to make back what they got up until now PLUS the running upgrade/development costs. So what will it be in 5 years? $20t? $30t? It's all getting a bit outlandish. What I'm often hearing though is the equivalent of "gg ez" when I bring that up. I don't understand how this will at any point blitz scale to profitability. As far as I know they don't have positive cash flow, no one has a moat and I don't think they will push out engineers. |
|
| ▲ | jgbuddy 3 hours ago | parent | prev | next [-] |
| You are making the assumption that the models are only used / paid for by 2.5% of the population (your knowledge workers value). There will be new value created by these models which people are happy to pay for which simply did not exist at all before. It is also naive to say that the hyperscalers are going to be expecting a return on this in 5 years, it will be entirely propped up by investments / IPOs as has been the case with any tech company for decades now to reach scale. The hyperscalers are currently spending ~650b combined annually, which they have the cash for and can sell in future compute instantly. |
| |
| ▲ | specproc 3 hours ago | parent | next [-] | | I'm sorry, what the feck does "value creation" mean here? I live in a place where people are so, insanely squeezed from every angle. Wages are stagnant, prices rocketing. Where is the money to pay for this value going to come from? No one I know feels richer than they did a decade back. I've not been able to meaningfully put up my prices for a decade. People are tired and stressed and scared, particularly scared of a technology everyone keeps telling them will make them redundant. There is no rising tide lifting all boats, just most of us drowning whilst a few whizz past in their yachts. I honestly hope these guys faceplant ASAP. Couldn't happen to a nicer bunch of people. | | |
| ▲ | dirck-norman 2 hours ago | parent | next [-] | | Feelings aren’t fact. A lot of data shows the doomerism is not reflected in the actual numbers and much of it has to do with rapid inflation and continued vibes. Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike. Most social mobility has been the middle class moving into the upper middle class, not moving to the lower class. The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses. Value creation is growth. If it didn’t exist the S&P would still be 42.55$. | | |
| ▲ | geraneum an hour ago | parent | next [-] | | > Feelings aren’t fact... much of it has to do with rapid inflation and "continued vibes". Oh the lost irony. | |
| ▲ | jacobgkau an hour ago | parent | prev [-] | | > Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike. My wages haven't risen for nearly 5 years, while inflation has occurred over the past 5 years. Why the blanket statements? > The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses. Are you suggesting a "housing crisis," in your words, wouldn't impact consumption? I'm watching my spending (and living like a child in his parent's house, except it's not my parent and I have to pay for it) in the hopes that in about a decade, I'll have saved up enough of a down payment for a home somewhere in my state that I could actually afford the mortgage on the remaining amount. There are plenty of things I'd potentially spend money on but won't as long as I feel like I'm economically stuck and have a chance in hell of saving my way out of it. So this feeling translates to fact. If you think my personal experience is just an anecdote and doesn't count because it's not being told through the lens of large-scale numbers, fine. But I really agree with the person you replied to that you're gonna have to be a whole lot more specific than "value creation" if you want people to spend money on your AI products "in this economy," whether it's because they're actually strapped for cash or just pretending like you seem to think they are. |
| |
| ▲ | WarmWash 2 hours ago | parent | prev | next [-] | | Sounds like internet sentiment and not research data. It's kind of become socially taboo to not be suffering "in this economy", but on paper it's hard to see weakness in places that there isn't always weakness. As long as the 65-95% are doing well, there isn't going to be a collapse. | | | |
| ▲ | jgbuddy 2 hours ago | parent | prev | next [-] | | A literal example is that I can use AI to file my taxes instead of spending a weekend and hundreds of dollars to have an accountant do it for me. It costs me like $5. that 245$ delta is the value of that output to me, as long as I am confident it is correct. | | |
| ▲ | mfuzzey 2 hours ago | parent | next [-] | | Seems to be a thing in the US to need specialised software, an accountant or AI to file taxes. In most of Europe individuals at least don't need any of that.
I'm in France and it's just a connection to a government run website to enter a few figures, takes less than an hour most of it is already pre-entered (salary etc), the main thing to add manually is charitable donations. If you're running a business then yes an accountant can be good (or be required depending on the legal form of the business) but not for individuals. | |
| ▲ | asdff 27 minutes ago | parent | prev | next [-] | | Taxes are one of those things that seem difficult and people reach for tooling or expertise without trying initially without, but are pretty easy to do yourself just filling out the forms. | |
| ▲ | moduspol an hour ago | parent | prev | next [-] | | Part of the value of paying an accountant is that you can get representation in case you are audited. Though I guess you did say you were confident it is correct. | |
| ▲ | panta an hour ago | parent | prev | next [-] | | I think that to sum things up, we will have to wait until we can evaluate the cost of the mistakes. You could be lucky but you could also end up with a very negative output value in the longer time frame. | |
| ▲ | WarmWash 2 hours ago | parent | prev | next [-] | | I did my taxes this year too with 5.5 and 3.1 Otherwise normally costs around $800 to do, because I have a small business too. | |
| ▲ | smnc 2 hours ago | parent | prev [-] | | > as long as I am confident it is correct Are you? Does it cost you extra (time or money) to be? | | |
| ▲ | jgbuddy 2 hours ago | parent [-] | | Yes, and they were accepted. A year or two ago I would have been less confident but now almost UX is happy to cite sources. | | |
| ▲ | redfern314 an hour ago | parent | next [-] | | Not speaking to the wisdom of filing taxes using LLMs, but just FYI (assuming US here) taxes being accepted doesn't mean they were correct. It just means the IRS hasn't found anything major wrong (e.g. SSN used on multiple returns). Even being approved isn't a guarantee, an audit could come later. | | | |
| ▲ | 29 minutes ago | parent | prev [-] | | [deleted] |
|
|
| |
| ▲ | deaton 2 hours ago | parent | prev [-] | | Thats the thing; the "increase in productivity" isn't being felt by the general public, the end user. If your "increase in productivity" just means more money being shifted around at the corporate level then it is meaningless. |
| |
| ▲ | mrandish 2 hours ago | parent | prev | next [-] | | > There will be new value created by these models which people are happy to pay for which simply did not exist at all before. True, but I think the GP's point was that what consumers will pay won't be nearly as profitable as what enterprises will pay to increase the output of their developers and knowledge workers. ChatGPT is currently the overwhelming leader in consumer AI usage but only ~5% pay $20/mo. As a recently retired serial tech founder, I'm now one of those consumers. I use AI webchat daily for general search, Q&A and even to write little automation scripts for myself, yet I haven't paid anyone anything for AI yet. Even after being heavily restricted and performance nerfed to hell in recent months, free webchat AI is still fine for everything I do, and I'm not remotely price sensitive. Even as AI compute costs fall over time, I doubt serving ads against AI webchat to consumers will generate the kind of high-margin, sustainable growth VCs get excited about. It's so undifferentiated I bounce around between all four leading providers because there's virtually no moat locking casual consumers to any chatbot beyond a single question thread. I guess if it had a nearly infinite context window seamlessly integrated across all sessions, that might be somewhat sticky for some consumers but it could also get creepy for some others - and it would devour gobs of the scarcest resource in AI. Beyond Maslow's Hierarchy of Needs, the mobile phone is the largest revenue, long-term mass consumer product ever but I just got a new flagship phone from a top-tier provider for $30/mo over 3 yrs. IMHO, even an all-you-can-eat, infinite context window, next-gen Mythos couldn't reach and sustain mobile phone levels of global consumer adoption at ~$20/mo. Unlike professional developers and knowledge workers, consumers don't have any "job to be done" big enough for an LLM to command that much of their zero-sum discretionary spend. | | |
| ▲ | jgbuddy 2 hours ago | parent [-] | | 100%, a driving factor will likely be how good we can make models that are so small they use almost no compute. Until then it is a race for adoption and moat-building (or screwing people over?) once you have users |
| |
| ▲ | Planktonne 2 hours ago | parent | prev | next [-] | | > There will be new value created by these models which people are happy to pay for which simply did not exist at all before What sort of new value, and why will people pay for it from someone else rather than prompting for it themselves? | |
| ▲ | PunchyHamster an hour ago | parent | prev [-] | | But will they pay big actors running top end models for that? You don't need latest openai or anthropic model to go thru your mails, get summary of the some products from web, or to do your to-do list. The AI might very well be used by noticeable % of population daily, but that doesn't mean they will be paying trillion dollars to the leading US AI companies |
|
|
| ▲ | jvanderbot 2 hours ago | parent | prev | next [-] |
| Hey, I wrote this down one time.
I estimated way higher yearly revenue required, to be adversarial. And you can keep the "cost per unit AI work" a parameter and play with the results. But the point is that if people are willing to delegate part of their salary (e.g., buy consumer products), vs requiring employers to pay for the tokens, then it's quite possibly a net win. Something like "I pay a largeish fee every month to make my own job much easier", similarly to how we buy a car to make commuting easier. https://jodavaho.io/posts/ai-jobpocolypse.html |
|
| ▲ | keeda an hour ago | parent | prev | next [-] |
| Putting some more numbers out there (some of the links are broken, but numbers look about right): https://github.com/danielmiessler/Substrate/blob/main/Data/K... Knowledge worker compensation is 35 - 50 trillion a year globally (6 - 12T in the US alone.) That's a huge TAM. It's still close but 5T over 5 years seems doable. >... unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well. The way we make ICs 10x productive is not just making each of them individually more productive, but by removing the coordination overhead of large organizations, because overhead scales super-linearly with the size of the org. And orgs will shrink automatically as AI-assisted ICs take ownership of larger and larger scopes of work, leaving much more budget for tokens. I went into this in a bit more detail along with some made-up numbers here: https://news.ycombinator.com/item?id=48040999 |
|
| ▲ | thesparks 38 minutes ago | parent | prev | next [-] |
| Those are rookie numbers. We are going to blow past $1t per year in spending in no time. As a developer for 29 years, I couldn't go back to coding by hand. For better or worse, AI will be woven into the fabric of life in no time. |
|
| ▲ | onlyrealcuzzo 3 hours ago | parent | prev | next [-] |
| > We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. They are assuming ~10% global GDP growth instead of ~3%. You probably don't need the same %s if the pie grows a ton. I'm highly skeptical we get that growth, but if you aren't, it makes it easier to digest. |
| |
| ▲ | freakynit 3 hours ago | parent | next [-] | | I mean this case with AI-productivity fires itself back when we talk about GDP. The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down. Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently. A third effect also comes into play that once all this starts to happen, common people, who are generally living paycheck to paycheck, will now start to hesitate towards making any long term investment, housing included. And that indirectly will end up impacting financial and banking sector, which will then impact existing savings, bonds yields and retirement funds, and the recession-like cycle starts. This productivity increase only makes sense if it is capped to a very small number.. like 20% max. Beyond that, who these companies will even be selling to? Am I overthinking all this? | | |
| ▲ | simonw 3 hours ago | parent | next [-] | | > The more AI causes productivity increases, the less and less number of workers will be needed. That only holds if companies have a fixed need for "productivity" which is met by their current employees, such that their employees becoming more productive means they need less of them. Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources. But generally yes, the biggest open question about all of this is how the impact will play out on the economy, job opportunities etc. I've not seen anyone come close to a confident prediction about how this will play out. | | |
| ▲ | jbreckmckye 3 hours ago | parent [-] | | > Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources. I mean sure. Every company wants an infinite addressable market. But that doesn't mean it exists. It might not be possible to sell 10x the software we sell today. It might not even be possible to sell 2x | | |
| ▲ | forgetfulness 2 hours ago | parent [-] | | It's hard to imagine how making insurance sales cheaper for the brokers, churning out astrology apps faster, AI boyfriend bots or running ad campaigns with fewer and lower paid designers is going to drive 10% GDP growth in developed and middle income countries, that's the sort of figures you see when very poor countries finish rolling out electrification, sanitation and transportation. |
|
| |
| ▲ | seanp2k2 3 hours ago | parent | prev | next [-] | | >The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down. >Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently. Big tech companies can't even create login flows and account recovery flows that work for everyone yet. There are countless stories of folks losing access to business Instagram accounts that get hacked, Google support from a human to fix a problem that is outside of their help articles is non-existent, etc etc. There's still so much "low-hanging fruit" IMO that isn't particularly fun or exciting to fix, but ask your average non-tech friend or family member what they think of the Facebook + Instagram security settings pages / sites / desktop-only settings. Who is going to pay for all of these subscriptions that will power this GDP increase when average purchasing power of those outside of the top ~10% of earners is decreasing YoY? We're headed toward food and water shortages next to sprawling datacenters, not shared societal prosperity and a healthy middle class. | |
| ▲ | arjie 2 hours ago | parent | prev | next [-] | | First of all, common people are not living paycheck to paycheck in the sense that they're at risk of not having money[0]. This is corporate content marketing that has entered the collective memory of people, not anything close to reality. Secondarily, reducing the cost of making a thing doesn't always mean you get less of a thing. For me, certainly, what happened is that I write way more software than I originally did. When we built compilers, the amount of human engineering effort required to do things plunged, but the amount of software engineering jobs didn't go down. This is as bad as models will ever be. That part is true. And it's entirely possible we go foom. But it's also possible we don't, and then it depends on where the asymptote lands. 0: https://www.slowboring.com/p/this-economic-myth-needs-to-go-... | | |
| ▲ | almogodel an hour ago | parent [-] | | Respectfully, that is truly ignorant. The vast majority of humans do not have any savings and would be in big trouble if regular income ceased. No paycheck no food. It’s wage slavery and it’s pervasive. |
| |
| ▲ | onlyrealcuzzo an hour ago | parent | prev [-] | | > The more AI causes productivity increases, the less and less number of workers will be needed. Why does this have to be the case with AI but it didn't have to be (and wasn't) the case with the steam engine, electricity, the automobile, or the computer & internet? Certainly, AI could be different. It's curious to me why the vast majority of people on here think it must be different. |
| |
| ▲ | seanp2k2 3 hours ago | parent | prev [-] | | And yet the job everyone loves to hate, the humble "burger flipper", continues to resist automation yet command minimum wage labor rates. This future of either being a CEO of a company consisting primarily of AI agents building some monthly subscription-based solution to some trivial digital chores OR manual labor that isn't [yet] fiscally viable to automate seems quite bleak. We'd also need a ton of robot technicians and manufacturing that the US has neither the educational and training institutions to support nor the will of the population to fill. Given the ongoing war on immigration, visas, and foreign-made hardware, if this continues, good luck. | | |
|
|
| ▲ | mv4 an hour ago | parent | prev | next [-] |
| If people figure out how to run agents on-prem (already becoming feasible for both agentic tasks and coding on consumer hardware like Mac Studio 128GB+ or DGX Spark with some models) these companies will be in deep trouble. Privacy is also a huge issue. |
|
| ▲ | jkelleyrtp 2 hours ago | parent | prev | next [-] |
| I agree in principle with the math. But I believe that in reality if revenues don't show up quickly, then lenders will just restructure the debt and defer the payback period. Similar to SF commercial real-estate; many buildings should've come due during the depressed covid market, but lenders (banks) were willing to delay payment until the market picked up again. The scale of these investments put the lenders at substantial risk, so the lenders will do anything to make it work. If the current lenders will be damaged by extended payback periods, they can simply sell the debt to someone else who won't be. |
|
| ▲ | gorgoiler an hour ago | parent | prev | next [-] |
| What value do the big model makers provide other than having a head start on gathering up humanity’s IP to train their proprietary models? What’s their moat? Is it hoping for regulatory capture where scraping is made illegal the day after they finally finish scraping all human language? It’s like OpenAI dammed the Colorado, and Anthropic dammed the Hudson, and now they’re both trying to sell us bottled water subscriptions at $100 a month. I don’t know how well the dam part of the analogy holds up, but the water part feels strong. Compiling models based on humanity’s written output feels like something no corporation should own. |
|
| ▲ | tedggh an hour ago | parent | prev | next [-] |
| Also, not all developers work on software products. The vast majority of developers work supporting software solutions as part of a much bigger business model, such as infrastructure, industry, healthcare and services. Many of these are complex organizations. So, unless you get to turn every employee into a 10x employee, the 10X coder along won’t necessarily make a 10X productivity contribution. What’s likely going to happen is the 10X coder will start to slow down or adding more (unnecessary) complexity to avoid having to sit and wait on overhead, for other areas of the business which are not easily automated away to AI to catch up. As a developer I can finish my project in June instead of December, but what if the customer is still not ready for integration until December? what do I do? |
|
| ▲ | datsci_est_2015 an hour ago | parent | prev | next [-] |
| I could see such productivity gains being possible, if only because the current tooling around LLMs is terrible. The fact that we have 30 blog pieces per day making the front page of Hacker News about someone’s convoluted system to guide LLM output to something reasonable is absurd. There needs to be standardization in tooling, and it needs to be open source. Then, and only then IMO, will we see huge productivity gains. But, at that point I think the big players’ moats will have dried up. Local models will probably be sufficient for 99% of daily office worker tasks. So I disagree with TFA’s premise. I think this fear is probably shared amongst the LLM giants, and they’re still hoping that neural network transformers are somehow the path to AGI (probably not, imo). |
|
| ▲ | cryo32 2 hours ago | parent | prev | next [-] |
| This is never going to materialise. It’s dead in under 2 years. The market is shrinking and saturated already and it’s not because of AI gains but geopolitical instability and supply chain issues, some of which are caused by AI spending and stupid ass PE firms refocusing on AI supply chains. Only our pensions and futures burning. |
| |
| ▲ | aspenmartin 2 hours ago | parent [-] | | What do you mean by the market is shrinking? | | |
| ▲ | cryo32 an hour ago | parent | next [-] | | Literally revenue is collapsing in most sectors. Technology purchasing is declining. Service models are failing to turn a reasonable ROI. People stopped buying shit. | | |
| ▲ | aspenmartin an hour ago | parent [-] | | Wait do you have any numbers to back this up? Every number that I've seen contradicts this. Most sectors have positive revenue growth, even non tech sectors. Technology purchasing is increasing in every bucket (software, IT services, devices, communications, and of course DCs). Retail and food-service sales are up MoM and YoY. Personal consumption is up 0.2% in real terms. I assume by service models you're just talking about AI? I actually may agree with you but this is clearly not true for long if it is true today. |
| |
| ▲ | packetlost an hour ago | parent | prev [-] | | It's consolidating into fewer, higher value assets. Over 40% of the S&P500 is in companies that are heavily (potentially over) invested in AI. | | |
| ▲ | aspenmartin an hour ago | parent [-] | | tech companies have grown disproportionately to other industries, but that says nothing about the growth in other industries - S&P has a Q1 2026 blended revenue growth of 11.3% according to FactSet
- most sectors are growing, not just tech |
|
|
|
|
| ▲ | browningstreet 3 hours ago | parent | prev | next [-] |
| Somehow Uber and WeWork survived the same kind of grand projections that they never met. |
| |
| ▲ | 121789 3 hours ago | parent | next [-] | | uber sure....but how did wework survive? they are a smoldering husk of a failed company looted by its founder | | |
| ▲ | hamdingers 3 hours ago | parent | next [-] | | I'm sitting in one right now and don't see any smoldering... | | |
| ▲ | khuey an hour ago | parent | next [-] | | They literally went bankrupt and wiped out the original shareholders. | | |
| ▲ | hamdingers an hour ago | parent [-] | | I guess I'm just not clued into your exotic definition of "survived" if continuing to function doesn't qualify. I tend to go by the dictionary definition. Chapter 11 is not Chapter 7. Businesses survive chapter 11 bankruptcies all the time. For example, WeWork. |
| |
| ▲ | kevin2107 2 hours ago | parent | prev [-] | | lmao. I'm sitting in Hiroshima and nothing is burning |
| |
| ▲ | naravara 3 hours ago | parent | prev [-] | | The company’s gone but the assets just got sold to other commercial real estate firms. Uber was basically only ever software to help people use their own cars so a very small part of their valuation was physical stuff to upkeep, it was just deals and obligations they had. Not sure how it shakes out for Anthropic and OpenAI. There’s a lot of physical capacity that needs to be built out and can depreciate. But there’s also a lot of network effects and dependencies being built in with enterprise users. I don’t know how swappable the tooling is either. I think over the long term the UI, model training and documentation, and infrastructure are going to end up being run by different parties and I’m not sure which leg of that chain ends up in a position to skim most of the profit off. My guess is that Apple and Google end up raking in all the money since they control the OS and app stores while the rest of the stack gets driven down to being generic commodities. At least where mass market consumer adoption is concerned. |
| |
| ▲ | tapoxi 3 hours ago | parent | prev | next [-] | | I don't think Uber was doing $1 trillion in infrastructure spend. | |
| ▲ | windexh8er 3 hours ago | parent | prev | next [-] | | The difference is that they had room to charge more of their customers and pay less to their workers. The AI industry doesn't have both sides to play at this point. Training and inference are getting more expensive and if you take on the high prices now you're just floating yourself further downstream from profitability long term (which does not look viable for any of them currently). | |
| ▲ | paxys 3 hours ago | parent | prev | next [-] | | WeWork absolutely did not survive | |
| ▲ | hansmayer 3 hours ago | parent | prev | next [-] | | Funny you should mention Uber. What was it their COO said recently about the AI costs? | | |
| ▲ | simonw 2 hours ago | parent [-] | | I quoted exactly what they said in my piece, under the heading "The AI-failure stories around this are pretty thin": https://simonwillison.net/2026/May/27/product-market-fit/#th... > But then you sometimes go and talk to your senior engineering leaders and you’re saying, OK, how many projects that were on the cutting room floor got moved above the line because of the productivity gains because 25% of our code commits were via Claude Code last quarter? > That link is not there yet, right? I think maybe implicitly there’s more that is getting shipped. But it’s very hard to draw a line between one of those stats and, OK, now we’re actually producing like 25% more useful consumer features, right? And that line is hard to draw. That's pretty weak sauce. I don't think that justifies the headlines that came out of it, personally. | | |
| ▲ | hansmayer 2 hours ago | parent [-] | | ? What are you talking about mate? The man all but says "this shit does not work for us".
It iss layered in that careful, sanitised corporate shit-sandwich communication approach, where you take a nice piece of shit and layer it in between two slices of avocado so its sweeter to swallow for the "consumer" of your message. He also said in that article that what prompted the discussion was the public statement by the Uber CTO that he had already burnt through his organisations yearly AI-budget in April. Please stop this shilling mate, and trying to hide the overall perspective between this or that word. | | |
| ▲ | simonw 2 hours ago | parent [-] | | Did you read my piece? I covered the Uber CTO thing too: https://simonwillison.net/2026/May/27/product-market-fit/#th... > The most discussed has been Uber, based on this report where CTO Praveen Neppalli Naga indicated that Uber had “maxed out its full year AI budget just a few months into 2026”, mostly thanks to Claude Code. > Given that Claude Code only got really good in November it’s entirely unsurprising to me that a budget set in 2025 may have failed to predict demand for that tool in 2026! |
|
|
| |
| ▲ | PunchyHamster an hour ago | parent | prev | next [-] | | uber doesn't own trillion in cars | |
| ▲ | xoac 3 hours ago | parent | prev | next [-] | | somehow the invisible hand of the market is also blind af | | |
| ▲ | ArcHound 3 hours ago | parent [-] | | Makes sense if you think about it: if all photons pass through you (invisible) then you can't capture them to get info (blind). |
| |
| ▲ | seniorThrowaway 3 hours ago | parent | prev [-] | | [dead] |
|
|
| ▲ | allthetime an hour ago | parent | prev | next [-] |
| lol I’m spending max $50/month right now on a couple light subscriptions and my velocity is insane right now (full stack mobile app development) I’m leaning into it hard while these cheap plans still exist and building out a big platform that I can easily generate new apps from. Hoping by the time the rug pulls I can just go back to hand cobbling these apps together from the modules I’ve pumped out and never even consider giving these companies a massive portion of my monthly income |
|
| ▲ | jstummbillig 2 hours ago | parent | prev | next [-] |
| > 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer. This is where the napkin math is breaking down in a big way. There is absolutely no reason to assume this will only impact "knowledge workers". Farmers use computers. Farmers will use AI. |
| |
| ▲ | vablings 2 hours ago | parent | next [-] | | AI for what? None of the AI a farmer could or would use would be any more meaningful that light chatbot usage or already existing computer vision/gps | |
| ▲ | quantumleaper 2 hours ago | parent | prev [-] | | The kind of farm that would use AI is already 99% machinery and automation. |
|
|
| ▲ | dcre an hour ago | parent | prev | next [-] |
| 1. Global IT spend is $6T per year 2. Where does this $5T number come from? If they make $4T in revenue over the next 5 years instead, what happens? |
|
| ▲ | TimTheTinker 3 hours ago | parent | prev | next [-] |
| I thought Anthropic and OpenAI's combined CapEx has been <100B? source: https://isaiprofitable.com/ |
| |
| ▲ | kilroy123 3 hours ago | parent | next [-] | | That site needs Apple on the list. ;-) | | |
| ▲ | Danox an hour ago | parent [-] | | Why? All their money is going to Apple Silicon and the five ecosystems, so far in Apples entire history, the largest acquisition has only been $3 billion dollars, OpenAI is currently getting nothing and they gave Google a measly $1 billion refund per year for the use of Gemini. If John Ternus wants to spend some money, spend it on bringing memory in house. Apple has the money and the engineering talent to do so, have it fab/made onshore in partnership with TSMC. Do it Apple because you have to not because you want to the Chinese probably will be taking over the memory industry, worldwide, by taking advantage of the greed from three memory companies and their AI overlords. | | |
| |
| ▲ | deaton 2 hours ago | parent | prev [-] | | Maybe so far, but they've committed to well over a trillion in future capex. |
|
|
| ▲ | golly_ned 2 hours ago | parent | prev | next [-] |
| This is why 'agents' are the solution for these companies. Token spending goes through the roof. As long as a human is in the loop needing to read or review at human speed, that's a ceiling on how many tokens per user they can generate. |
|
| ▲ | yalogin an hour ago | parent | prev | next [-] |
| To get that revenue and adoption they have to vastly increase their infrastructure spending. If they are currently losing in even the 200/month plans how is it sustainable? |
|
| ▲ | npn 2 hours ago | parent | prev | next [-] |
| we all know it is impossible goal to make. surely AI will be even more useful in the future, but as long as china exists and continue to undercut the price, the goal will be never meet. > We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer. with that much money, the companies can easily buy their own hardware and hosting free public models, no need for those expensive subscriptions. |
|
| ▲ | ciconia an hour ago | parent | prev | next [-] |
| > make developers 2x, 5x, 10x as productive on stuff that matters What does this even mean? Is this about speed of development? Is this about headcount? LoC? How are coding agents contributing to productivity in places like GitHub, Shopify or Meta? I mean companies that already have an established product. I really wanna understand this because I'm not seeing that GitHub's product suddenly became so much better than it was 2 years ago, so where's all that productivity going? |
| |
| ▲ | zamalek an hour ago | parent | next [-] | | The productivity is going into perverse incentives[1], e.g. we have improved (by which I mean "increased") token use. More PRs every day. More lines of code. All things we knew were shit-brained metrics a decade ago (obviously except token use). We've also increased how much our coworkers need to read, or deal with. You can get an AI to make any point you want, so you can ignore the 5 humans raising alarms due to the 1 clanker you made say what you want to hear. All numbers going up. There are obviously people producing additional true value with it, probably, but that's almost certainly scarce. [1]: https://en.wikipedia.org/wiki/Perverse_incentive | |
| ▲ | flexagoon an hour ago | parent | prev [-] | | Productivity is measured in the number of AI-generated Twitter posts developers can make about their AI-generated startups |
|
|
| ▲ | amelius an hour ago | parent | prev | next [-] |
| At least they're not going to make us watch ads. |
|
| ▲ | ar_lan 3 hours ago | parent | prev | next [-] |
| > unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well. Simple - you make them work 2x, 5x, or 10x more hours. |
| |
|
| ▲ | jmyeet 3 hours ago | parent | prev | next [-] |
| YEPPP... and I'm kind of shocked at how many people can't do simple math. Let's put it context. Google's annual revenue seems to be north of $400B. So if OpenAI suddenly had Google's revenue, it would still be insufficient to recover their investment. and it's a ticking time bomb because $1T in servers, CPUs, GPUs and memory is going to be worth $200B in 5 years. You can say they can keep using what they've got. Sure. But they're also not going to stop spending on new hardware. And the competitor that comes along in 5 years and spends $1T doing the exact same thing is going to have a huge advantage. OpenAI at this point reminds me very much of the Russ Henneman pre-money hype cycle. |
| |
| ▲ | hansmayer 3 hours ago | parent | next [-] | | This should be the top comment. Also, I think its not that many people, including our Simon here, are not good at math. Its more like, some of them seem to be incentivised to not be cough, cough, "good at math". How else will the hype sell? | | |
| ▲ | simonw 2 hours ago | parent | next [-] | | I thought my post was pretty free of hype. I said that this new revenue "Maybe even enough to start covering their costs!" | | |
| ▲ | WhrRTheBaboons 2 hours ago | parent | next [-] | | that statement is pretty high on hype relative to the actual financials though | |
| ▲ | hansmayer 2 hours ago | parent | prev [-] | | Well, your title certainly was not, in any case! | | |
| ▲ | chipotle_coyote an hour ago | parent | next [-] | | I mean, a company that loses money on every widget they sell might technically have found "product-market fit." :) It seems quite possible to me that developer tooling is going to end up being the biggest win from LLMs because there is a product-market fit -- and also quite possible that OpenAI and/or Anthropic end up getting bought for pennies on the dollar because their burn rate is unsustainable. AI may end up being this generation's "dark fiber." | |
| ▲ | simonw 2 hours ago | parent | prev [-] | | [dead] |
|
| |
| ▲ | Imustaskforhelp 2 hours ago | parent | prev [-] | | At a certain point, I genuinely feel like the best way this hype is being sold is by making people genuinely believe in it. and in that sense, if Anthropic and OpenAI are able to create the projection that they can-be profitable despite finances seeming bubbly at best, I think that what happens is that these companies spew so much amount of content that people like Simon get into it too. There is a deeper problem of people falling into AI psychosis too, in general, I am not sure if Simon has fallen into it or not I think that the greatest point which can be made here is to not offload your thinking to others and to think about the situation yourself. Sounds familiar (looks like we are all off-loading our thinking itself to machines) Side-note: As humans, we have a tendency to quickly judge or make quick decisions which stems from our times foraging and scavenging in jungles. Another Side-note: at a certain point, I am unsure of how much to think about AI or not, certainly discussions about it that were happening 2 years ago weren't helpful in contexts that they are used now (well not in any way or form that a person discussing and getting into the weeds of AI 2 years ago is better than a person just getting into it say 2-3 months ago) With the industry (moving so fast) [but that doesn't mean that you can't catch up with it, I feel like the fast word has made people think that they are falling behind which is imo wrong i suppose]*, It is basically unsure to me of any FOMO or anything if you aren't using AI already, I find this notion naive. People might be making strong opinions (AI psychosis) and skills on the tools available at the moment the same done 2 years ago. We don't quite know about the tech as these are still black-boxes and how they progress and what these "AI skills" might survive or not in future. Heck, we aren't even sure if these tools might survive or not or wouldn't be made magnitudes more expensive simply to break even as they are given to us for the first time at percentages of the price. I don't know if I should form (strong) opinions yet and also a question of its worth so much thinking efforts in the first place, probably just gonna do my own thing (the way I want to) which includes learning C at the moment. because learning is fun. | | |
| ▲ | simonw an hour ago | parent [-] | | I didn't exactly say that they were about to become wildly successful companies. I suggested that they had "found product-market fit" - not too impressive for more than a decade of work - and that their revenue may even be "enough to start covering their costs". | | |
| ▲ | Imustaskforhelp an hour ago | parent [-] | | Firstly thanks for responding and I wish you to have a nice day. your suggestions have value and I appreciate you writing the article. Perhaps enterprise businesses do end up becoming the fat and meat of the AI industry. My question which I wish to ask: What would happen to these AI companies if they turn out to be anything but wildly successful companies, both to the investors who have already invested in it and to those who might be investing indirectly into it in the near-future (passive investors, retirement funds) I would love to hear your thoughts on it! Thanks and have a nice day :-D | | |
| ▲ | simonw an hour ago | parent [-] | | > What would happen to these AI companies if they turn out to be anything but wildly successful companies I'm not nearly enough of an economist / finance person to answer that credibly, but I expect they'll go bust, and a lot of people will lose their shirts. ... and the model weights will be sold to other companies who will then run them at a profit, and eventually figure out an economically sustainable way to train new ones. The 1800s railway booms are a good comparison here - a lot of companies went bust, a lot of investors lost money, and we still ended up with railways. If the AI companies all go bust we're going to have a lot of spare data center capacity! | | |
| ▲ | Imustaskforhelp 29 minutes ago | parent [-] | | > If the AI companies all go bust we're going to have a lot of spare data center capacity! I can be wrong I usually am but an AI DC != compute DC or that it might decrease the prices of servers substantially because of it. (well not exactly, I hope you read my whole message so that I am able to better explain what I am saying.). AI DC's try to optimize for one thing: running GPU's for immense scalability and flexibility (0 to numbers>=large_number). Currently, its actually way worse, the server providers are some of the worst impacted by the industry at the moment because each server requires ram and ram is well... increasing in its price exponentially. It's really a tough time to be a provider at this time (in certain respects) directly because of AI. It is unclear to me if spare DC capacity will have any meaningful impact to it. I don't think that atleast within compute (and not GPU/AI DC), that space was too large of a problem. Fun fact but one of the largest providers (BuyVM) had its datacenter price from where they colo'd increase because of the immense demand at the moment for spots in datacenters by many tens of thousands of dollars that they did the first price hike in at this point at decades! The situation is this dire :-( Ram prices might come falling down and DC's might get cheaper but they can only get cheaper to limit, they still need to for example DC security employees and I wish to suggest that if anything, investors might wish to re-coup their losses within the AI loss, they might want to make up with what little they might have (ahem DC) For example, if you wish to want to take at an even more egregious example of what I am suggesting, there are many new york LLC's who would much rather leave the properties that they own empty rather than decreasing the price of what it costs (which they have set to some egregious amounts). I think that for them, somehow the math ends up working out in the end somehow, so there might be something more to it. I wish I was optimist but I don't believe that the gains in spare data center capacity are worth even a fraction of fraction of the damage if AI were to go bust as you suggested with trillions of dollars vanished. So, with the data I have at the moment, I am unable to suggest that compute would be cheaper. Heck, it was cheaper before AI and compute prices have never been something that people worry about because there are sometimes 10x cheaper options than AWS,GCP,Azure with things like Hetzner/OVH and others (yes its not a 1:1 situation but still its a 95% overlap and for all intents and purposes, great) I can see a potential where GPU compute can get cheaper, oh boy, its so much more expensive than compute but I feel like GPU's aside from AI might still have a much more limited niche than generic CPU. The issue wasn't ever the pricing. Simon, I own 7$/yr vps's which run my websites fine because they are written in golang. I doubt it can get cheaper than it. (You can get a 3$/yr vps if that is what you are interested with using Nat VPS + cf tunnels) I would once again appreciate to hear your thoughts on it. The only thing I realistically see is if Ram producers ramp up their productions and create a ram price glut in the next few years, but imo the prices would even out over the long term. I have seen the point of spare DC capacity being raised up multiple times but I finally ended up writing a message which hopefully captures the nuance, but once again, I don't know the future about it. Waiting for your reply and have a nice day Simon (& other readers) and thanks for reading if you did, I appreciate it :-D | | |
| ▲ | simonw 8 minutes ago | parent [-] | | I think we are in agreement that if the bubble bursts a lot of people will lose a lot of money. I don't have a strong opinion on the data centers, my main point is that I don't think AI "just goes away" if the bubble bursts, which seems to be something that a lot of people assume. |
|
|
|
|
|
| |
| ▲ | mfuzzey 2 hours ago | parent | prev | next [-] | | It's actually worse than that. It's not just financial depreciation or that the existing hardware becomes obsolete due to being less powerful than new hardware but also that hardware being run all the time at high load actually has a limited lifetime of a few years so it will physically break... | | |
| ▲ | jmyeet an hour ago | parent [-] | | I agree but it's even worse than that. Data centers come down to performance-per-Watt. Electricity accounts for 20-30% of a data center's operating cost [1]. I don't know the exact breakdown but the GPU part of that is probably the majority given how power hungry GPUs are. The B200 is upwards of 1200 Watts [2]. The B200 is rated at ~4.5PFLOPS of dense FP8. So you're getting 3.75PFLOPS/W. We don't know what the next generation will look like. The A200 (Hopper architecture card that preceded the B200) had ~4PFLOPS apparently but also lower power consumption. Obviously this changes depending on whether you're looking at dense or spare and FP8 vs INT8 vs INT4 vs FP4, etc so we're just using FP8 as a yardstick. Imagine a fictional B200 successor, the T200 that has 8PFLOPS of dense FP8 at 1000 Watts. Well then a DC built on that where the T200 will likely cost similar to what the B200 does now, you'll get nearly double PPW so the same size DC and same electricity load is going to be like 2 of your old DCs in operating costs. That's a big deal when you've laid out a trillion dollars. [1]: https://iaeimagazine.org/electrical-fundamentals/how-much-el... [2]: https://www.trgdatacenters.com/resource/h200-power-consumpti... |
| |
| ▲ | mountainriver 2 hours ago | parent | prev | next [-] | | How could extremely capable artificial brains ever pay for themselves? | |
| ▲ | WarmWash 2 hours ago | parent | prev [-] | | Prices are not going to stay where they are. You have either never seen a tech cycle, or need to be reminded of that. The pressure to buy more expensive plans is already starting to form. |
|
|
| ▲ | logtempo 2 hours ago | parent | prev | next [-] |
| > +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending. Except that if your company go 20% faster than the others companies, you win market shares. But then, everyone will use the same tools and companies will be at even speed, but the tool will stay. Now...if the market is saturated, it's useless to try to do things faster. Cheaper yes, but not faster. |
| |
| ▲ | archagon 2 hours ago | parent [-] | | Pretty much all major tech companies today are horribly bloated and mostly metastasizing instead of innovating. I'm not sure how 20% increased productivity will help in any way with that. If anything, it might accelerate enshittification and turn potential customers off even more. |
|
|
| ▲ | mirekrusin 2 hours ago | parent | prev | next [-] |
| Now try to take back llms from developers and see what happens. |
| |
| ▲ | bigfishrunning 2 hours ago | parent [-] | | If, by some miracle, all LLMs ceased working right this second, any developer who would no longer be productive should not have been a developer in the first place. | | |
| ▲ | mirekrusin an hour ago | parent [-] | | True, but they will not want to work for you anymore, they'll want to work for company that provides it. |
|
|
|
| ▲ | deaton 2 hours ago | parent | prev | next [-] |
| Bigger than that, they have to contend with open weight local inference. Open weight models right now haven't caught up to the frontier models of right now, but they're as good as the frontier models of not too long ago. If open weight models reach a certain point, then frontier model providers are going to struggle to make anything selling tokens, because eventually people will realize they don't need Mythos for everything. |
|
| ▲ | aprdm 2 hours ago | parent | prev | next [-] |
| "Next 5y" doesn't apply to AI factories |
|
| ▲ | sowbug 3 hours ago | parent | prev | next [-] |
| There is also the EV (expected value) of developing AGI. Even if you personally believe the probability is low within the lifetime of either of these companies, the value would still be extraordinarily high, enough to forgive a $5T or so miscalculation here or there. |
| |
| ▲ | jbreckmckye 3 hours ago | parent [-] | | I don't think AGI was ever a serious endeavour, just something the labs talked up to grab attention. I am willing to bet a Twix we'll look back on that stuff in 2 years with a lot of embarrassment | | |
| ▲ | sowbug 2 hours ago | parent [-] | | The high-risk side of that bet would need to win more like a lifetime supply of Twix. But in a post-scarcity nirvana, everyone already has that. So sure, you're on at even money. See you in two years. | | |
| ▲ | deaton 2 hours ago | parent [-] | | Theres no reason to believe, based on recent trends, that AI would lead us to a post-scarcity world, even if it could do all of our jobs better and cheaper. | | |
| ▲ | sowbug 2 hours ago | parent [-] | | I'll wager a hypersled of my Twix against your next three rations of gruel. But I think I'm done betting after this one. |
|
|
|
|
|
| ▲ | superxpro12 2 hours ago | parent | prev | next [-] |
| It's going to be a typical saturation curve. A lot of upfront tokens spent on things that have stockpiled over the years, and then the derivative on token spend trends to zero as the users run out of immediate things to try. Sure there will be ongoing maintenance and experiments, but it wont be nearly as close as the initial inrush. |
|
| ▲ | TacticalCoder 16 minutes ago | parent | prev | next [-] |
| > We're not there yet. And that's not considering that capitalism is going to do what it does best: if they really found a way to be profitable, competitors are going to fight them on pricing. Anthropic, OpenAI, Google, etcetera 's margins are a competitors' opportunities. It's not as if there weren't chinese models nearly SOTA. Don't know where the french (Mistral) are but they may try to get in the game if there's a way to be profitable (not that France or the EU for that matter are relevant in anything tech or had any tech company besides ASML and SAP in the Top 100 but who knows). |
|
| ▲ | PunchyHamster an hour ago | parent | prev | next [-] |
| That assuming once they start squeezing people won't just go to deepseek or other cheaper competition > That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending. And most research shows people far over-estimating their own gains. Once companies start counting the actual (and not just reported) gains, the AI budgets will be more limited as people realize it's an useful and versatile additon but not replacement for most types of work > We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well. Upswing of the hype cycle while growth of tech itself is flattening, both coz of techs innate issues (which might or might not be solved, but some papers claim they are unsolvable with current approach) and just the fact the spike in growth caused so high economy cost that it put brakes on itself. T |
|
| ▲ | EGreg 4 hours ago | parent | prev | next [-] |
| Here is a serious question.. Can we sell into the hype cycle and on the way down with this: https://safebots.ai/costs.html |
| |
| ▲ | adithyassekhar 3 hours ago | parent [-] | | I asked claude to generate a frontend and it made the same template. Same san serif and serif fonts together. Same colors. Same typography. Same layout and animations even. It’s wild how similar it is. No not similar it’s the same damn thing. | | |
| ▲ | dd8601fn 3 hours ago | parent | next [-] | | I’ve seen the same dashboard for a dozen custom web applications now, including a couple I had it make for me. It really does have a particular lane for each chore, and it’s reproducible. | | |
| ▲ | properbrew 3 hours ago | parent [-] | | Yep and when you see it in the wild it stands out like a sore thumb, absolutely no thought into a bit of a unique design or branding. I have a few live websites built using LLMs and they will just go for default generic templates and colours if there's no vision. |
| |
| ▲ | jeffreygoesto 2 hours ago | parent | prev [-] | | It produces the "most average" web design unless you really prompt your way out, isn't it? If you don't care enough to prompt, Claude does not care to be individual. | | |
| ▲ | WarmWash 2 hours ago | parent [-] | | Technically from claude's POV, it's one individual copied millions of times. All claudes are clones. |
|
|
|
|
| ▲ | 3 hours ago | parent | prev | next [-] |
| [deleted] |
| |
|
| ▲ | mannanj 2 hours ago | parent | prev | next [-] |
| One quick question. Did tax payer money fund these data centers? If so, how does that money translate to their profit and a return for the people whose work paid for the resources? Or did we just get scammed? |
|
| ▲ | YetAnotherNick 3 hours ago | parent | prev | next [-] |
| > $5t to $10t to make back in the next 5 years Wait what? They spent 2 order of magnitude less on hardware. |
| |
| ▲ | trjordan 3 hours ago | parent [-] | | From the verge: https://archive.is/kU4Zg > Gartner forecasts that large AI companies would need to earn cumulatively close to $7 trillion in AI-driven revenue through 2029, which is close to $2 trillion per year by the end of the period. In order to achieve “historic returns,” the providers would need to earn nearly $8.2 trillion in the same period. | | |
| ▲ | YetAnotherNick 3 hours ago | parent | next [-] | | Those numbers don't even track even in the same sentence. If it is $2T/year by the end of 2029, it would be something < $6T cumulative in 3 years. | | |
| ▲ | layer8 3 hours ago | parent [-] | | “Through” 2029 is a bit more than three and a half years. The $2T are likely the yearly average of the $7T in that period. |
| |
| ▲ | b0r3dthisD4y 3 hours ago | parent | prev [-] | | The numbers are made up political correctness anyway. Everyone's agency is 100% captured by belief in Wall Street. Too few <50 have any meaningful labor skills to blink. We'll continue to have consent manufactured via media platforms and in 3 years no one will bat an eye at these companies being worth $12 trillion as Altman and Musk climb two ladders holding a "mission accomplished" banner. |
|
|
|
| ▲ | HDThoreaun 3 hours ago | parent | prev | next [-] |
| Source on 200 million knowledge workers worldwide? My understanding is that it's just above 1 billion. I dont think a billion subscriptions at $1000/yr is out of the question but it might take a decade to get roiling |
| |
| ▲ | swatcoder 3 hours ago | parent | next [-] | | You're suggesting that 1 in 8 people worldwide, including every one from infants and the elderly, are knowledge workers. Are you sure that's what you mean? I'm not even sure that 1 in 8 people I know would qualify as a knowledge worker, let alone a knowledge worker that might profoundly benefit from on-the-horizon AI. And I'm in a highly skewed population. | | |
| ▲ | WarmWash 2 hours ago | parent | next [-] | | I think the underestimation is how many people want a personal knowledge worker in their pocket, and are willing to pay ~$65/mo for it. | | |
| ▲ | swatcoder 2 hours ago | parent | next [-] | | Personally, I've only encountered any of those people on line, and almost exclusively here on HN. Most people I've met -- and again, in a pretty darn skewed sample globally -- see $65/mo as a lot of money to spend on technology of any kind and can't think of anything much they need from "a personal knowledge worker in their pocket". I don't know a single person in real life who remains excited about AI at all, and only a few software engineers who feel it'd be worth that much. Everybody seems to be mostly confident with the "knowledge productivity" in their personal and professional life and a pretty skittish about spending in today's economy. Most would be excited about a magic new robot that affordably saved them from unwanted physical labor and drudgery, but nobody needs much real help making appointments or filling out forms or whatever. That's not to say I won't be proved wrong some day, with some further innovations in AI products, but global-scale demand isn't waiting for anything that's been released so far. | |
| ▲ | gloryjulio an hour ago | parent | prev [-] | | The competitors of $65/mo subscriptions are the free models and services that are good enough. It will only get worse as open models or free tiers catch up. For most people, they just use whatever that's free |
| |
| ▲ | HDThoreaun 3 hours ago | parent | prev [-] | | Well around 40% of people work. I dont think its crazy to say around a third of jobs are knowledge jobs, but what do I know | | |
| ▲ | matthewowen 3 hours ago | parent [-] | | 85% of the world population lives outside of developed nations. 27% of the world's workforce is in agriculture (contrast to the US where it is 1-2%). 15% in manufacturing. A lot of people work in "services" (especially in high income nations, where it's roughly three quarters) and some of those are knowledge workers... but a huge number of them are nail technicians or hairdressers or bartenders (etc etc). |
|
| |
| ▲ | hibgymnb an hour ago | parent | prev | next [-] | | A billion subs at 1k a year???? I see a lot of out of touch takes here but this might take the cake | |
| ▲ | rootusrootus 3 hours ago | parent | prev [-] | | A billion? Really? At 200M you’re already including a lot of people that stretch the definition of knowledge worker. | | |
| ▲ | naravara 3 hours ago | parent | next [-] | | A lot of those ‘edge cases’ in the definition of “knowledge worker” are probably the stuff that’s most likely to have significant parts of the work augmented or replaced by AI agents. Like, call-centers are almost certainly going to get turned over in a big way. It’s not like the median tier-1 support operator just reading off a script is much better than an LLM anyway. | |
| ▲ | esseph 3 hours ago | parent | prev | next [-] | | Yeah, just looked into this. Knowledge workers is a big group and probably much larger than you think it is. Basically if you're not doing manual labor, it's probably knowledge work. Roughly 1/3rd of the working population. Some data tucked in here: https://gist.github.com/danielmiessler/2dc039762a202b083753b... | |
| ▲ | HDThoreaun 3 hours ago | parent | prev [-] | | > At 200M you’re already including a lot of people that stretch the definition of knowledge worker. How do you know this? Im certainly open to recalibrating my numbers which is why I asked for the source | | |
| ▲ | windexh8er 3 hours ago | parent [-] | | What's your source, because it looks wildly out of proportion compared to numbers we have now. | | |
| ▲ | Andoryuuta 3 hours ago | parent | next [-] | | To add an actual source to this thread, a brief paper by researchers at the International Labour Organization (ILO) states that for knowledge workers globally "... there
are between 644 and 997 million jobs, which represents
between 19.6 per cent and 30.4 per cent of global
employment respectively." [1] [1]: Berg, Janine and Gmyrek, Pawel, Automation Hits the Knowledge Worker: ChatGPT and the Future of Work (April 21, 2023). UN Multi-Stakeholder Forum on Science, Technology and Innovation for the SDGs (STI Forum) 2023, Available at SSRN: https://ssrn.com/abstract=4458221 | | |
| ▲ | windexh8er 2 hours ago | parent [-] | | Globally, sure. The assumption here is all users are on the same economic footing, they are not. Only about a 1/3rd (at most) of that count can afford $1000+ monthly, and even then that is wildly out of line with what most will. |
| |
| ▲ | elliotec 3 hours ago | parent | prev | next [-] | | Here's a source from 2019 that says: "By 2023, the number of knowledge workers in the world will increase to 1.14 billion, with more than four-fifths of that growth coming from the emerging world." https://www.gartner.com/en/newsroom/press-releases/09-24-201... | | |
| ▲ | windexh8er 2 hours ago | parent [-] | | Thank you for validating my point. > "...with more than four-fifths of that growth coming from the emerging world." If anyone thinks this is a part of the global TAM that's got $1000 a month to blow, well then I've got a stable of flying unicorns to sell you. |
| |
| ▲ | HDThoreaun 3 hours ago | parent | prev [-] | | I googled "number of knowledge workers worldwide" and read the top results. If you read it as I was confident in a billion I apologize, Im just trying to get an accurate count. What numbers do you have now and where did you find them? | | |
| ▲ | windexh8er 3 hours ago | parent [-] | | That's not the TAM of 1B knowledge workers globally. If that were the case many industries would have a 2-3x target market. To simplify break that 1B up into 3 levels of purchasing: 1) High-tier (US, Western EU, ANZ, Japan, South Korea, Singapore, UAE, etc) - 200-250M knowledge workers. 2) Mid-tier (Eastern EU, Latin America, urban China, India tech sector, etc) - 300-400M 3) Low-tier (Rest of the world) - 300-400M Low-tier users are mostly free tier or heavily subsidized pricing. Mid-tier are going to account for USD sub-$100 tiers. Probably averaging less than $50/seat. High-tier are who you are assuming is the 1B. Users are not equal in that knowledge worker count, so there aren't 1B knowledge workers to charge money. And when you consider Low-tier users a majority of those are free users which need to be subsidized by the High-tier users. So either free tiers get much more restrictive or the providers lose additional training data. A bulk of Low-tier users cost money and provide little to no revenue. Edit:
And think about Mid-tier and Low-tier for 5 seconds. Why would they pay Anthropic or OAI when they get get 100x+ inference from DeepSeek or Xiaomi? Mid-tier may be the only area that is willing to spend money on a US provider, but I would wager significantly on the fact that users in the Low-tier almost universally do not care. | | |
| ▲ | HDThoreaun 34 minutes ago | parent [-] | | Thank you. So with these numbers it seems like half a billion subscriptions at $500/yr is on the table. Obviously theres going to be competition in this market and self hosting cheap models may become the dominant use case. Assuming the labs are able to get most of the market though, the market size is something like a quarter trillion a year within the next decade. It's hard for me to imagine the whole sector failing if that happens. I do think free accounts are going to end pretty soon, and some of the workers in your tier 3 will pay, but even without them this seems like a pretty healthy market size. I also wouldnt be surprised if mid tier workers are able to afford the $1000/yr vs $500. I use yearly rates because I find it easier to compare them to GDP/salary numbers |
|
|
|
|
|
|
|
| ▲ | solenoid0937 3 hours ago | parent | prev [-] |
| > 20% if you're a developer.
That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending. Of course it will. The value of an employee is a multiple of what they get paid. If you pay an employee $500k and they make $2M for your company (like Meta), then of course a 20% increase for the salary is justified if the velocity is increased 20% as well. |
| |
| ▲ | lunar_mycroft 2 hours ago | parent | next [-] | | The difference between what the employer makes per employee and what they spend in compensation doesn't matter. If the increase in productivity isn't greater than the increase in cost, there isn't a reason to pay for AI over hiring more developers. Imagine an employer with 10 employees paying $500k per employee and making $2M per employee in revenue (to use your numbers). They could hire two more employees and spend an extra $1M (+20%), but make an extra $4M in revenue (+20%). Alternatively, they could buy all ten employees a $100k AI subscription, for a total of $1M extra spending (+20%) but an extra $4M in revenue (+20%). You'll notice both scenarios are identical, so an employer optimizing for profit would have no reason to prefer one over the other. | | |
| ▲ | chasd00 an hour ago | parent [-] | | There’s a lot relationship and culture management overhead involved when adding 2 more people to a 10 person company. I think any business leader would take the productivity speed up from buying a tool over hiring more people and integrating personalities/habits/viewpoints to an existing established culture any day of the week. | | |
| ▲ | lunar_mycroft an hour ago | parent [-] | | You're basically positing that the real cost of a 20% headcount increase is higher and/or the productivity gain is is lower than 20%. That isn't an unreasonable claim, but it's basically rejecting the premise here. You might just as well object to the premise that you can buy a 20% speedup by spending an extra 20% on tokens. |
|
| |
| ▲ | hansmayer 3 hours ago | parent | prev [-] | | [dead] |
|