▲ | meshugaas 5 days ago | ||||||||||||||||||||||||||||||||||||||||
The "model as company" metaphor makes no sense. It should actually be models are products, like a shoe. Nike spends money developing a shoe, then building it, then they sell it, and ideally those R&D costs are made up in shoe sales. But you still have to run the whole company outside of that. Also, in Nike's case, as they grow they get better at making more shoes for cheaper. LLM model providers tell us that every new model (shoe) costs multiples more than the last one to develop. If they make 2x revenue on training, like he's said, to be profitable they have to either double prices or double users every year, or stop making new models. | |||||||||||||||||||||||||||||||||||||||||
▲ | renjimen 5 days ago | parent | next [-] | ||||||||||||||||||||||||||||||||||||||||
But new models to date have cost more than the previous ones to create, often by an order of magnitude, so the shoe metaphor falls apart. A better metaphor would be oil and gas production, where existing oil and gas fields are either already finished (i.e. model is no longer SOTA -- no longer making a return on investment) or currently producing (SOTA inference -- making a return on investment). The key similarity with AI is new oil and gas fields are increasingly expensive to bring online because they are harder to make economical than the first ones we stumbled across bubbling up in the desert, and that's even with technological innovation. That is to say, the low hanging fruit is long gone. | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
▲ | pegasus 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||
If you're going to use shoes as the metaphor, a model would be more like a shoe factory. A shoe would be a LLM answer, i.e. inference. In which case it totally makes sense to consider each factory as an autonomous economic unit, like a company. | |||||||||||||||||||||||||||||||||||||||||
▲ | skybrian 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||
Analogies don't prove anything, but they're still useful for suggesting possibilities for thinking about a problem. If you don't like "model as company," how about "model as making a movie?" Any given movie could be profitable or not. It's not necessarily the case that movie budgets always get bigger or that an increased budget is what you need to attract an audience. | |||||||||||||||||||||||||||||||||||||||||
▲ | Szpadel 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||
I believe better analogy is CPU development on next process node. each node is much more expensive to design for, but when you finally have it you basically print money. and of course you always have to develop next more powerful and power efficient CPU to keep competitive | |||||||||||||||||||||||||||||||||||||||||
▲ | vonneumannstan 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||
>Also, in Nike's case, as they grow they get better at making more shoes for cheaper. This is clearly the case for models as well. Training and serving inference for GPT4 level models is probably > 100x cheaper than they used to be. Nike has been making Jordan 1's for 40+ years! OpenAI would be incredibly profitable if they could live off the profit from improved inference efficiency on a GPT4 level model! | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
▲ | true_religion 5 days ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||||||||
It's model as a company because people are using the VC mentality, and also explaining competition. Model as a product is the reality, but each model competes with previous models and is only successful if it's both more cost effective, and also more effective in general at its tasks. By the time you get to model Z, you'll never use model A for any task as the model lineage cannibalizes sales of itself. |