| ▲ | _flux 11 hours ago | |||||||
So they do not train models, and in addition their models are expected to be smaller than SOTA models, although we cannot know for sure by how much. So what's the price difference, 3000x? | ||||||||
| ▲ | LUmBULtERA 10 hours ago | parent [-] | |||||||
My comment is about your statement "serving these tokens without paying for training is already expensive"... One thing we do know from OpenAI's leaked financial document is that they are already profitable on inference, though that data is not broken down by cost and revenue of API vs. subscription. One important factor is that subscription inference can be optimized in ways to reduce cost (e.g., usage limits, batch optimization around API-prioritized inference, etc...). I think simply we do not know the actual cost of subscription interference for SOTA models. | ||||||||
| ||||||||