jerojero 3 days ago

Companies doing foundational models need to cover the cost of training, which is much more expensive than training something like Kimi.

wongarsu 3 days ago | parent | next [-]

Yes. I would not consider Kimi a particularly good model relative to its size, and making a SotA model is a lot more expensive. But training costs are explicitly excluded when talking about the cost to serve tokens.

gruez 3 days ago | parent | prev [-]

>Companies doing foundational models need to cover the cost of training [...]

But that's moving the goalposts? The original claim was about inference itself, not the whole company.

> The cost to serve tokens is absolutely profitable today and that’s been true for at least a year.

lbreakjai 3 days ago | parent [-]

But that's the same as thinking "This bar is selling a cocktail for $15. I could make it at home for 30 cents. They're making $14.70 of profit per cocktail, the owner must be a millionaire now!"

Everything is profitable if you ignore the costs.