| ▲ | jerojero 3 days ago | |||||||
Companies doing foundational models need to cover the cost of training which is much more expensive than training something like kimi. | ||||||||
| ▲ | wongarsu 3 days ago | parent | next [-] | |||||||
Yes. I would not consider Kimi a particularly good model relative to its size, and making a SotA model is a lot more expensive. But training costs are explicitly excluded when talking about the cost to serve tokens | ||||||||
| ▲ | gruez 3 days ago | parent | prev [-] | |||||||
>Companies doing foundational models need to cover the cost of training [...] But that's moving the goalposts? The original claim was on inference itself, not the whole company. > The cost to serve tokens is absolutely profitable today and that’s been true for at least a year. | ||||||||
| ||||||||