_flux 11 hours ago
I think this comes from the idea that serving these tokens, even without paying for training, is already expensive; e.g. per https://news.ycombinator.com/item?id=46613887, a self-hosted solution might only be 10-100x more affordable at cost. So, given that the SOTA providers, with even larger models, also need to continuously spend considerable resources on training their next models, fund future data centers, and make a profit, the per-token prices are more likely to reflect the real costs than the subscription prices are.
LUmBULtERA 10 hours ago
Except there are plenty of inference providers worldwide (including the US) that serve open-weight models that are not subsidized, and are reasonable in cost. Or is your claim that those are all running at a loss?