To my knowledge none of the players is even profitable on inference, though Google probably is, considering the continuous release of papers around kv cache optimizations, mtp etc.