Inference costs falling 2x doesn’t decrease hardware prices when demand for tokens has increased 10x.
It's the ratio. If revenue goes up 10x you can afford 10x more hardware if you can afford to do it all.