Isn't running the models for end users the biggest cost at the moment?
Running the models is a tiny fraction of the cost. The cost is all on training the new models