Apple can’t afford to run models, there are too many iPhones and not enough data centers.
Running on device is also risky because cycle limitations will make it seem dumb in comparison.