That's fair, but one can dream of simply running a useful LLM on the CPU of your own server, to simplify your app and save costs...