The only way to profitable serve AI is to have large batch sizes - run 500 requests at the same time.
If you serve a single user you'll never get your electricity price back, nevermind hardware costs.