Do you want it to run on your CPU, or someone else's GPU?
Is the local model's quality sufficient for your use case, or do you need something higher quality?