Why slow? Because running a large model takes either many expensive GPUs or substantial API spending,
which limits the number of people who can experiment with it.