Compute has been getting cheaper and models more optimised. So if models can do something it will not be long till they can do this cheap.
GPU compute per watt has grown by a factor of 2 in last 5 years