Those won’t be sufficient to run SOTA/trillion-parameter models.
And most tasks don't demand that.
Distilled models are good enough.