I wish there would be more of this research to speed things up rather than building ever larger models
Why not both?
Scaling laws are real! But they don't preclude faster processing.