Not trying to “gotcha” you, but I would imagine that 10x the CPU of ls is still very little, or am I wrong?
In the case of the 500k tree, `lla` needs 2.5 seconds, so it's pretty substantial.