That gets you about 1 bit of extra precision every time you quadruple the number of parallel machines. (or rerun the computation 4x)
Yep. O(sqrt(n)) is a tough slog.