If your workload is bottlenecked on memory bandwidth, this method is a great fit: it puts the compute that would otherwise sit idle to work.
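One rough way to check whether you are in that regime is a roofline-style estimate: compare the kernel's arithmetic intensity (FLOPs per byte moved) with the hardware's ratio of peak compute to peak memory bandwidth. The sketch below is only an illustration of that check, not part of the method itself. The function names and the hardware and kernel numbers are hypothetical placeholders, so substitute measured values for your own setup.

```python
def arithmetic_intensity(flops: float, bytes_moved: float) -> float:
    """FLOPs performed per byte transferred to or from memory."""
    return flops / bytes_moved


def machine_balance(peak_flops: float, peak_bandwidth: float) -> float:
    """FLOPs per byte the hardware can sustain at peak (the roofline ridge point)."""
    return peak_flops / peak_bandwidth


def is_memory_bound(flops: float, bytes_moved: float,
                    peak_flops: float, peak_bandwidth: float) -> bool:
    """True when the kernel sits left of the ridge point, i.e. bandwidth limits it."""
    return (arithmetic_intensity(flops, bytes_moved)
            < machine_balance(peak_flops, peak_bandwidth))


def idle_compute_fraction(flops: float, bytes_moved: float,
                          peak_flops: float, peak_bandwidth: float) -> float:
    """Fraction of peak compute left unused under the simple roofline model."""
    intensity = arithmetic_intensity(flops, bytes_moved)
    attainable = min(peak_flops, intensity * peak_bandwidth)
    return 1.0 - attainable / peak_flops


if __name__ == "__main__":
    # Hypothetical accelerator: 100 TFLOP/s peak compute, 1 TB/s peak bandwidth.
    PEAK_FLOPS = 100e12
    PEAK_BANDWIDTH = 1e12
    # Hypothetical kernel: 2 GFLOP of work while moving 1 GB of data.
    KERNEL_FLOPS = 2e9
    KERNEL_BYTES = 1e9

    bound = is_memory_bound(KERNEL_FLOPS, KERNEL_BYTES, PEAK_FLOPS, PEAK_BANDWIDTH)
    idle = idle_compute_fraction(KERNEL_FLOPS, KERNEL_BYTES, PEAK_FLOPS, PEAK_BANDWIDTH)
    print(f"memory bound: {bound}, idle compute: {idle:.0%}")
```

If the check reports a memory-bound kernel with a large idle fraction, there is headroom for extra arithmetic at little cost in wall-clock time, which is the situation the comment above describes.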