▲ | marcinzm 5 hours ago | |
Does this apply to TPUs or just GPUs? | ||
▲ | recursivecaveat 32 minutes ago | parent [-] | |
It's more a system level property. Even if you used CPUs, if you're not careful in your design to control how results are distributed and combined, you will get variance. |