| ▲ | jMyles 2 hours ago | |
Related but distinct: Is there an ELI5 about determinism in inference? In other words, when will the same prompt lead to the same output, and when not? And why not? | ||
| ▲ | FrasiertheLion an hour ago | parent | next [-] | |
jashulma above has a great link: https://news.ycombinator.com/item?id=47105315 | ||
| ▲ | measurablefunc an hour ago | parent | prev [-] | |
Even if you reduce all the non-determinism you still will not get consistent results b/c of floating point rounding & instruction scheduling in the GPU. There is no way to guarantee that the GPU pipelines will execute your instructions exactly in the order you want it to be executed b/c GPUs are now essentially equivalent to sufficiently smart compilers & perform all sorts of clever instruction re-ordering behind the scenes. Expecting complete reproducibility at scale is a pipe dream. | ||