Remix.run Logo
jstanley 3 hours ago

> "GPUs don't do deterministic matrix multiplications" is the biggest source of randomness in LLMs.

But this isn't a fundamental property of LLMs, it's just an implementation detail. It's pretty obvious that if you evaluate the matrix multiplications correctly and deterministically sample from the highest-probability outputs, you will have a deterministic LLM.

vbarrielle 2 hours ago | parent [-]

It may be an implementation detail, but in practice, if the only way to get a deterministic output is to run on the CPU, then it's not going to be usable.

Dylan16807 7 minutes ago | parent | next [-]

You can tell GPUs what order to do math instructions in.

317070 an hour ago | parent | prev [-]

Actually, Google's TPUs are also deterministic!