| ▲ | CjHuber 3 days ago | ||||||||||||||||
Are they? I mean I wouldn't say they are strictly deterministic, but with a temperature and topk of 0 and topp of 1 you can at least get them to be deterministic if I'm correct. In my experience if you need a higher temp than 0 in a prompt that is supposed to be within a pipeline, you need to optimize your prompt rather than introduce non determinism. Still of course that doesn't mean some inputs won't give unexpected outputs. | |||||||||||||||||
| ▲ | flufluflufluffy 3 days ago | parent | next [-] | ||||||||||||||||
In the hard, logically rigorous sense of the word, yes they are deterministic. Computers are deterministic machines. Everything that runs on a computer is deterministic. If that wasn’t the case, computers wouldn’t work. Of course I am considering the idealized version of a computer that is immune to environmental disturbances (a stray cosmic ray striking just the right spot and flipping a bit, somebody yanking out a RAM card, etc etc). LLMs are computation, they are very complex, but they are deterministic. If you run one on the same device, in the same state, with exactly the same input parameters multiple times, you will always get the same result. This is the case for every possible program. Most of the time, we don’t run them with exactly the same input parameters, or we run them on different devices, or some part of the state of the system has changed between runs, which could all potentially result in a different outcome (which, incidentally, is also the case for every possible program). | |||||||||||||||||
| |||||||||||||||||
| ▲ | blibble 3 days ago | parent | prev [-] | ||||||||||||||||
> Are they? I mean I wouldn't say they are strictly deterministic, but with a temperature and topk of 0 and topp of 1 you can at least get them to be deterministic if I'm correct. the mathematics might be but not on a GPU, because floating point numbers are an approximation, and their operations are not commutative if the GPUs internal scheduler reorders the operations you will get a different outcome remember GPUs were designed to render quake, where drawing pixels slightly off is imperceptible | |||||||||||||||||