| ▲ | aethrum 4 hours ago | |
Can't we just turn the temp down to 0? | ||
| ▲ | MyHonestOpinon 15 minutes ago | parent | next [-] | |
I suppose that even with temp down to zero the model itself changes over time. | ||
| ▲ | 16 minutes ago | parent | prev | next [-] | |
| [deleted] | ||
| ▲ | helloplanets 3 hours ago | parent | prev | next [-] | |
Even if you turn the temperature down to 0, it's not deterministic. Floating points are messy. If there is even a tiny difference when it comes to the order of operations on the actual GPU that's running the billions of parallelized floating point operations over and over, it's very possible to end up with changing top probability logits. | ||
| ▲ | kibwen 3 hours ago | parent | prev | next [-] | |
That doesn't make a difference here. Even with a nonzero temperature, an LLM could still be deterministic as long as you have control of its random seed. As the article says: "This gets to my core point. What changes with LLMs isn’t primarily nondeterminism, unpredictability, or hallucination. It’s that the programming interface is functionally underspecified by default." | ||
| ▲ | abm53 4 hours ago | parent | prev [-] | |
More to the point: is randomness of representation or implementation an inherent issue if the desired semantics of a program are still obeyed? This is not really a point about whether LLMs can currently be used as English compilers, but more questioning whether determinism of the final machine code output is a critical property of a build system. | ||