golol 5 days ago
Hold on a second. A transformer deterministically produces a probability distribution over the token alphabet from the context. Then one samples from this distribution. That step is random, and meant to be random.
nowittyusername 5 days ago
The sampling process isn't random. If you sample with identical sampling parameters and identical values for said parameters, you will always get the same results. You only start getting "non-deterministic" behavior when you use more complex systems outside your control, like multi-GPU setups and batch processing. One LLM sampled with prompt caching off and batch processing off will always generate the same results if all values are the same.
oasisaimlessly 5 days ago
It's possible to deterministically sample from a probability distribution. For example, just seed your RNG with a constant, or with the SHA256 hash of the context.
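A rough sketch of what that could look like in Python, using only the standard library. The names `sample_token`, `probs`, and `context` are made up for illustration, and this is not how any particular inference stack does it:

```python
import hashlib
import random

def sample_token(probs, context, seed=None):
    """Pick a token index from `probs` in a reproducible way.

    Hypothetical helper: `probs` is a list of token probabilities and
    `context` is the prompt text. If no constant `seed` is given, the
    seed is derived from the SHA-256 hash of the context.
    """
    if seed is None:
        digest = hashlib.sha256(context.encode("utf-8")).digest()
        # Use the first 8 bytes of the digest as an integer seed.
        seed = int.from_bytes(digest[:8], "big")
    rng = random.Random(seed)  # private RNG, independent of global state
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

probs = [0.1, 0.7, 0.2]
first = sample_token(probs, "Hold on a second.")
second = sample_token(probs, "Hold on a second.")
print(first == second)  # same context, same seed, same draw
```

The draw still follows the distribution, but the same context always maps to the same seed, so repeated calls agree. Passing a constant `seed` instead gives the "seed your RNG with a constant" variant.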