| ▲ | joquarky 7 hours ago | |
I thought that temperature can never actually be zero or it creates a division problem or something similar. I'm no ML or math expert, just repeating what I've heard. | ||
| ▲ | ChadNauseam 7 hours ago | parent [-] | |
That's an implementation detail I believe. But what I meant was just greedy decoding (picking the token with the highest logit in the LLM output), which can be implemented very easily | ||