Majromax 4 hours ago
If you remove the conditional and keep the same math, you divide by zero and get NaNs. In the limit as temperature goes to zero, though, you do in fact get maximum-likelihood sampling, i.e. you always pick the highest-probability token (argmax).
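To spell the limit out, here is a short sketch, assuming the usual temperature-scaled softmax over logits z_i (the symbols z, T, and M are notation introduced here, not taken from the thread). Subtracting the largest logit leaves the probabilities unchanged and makes the limit easy to read off:

```latex
p_i(T) = \frac{e^{z_i/T}}{\sum_j e^{z_j/T}}
       = \frac{e^{(z_i - z_{\max})/T}}{\sum_j e^{(z_j - z_{\max})/T}},
\qquad
\lim_{T \to 0^+} p_i(T) =
\begin{cases}
  1/|M| & \text{if } i \in M, \\
  0     & \text{otherwise,}
\end{cases}
\qquad
M = \{\, k : z_k = z_{\max} \,\}.
```

With a unique maximum, all of the probability mass ends up on the argmax token. At T = 0 exactly, z_i/T is a division by zero, so the formula itself is undefined there and only the limit makes sense.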
dnautics 3 hours ago
if (t == 0) argmax(logits) else pick(logits)
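For anyone who wants to try it, a minimal Python sketch of that conditional might look like the following. The name `sample_token` and its signature are hypothetical, and it assumes `pick` means sampling from a temperature-scaled softmax with t >= 0; it is not taken from any particular library.

```python
import math
import random


def sample_token(logits, t, rng=random):
    """Hypothetical helper: return a token index for the given logits.

    Assumes t >= 0. `rng` can be the `random` module or a random.Random instance.
    """
    if t == 0:
        # Greedy decoding: the t -> 0 limit, handled without dividing by zero.
        # Ties go to the first maximal index.
        return max(range(len(logits)), key=logits.__getitem__)
    # Subtract the max logit so every exponent is <= 0 and exp() cannot overflow.
    m = max(logits)
    weights = [math.exp((z - m) / t) for z in logits]
    # random.choices normalizes the weights internally.
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]
```

With a very small but nonzero t, the non-maximal weights underflow to 0.0, so the sampling branch behaves like the argmax branch in practice; the conditional is only needed for t == 0 itself.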