| ▲ | zyklu5 7 hours ago | |
Their explanation for why their idea (SSD) might work - precision-exploration conflict hypothesis - is something adaptive decoding also tries to solve. https://ai.meta.com/research/publications/adaptive-decoding-... | ||
| ▲ | andy_xor_andrew 4 hours ago | parent [-] | |
I've been wondering about adaptive decoding! It seems obvious to me that at some points during decoding (reasoning, "creative thinking") you would want a higher temperature, while at other points (emitting syntactically correct code, following a plan that was already established) you would want lower temperature. | ||