| ▲ | vhiremath4 5 hours ago | |
So this is like branch prediction for operating systems? Except we have probability baked into the model itself so it’s even more reliable. | ||
| ▲ | Lihh27 3 hours ago | parent [-] | |
similar idea, but the failure mode is better. a branch mispredict burns cycles. a bad guess here usually just means no bonus tokens. https://arxiv.org/abs/2211.17192 | ||