▲ | the8472 6 hours ago | |
https://gwern.net/scaling-hypothesis Exponential scaling has been holding up for more than a decade now, since AlexNet. And when the first murmurings arose that maybe we're finally hitting a wall, the labs published ways to harness inference-time compute to get better results, which can then be fed back into more training. |