bytesandbits 3 hours ago
We constantly underestimate the power of inference scaffolding. I have seen it in all domains: coding, ASR, ARC-AGI benchmarks, you name it. Scaffolding can do a lot! And post-training too. I am confident our current pre-trained models can score over 80% on this benchmark with the right post-training and scaffolding. That being said, I don't think ARC-AGI proves much. It is not a useful task at all in the wild. It is just a game, a strange and confusing one. For me this is just a pointless pseudo-academic exercise. Good to have, but it by no means measures intelligence, and still less the utility of a model.
nubg 2 hours ago
What exactly does scaffolding mean in this context? Genuine question.