| ▲ | didibus 3 hours ago | |
It helps the model makers have a harness to optimize for in their next model version. They'll specifically work to pass the next version of ARC-AGI, by evaluating what kind of dataset is missing that if they trained on would have their model pass the new version. They ideally don't directly train on the ARC-AGI itself, but they can train in similar problems/datasets to hope to learn the skills that than transfer to also solving for the real ARC-AGI. The point is that, a new version of ARC-AGI should help the next model be smarter. | ||