scotty79 10 hours ago

ARC-AGI, while imagined as super hard for AI, was beaten enough that they had to come up with ARC-AGI-2.

hbsbsbsndk 9 hours ago | parent [-]

"AI tends to be brittle and optimized for specific tasks, so we made a new specific task and then someone optimized for it" isn't some kind of gotcha. Once ARC puzzles became a benchmark they ceased to be meaningful WRT "AGI".

scotty79 4 hours ago | parent [-]

So if DOTA became a benchmark the same way Chess or Go did earlier, it would be promptly beaten. It just didn't stick before people moved on to more useful "games".