Remix.run Logo
Bolwin 6 hours ago

Arc agi was never a good benchmark that tested spatial understanding more than reasoning. I'm glad it's no longer popular

falcor84 6 hours ago | parent | next [-]

What do you mean? It definitely tests reasoning as well, and if anything, I expect spatial and embodied reasoning to become more important in the coming years, as AI agents will be expected to take on more real world tasks.

eugene3306 6 hours ago | parent | prev [-]

spatial or not, arc-agi is the only test that correlates to my impression with my coding requests