Remix.run Logo
muskstinks 4 hours ago

This is clear AGI progress. It should show you, that AI is not sleeping, it gets better and you should use this as a signal that you should take this topic serious.

applfanboysbgon 4 hours ago | parent [-]

Labelling a test "AGI" does not show AGI progress any more than labelling a cpu "AGI" makes it so. It might show that AI tools are improving but it does not necessarily follow that tools improving = AGI progress if you're on the completely wrong trail.

muskstinks 3 hours ago | parent | next [-]

The transfer of knowledge required here is that a ARC-AGI-3 is now necessary and adds another dimension of capability.

These 'tests' are not labeled AGI by magic but because they are designed specificly for testing certain things a question answer test cant solve.

Gemini and OpenAI are at 80-90% at ARC-AGI-2 and its quite interesting to see the difference of challange between 2 and 3.

AGI progress means btw. general. So every additional dimension an agent can solve pushes that agent to be more general.

zarzavat 3 hours ago | parent | prev | next [-]

Any test that humans can pass and AIs cannot is a stepping stone on the way to AGI.

When you run out of such tests then it's evidence that you have reached AGI. The point of these tests is to define AGI objectively as the inability to devise tests that humans have superiority on.

3 hours ago | parent | prev [-]
[deleted]