| ▲ | throw310822 3 hours ago | |||||||||||||||||||||||||||||||
The average ARC AGI 2 score for a single human is around 60%. "100% of tasks have been solved by at least 2 humans (many by more) in under 2 attempts. The average test-taker score was 60%." | ||||||||||||||||||||||||||||||||
| ▲ | modeless 2 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||
Worth keeping in mind that in this case the test takers were random members of the general public. The score of e.g. people with bachelor's degrees in science and engineering would be significantly higher. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | imiric an hour ago | parent | prev [-] | |||||||||||||||||||||||||||||||
What is the point of comparing performance of these tools to humans? Machines have been able to accomplish specific tasks better than humans since the industrial revolution. Yet we don't ascribe intelligence to a calculator. None of these benchmarks prove these tools are intelligent, let alone generally intelligent. The hubris and grift are exhausting. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||