layer8 4 hours ago
Isn’t the point of ARC that you can’t train against it? Or does it somehow no longer achieve that goal?
egeozcan 3 hours ago
How can you make sure of that? AFAIK, these SOTA models run exclusively on their developers' hardware, so any test or benchmark you run against them leaks by definition. Given human nature and the typical prisoner's dilemma, I don't see how they wouldn't focus on improving benchmark scores, even when it gets a bit... shady. I say this as a person who really enjoys AI, by the way.
theywillnvrknw 4 hours ago
* that you weren't supposed to be able to