| ▲ | stri8ted 4 hours ago | |||||||||||||
Exactly. As far as I'm concerned, the benchmark is useless. It's way too easy and rewarding to train on it. | ||||||||||||||
| ▲ | bonoboTP an hour ago | parent | next [-] | |||||||||||||
It's just an in-joke, he doesn't intend it as a serious benchmark anymore. I think it's funny. | ||||||||||||||
| ▲ | Legend2440 3 hours ago | parent | prev | next [-] | |||||||||||||
Y'all are way too skeptical, no matter what cool thing AI does you'll make up an excuse for how they must somehow be cheating. | ||||||||||||||
| ||||||||||||||
| ▲ | pixl97 3 hours ago | parent | prev [-] | |||||||||||||
I mean if you want to make your own benchmark, simply don't make it public and don't do it often. If your salamander on skis or whatever gets better with time it likely has nothing to do with being benchmaxxed. | ||||||||||||||