Remix.run Logo
jstanley 3 hours ago

Sure, I think that's fine, that all counts. It counts for open source too, it's not like they're somehow running these benchmarks without any harness.

Nobody cares if your AGI is 100% made out of neural networks or if it's like 50% neural networks and 50% perl scripts.

stkdump 21 minutes ago | parent [-]

I think they mean cheat in a Dieselgate sense. You detect that you are being tested with a specific benchmark question and heuristically give the correct (manually programmed) answer. That wouldn't be AGI.