This is cool. Creative ways to do external verification are the only path to solving the problem of training on LLM slop.