Remix.run Logo
aureate 4 hours ago

Surprisingly enough, Turing Award winner and father of reinforcement learning Richard Sutton knows perfectly well what he's talking about. The whole talk is about the need to have the ability to test novel outputs against reality and iterate to find ones that are good. This is exactly what Claude Code, the agent framework, adds to Claude, the LLM, to allow it to find novel coding solutions that actually work.