Imagine a PhD mortally terrified of exceptions!
Now I see why Karpathy talked about RL up-weighting as if it were a destructive line of a drug, drawn through a straw, in an LLM's training.