bevekspldnw 4 hours ago

How much of this is RL’ing a good coding model on every CVE ever?

sometimelurker 3 hours ago

most of this comes from the pretrain imo. just scale + some RL = mythos