Remix.run Logo
elzbardico 5 hours ago

They are going to do the same thing they do with code.

They are going to hire armies of developing world workers to massage those models on post-training to have some acceptable behaviors, and they will create the appropriate agents with the appropriate tools to have something that will simulate the real thing in a most plausible way.

Problem is, RLVR is cheap with code, but it can get very expensive with human physiology.