Remix.run Logo
Reinforcement learning towards broadly and persistently beneficial models(alignment.openai.com)
2 points by gmays 8 hours ago