Remix.run Logo
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning(nature.com)
7 points by mikhael 14 hours ago