Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
(
nature.com
)
7 points
by
Anon84
16 hours ago