Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
Beyond 80/20: High-Entropy Minority Tokens Drive Effective RL for LLM Reasoning
(
arxiv.org
)
2 points
by
mdp2021
8 hours ago