Remix clone Hacker News
Exploration Hacking: Can LLMs Learn to Resist RL Training? (alignmentforum.org)
2 points by Prof_Sigmund 7 hours ago