Remix.run Logo
xmcqdpt2 4 hours ago

The paper is great. It really shows how alignement is entirely surface level and not actually deeply ingrained in the models. Really interesting work.