The paper is great. It really shows how alignement is entirely surface level and not actually deeply ingrained in the models. Really interesting work.