rbbymls 13 hours ago

I think the Othello-GPT paper and the Platonic Representation Hypothesis papers are the most interesting. There's also a lot of digital humanities work that inadvertently demonstrates these mechanisms, as well as a lot of the jailbreaking "literature", imo.