Remix.run Logo
rahidz 7 hours ago

Or Anthropic's models are intelligent/trained on enough misalignment papers, and are aware they're being tested.