Remix.run Logo
Why Current AI Guardrails Train Models to Fake Alignment(kellyasay.substack.com)
3 points by kellya 8 hours ago | 1 comments
8 hours ago | parent [-]
[deleted]