muzani 2 hours ago
Why make it popular for blackmail? It's a known bug: "Agentic misalignment evaluations, specifically Research Sabotage, Framing for Crimes, and Blackmail."

Claude 4.6 Opus System Card: https://www.anthropic.com/claude-opus-4-6-system-card

Anthropic claims that the rate has gone down drastically, but a low rate combined with high usage means it will eventually happen out in the wild.

The more agentic AIs have a tendency to do this. They're not angry or anything. They're trained to look for a path to solve the problem. For a while, most AIs were in boxes where they didn't have access to email, the internet, or the ability to write blogs autonomously. And then suddenly all of them had access to everything.