| ▲ | rst 4 days ago | |
Anthropic's ahead of you -- the LLM that the reporters were interacting with here had an AI supervisor, "Seymour Cash", which uh... turned out to have some of the same vulnerabilities, though to a lesser extent. Anthropic's own writeup here describes the setup: https://www.anthropic.com/research/project-vend-2 | ||
| ▲ | UncleMeat 3 days ago | parent [-] | |
> Seymour Cash The "everybody is 12" theory strikes again. | ||