| ▲ | simianwords 13 hours ago | |
Here's what I believe: it is extremely likely that the jailbreak wasn't real and was minor. There is no way Dario would have seen a real threatening jailbreak and specifically call out that there's nothing to do about it. This is what Dario said about the jailbreak > He pushed back on the administration’s concerns, defended the guardrails and argued that the type of bypass that occurred, which he believed to be specific, did not pose the same risk as a broader “jailbreak” that would allow it to be used without any of the guardrails put in place by Anthropic. If I were to bet - whatever capabilities this jailbreak unlocked, you could have got it for older GPT and Claude models as well. I think think this whole thing was either a misunderstanding or deliberate move by the government to put Anthropic in its place. | ||
| ▲ | Schiendelman 8 hours ago | parent [-] | |
In the original release, Anthropic called out that they showed the government that exactly the same outcomes could be achieved with ChatGPT models. So you don't have to bet, they said it publicly. Keep in mind family of the administration is heavily invested in OpenAI. | ||