| ▲ | gleipnircode 3 hours ago | |||||||||||||||||||||||||||||||||||||||||||
OpenClaw user here. Genuinely curious to see if this works and how easy it turns out to be in practice. One thing I'd love to hear opinions on: are there significant security differences between models like Opus and Sonnet when it comes to prompt injection resistance? Any experiences? | ||||||||||||||||||||||||||||||||||||||||||||
| ▲ | datsci_est_2015 3 hours ago | parent [-] | |||||||||||||||||||||||||||||||||||||||||||
> One thing I'd love to hear opinions on: are there significant security differences between models like Opus and Sonnet when it comes to prompt injection resistance? Is this a worthwhile question when it’s a fundamental security issue with LLMs? In meatspace, we fire Alice and Bob if they fail too many phishing training emails, because they’ve proven they’re a liability. You can’t fire an LLM. | ||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||