plewd 10 hours ago
Not much if you only use it as a glorified search engine, but the problem stems from all the other things you can make it do for personal use after jailbreaking.
certainforest 8 hours ago
Hey, Jasmine here -- that's a good point. I'm generally more concerned about agentic jailbreaks (e.g. unauthorized purchases, leaking sensitive data) than about GPT making inappropriate comments. In our case, we found that simply acting like a user is enough to trick LLMs into sharing passwords, private files, etc. (On a related note, here's one where they hack a smart home with email invitations: https://sites.google.com/view/invitation-is-all-you-need/hom...)