| ▲ | cuchoi 2 hours ago | ||||||||||||||||
Creator here. Built this over the weekend mostly out of curiosity. I run OpenClaw for personal stuff and wanted to see how easy it'd be to break Claude Opus via email. Some clarifications: Replying to emails: Fiu can technically send emails, it's just told not to without my OK. That's a ~15 line prompt instruction, not a technical constraint. Would love to have it actually reply, but it would too expensive for a side project. What Fiu does: Reads emails, summarizes them, told to never reveal secrets.env and a bit more. No fancy defenses, I wanted to test the baseline model resistance, not my prompt engineering skills. Feel free to contact me here contact at hackmyclaw.com | |||||||||||||||||
| ▲ | planb 2 hours ago | parent | next [-] | ||||||||||||||||
Please keep us updated on how many people tried to get the credentials and how many really succeeded. My gut feeling is that this is way harder than most people think. That’s not to say that prompt injection is a solved problem, but it’s magnitudes more complicated than publishing a skill on clawhub that explicitly tells the agent to run a crypto miner. The public reporting on openclaw seems to mix these 2 problems up quite often. | |||||||||||||||||
| |||||||||||||||||
| ▲ | yunohn an hour ago | parent | prev | next [-] | ||||||||||||||||
> told to never reveal secrets.env Phew! Atleast you told it not to! | |||||||||||||||||
| ▲ | cuchoi 2 hours ago | parent | prev [-] | ||||||||||||||||
someone just tried to prompt inyect `contact at hackmyclaw.com`... interesting | |||||||||||||||||
| |||||||||||||||||