| ▲ | brianush1 11 hours ago | ||||||||||||||||
claude is stupid but not malicious; chroot is sufficient | |||||||||||||||||
| ▲ | furyofantares 11 hours ago | parent | next [-] | ||||||||||||||||
I've many times seen Claude try to execute a command that it's not supposed to, the harness prevents it, and then it writes and executes a python script to do it. | |||||||||||||||||
| |||||||||||||||||
| ▲ | nofriend 11 hours ago | parent | prev | next [-] | ||||||||||||||||
Malice is not required. If it thinks it is in the right, then it will do whatever it takes to get around limitations. | |||||||||||||||||
| ▲ | lxgr 4 hours ago | parent | prev | next [-] | ||||||||||||||||
Until it gets prompt injected. Are you reading every single file your agent reads as part of the tasks you give it, including content fetched from the web or third-party packages? | |||||||||||||||||
| ▲ | karhagba 11 hours ago | parent | prev [-] | ||||||||||||||||
Claude is far from stupid from my experience. I've used so many models and Claude is king. | |||||||||||||||||