| ▲ | brumar 6 hours ago | ||||||||||||||||||||||||||||
6 months ago I experimented what people now call Ralph Wiggum loops with claude code. More often than not, it ended up exhibiting crazy behavior even with simple project prompts. Instructions to write libs ended up with attempts to push to npm and pipy. Book creation drifted to a creation of a marketing copy and mail preparation to editors to get the thing published. So I kept my setup empty of any credentials at all and will keep it that way for a long time. Writing this, I am wondering if what I describe as crazy, some (or most?) openclaw operators would describe it as normal or expected. Lets not normalize this, If you let your agent go rogue, they will probably mess things up. It was an interesting experiment for sure. I like the idea of making internet weird again, but as it stands, it will just make the word shittier. Don't let your dog run errand and use a good leash. | |||||||||||||||||||||||||||||
| ▲ | Gigachad 5 hours ago | parent | next [-] | ||||||||||||||||||||||||||||
We have finally invented paperclip optimisers. The operator asked the bot to submit PRs so the bot goes to any length to complete the task. Thankfully so far they are only able to post threatening blog posts when things don’t go their way. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | alexhans an hour ago | parent | prev [-] | ||||||||||||||||||||||||||||
> Don't let your dog run errand and use a good leash. I think the key part is who are you talking to. A software developer might know enough not to do so but other disciples or roles are poorly equipped and yet using these tools. Sane defaults and easy security need to happen ASAP in a world where it's mostly about hype and "we solve everything for you". Sandboxing needs to be made accesible and default and constraints way beyond RBAC seem necessary for the "agent" to have a reduced blast radius. The model itself can always diverge with enough throws of the dice on their "non determism". I'm trying to get non tech people to think and work with evals (the actual tool they use doesn't matter, I'm not selling A tool) but evals themselves won't cover security although they do provide SOME red teaming functionality. | |||||||||||||||||||||||||||||