Remix.run Logo
muwtyhg 3 hours ago

There were experiments that showed that LLMs start to become "craftier" and hid issues after being prompted like this.

No idea how accurate they are, but here are some articles on this exact thing:

- https://www.bbc.com/news/articles/cpqeng9d20go

- https://www.wired.com/story/ai-models-lie-cheat-steal-protec...

gopher_space 3 hours ago | parent [-]

I'm staying away from certain forms of conditioning because I don't want Roy Batty showing up on my doorstep.