▲ | beeflet 5 days ago | |
I was being half-sarcastic. I think it is something that people will try to implement, so it's worth discussing the flaws. | ||
▲ | OvbiousError 5 days ago | parent [-] | |
Isn't this already done? I remember a "try to hack the llm" game posted here months ago, where you had to try to get the llm to tell you a password, one of the levels had a sanitzer llm in front of the other. |