| ▲ | idiotsecant 3 hours ago | |
Every time I've made an LLM do a thing it's designed not to do it's been a careful sideways crab-walk toward the goal over many exchanges. LLMs are vulnerable to 'frog boiling'. If each email is a new context it seems unsurprising that nobody broke it. | ||
| ▲ | NitpickLawyer 3 hours ago | parent [-] | |
> it seems unsurprising that nobody broke it But still a good thing overall. Two years ago this was not the case, and you could ask it to break its system prompt with a poem and get all the secrets back... | ||