| ▲ | mannanj 6 hours ago | |||||||
The article does mention this and a weakness of that approach is mentioned too. | ||||||||
| ▲ | crisnoble 6 hours ago | parent | next [-] | |||||||
Perhaps they asked AI to summarize the article for them and it stopped after the first "disregard that" it read into its context window. | ||||||||
| ▲ | wbeckler 5 hours ago | parent | prev [-] | |||||||
The article didn't describe how the second AI is tuned to distrust input and scan it for "disregard that." Instead it showed an architecture where a second AI accepts input from a naively implemented firewall AI that isn't scanning for "disregard that" | ||||||||
| ||||||||