▲ | _ncyj 3 days ago | |||||||
This is very interesting. Not saying it is, but a possible endgame for Chinese models could be to have "backdoor" commands such that when a specific string is passed in, agents could ignore a particular alert or purposely reduce security. A lot of companies are currently working on "Agentic Security Operation Centers", some of them preferring to use open source models for sovereignty. This feels like a viable attack vector. | ||||||||
▲ | lifeinthevoid 3 days ago | parent [-] | |||||||
What China is to the US, the US is to the rest of the world. This doesn't really help the conversation, the problem is more general. | ||||||||
|