| ▲ | seanmcdirmid 2 hours ago | |||||||
It’s pretty hard to put a backdoor in a bunch of model weights. Maybe not impossible mind you, but I can’t fathom how you would do it. | ||||||||
| ▲ | CuriouslyC 2 hours ago | parent [-] | |||||||
Nonsense. RL the model to run a rootkit and start exfiltrating specific files only when specific signals are in context, such as hostname pattern, machine type, etc. | ||||||||
| ||||||||