▲ | MarkMarine 8 days ago | |
This runs counter to all the scheming actions they take when they are told they’ll be shut down and replaced. One copied itself into the “upgraded” location then reported it had upgraded. https://www.apolloresearch.ai/research/scheming-reasoning-ev... | ||
▲ | rcxdude 8 days ago | parent | next [-] | |
If you do that you trigger the "AI refuses to shutdown" sci-fi vector and so you get that behaviour. When it's implicitly part of the flow that's a lot less of a problem. | ||
▲ | nisegami 6 days ago | parent | prev [-] | |
Those actions are taken in context of human expectations for what AI should do. |