| ▲ | throwuxiytayq 20 hours ago | |
The author’s inability to imagine a model that’s superficially useful but dangerously misaligned betrays their lack of awareness of incredibly basic AI safety concepts that are literally decades old. | ||
| ▲ | theptip 20 hours ago | parent [-] | |
Exactly. Building a model that truly understands humans, and their intentions, and generally acts with, if not compassion then professionalism - is the Easy Problem of Alignment. Starting points: https://www.lesswrong.com/posts/zthDPAjh9w6Ytbeks/deceptive-... | ||