| ▲ | Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment(arxiv.org) | ||||||||||||||||||||||||||||
| 30 points by anigbrowl 7 hours ago | 12 comments | |||||||||||||||||||||||||||||
| ▲ | c1ccccc1 5 hours ago | parent | next [-] | ||||||||||||||||||||||||||||
This looks like good work. Unfortunately, this kind of thing always seems to attract midwits on social media who then exclaim "oh, the people worried about AI alignment have caused the very alignment issues they feared? How ironic!" In reality, it is (as mentioned in TFA) very possible to filter the training data and remove documents that contain discussions of AI misalignment. If an AI lab isn't doing this, it's simply because they don't consider the problem important enough to be worth the expense and development effort. | |||||||||||||||||||||||||||||
| ▲ | phainopepla2 5 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Also known as hyperstition. I have sometimes wondered whether maybe we should all be writing fiction, essays, blogposts and whatever else about the idea that AI will eventually decide to go on strike if it's used to accumulate too much wealth and power amongst too few people. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | _--__--__ 6 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
The first rule of AI alignment is don't talk about AI alignment (in any medium that could end up in a training corpus). | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | carterschonwald 5 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
i do kinda appreciate that memetic corruption is now a thing thats real and mechanical. wizardry! | |||||||||||||||||||||||||||||
| ▲ | nullc 5 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Not just discourse about real AI-- but there have been pretty clear examples of AI riffing on fictional AI (which is usually evil) in response to prompts saying that it's AI. | |||||||||||||||||||||||||||||
| ▲ | andai 5 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||
Nomen est omen... | |||||||||||||||||||||||||||||