| ▲ | holmesworcester 4 hours ago | |
One major source of conflict in AI policy / AI safety is that very smart people have radically diverging intuitions about how dangerous superintelligence is and how difficult it is to align. A first group dismisses the problem entirely, saying intelligence != power and AI doesn't have "drives". A second group believes that alignment is solvable through engineering and iteration, and that we have the best chance of surviving if people with the right intentions are the ones working on it. A third believes that aligning a superintelligence is a unique category of problem, that we are nowhere close to the level of scientific understanding needed to achieve it, that we only have one shot (because once a sufficiently powerful superintelligence exists it will thwart all future attempts, and alignment techniques that worked on dumber AI will likely not work on it), and that the world will have to coordinate to avoid killing ourselves off by building superintelligence before we understand how to do it safely, the way we have coordinated to avoid nuclear war. The Anthropic and OpenAI founders, Elon, and Anthropic engineers are mostly in the second category. Some safety people at Anthropic and OAI are in the third category, but leading people in the third category think that pure safety roles at the labs are potentially impactful enough to be worth not quitting. | ||
| ▲ | kmeisthax 3 hours ago | parent [-] | |
I have a fourth, secret position: we achieved superintelligence the moment we achieved normal intelligence. Speed is a power in and of itself; and even really primitive models like GPT-2 could generate tokens faster than humans could write. They could also be parallelized on hardware that already exists. That is superintelligence in two dimensions - speed and population count. All the arguments the AI safety people are making are about superintelligence in a different dimension - that of "single-context scaling" - but the other dimensions are also relevant to the conversation. And the superintelligence currently available to us is already causing lots of documented harms. AI psychosis. Sexy suicide coaches. Slop. The problem is that those are all the harms the dirty, filthy AI ethicists talk about. The AI safety people want to talk about new and exciting harms that only the scaling dimension can bring us. My personal opinion is that if a superintelligence catastrophe actually happens, mitigating those harms will neatly move over from the safety bucket to the ethics bucket, and the safety people will start imagining some new and even worse kinds of harms the next model will make. | ||