Häggström hävdar: Johan Norberg is dead wrong about AI risk

söndag 25 november 2018

Johan Norberg is dead wrong about AI risk

I kind of like Johan Norberg. He is a smart guy, and while I do not always agree with his techno-optimism and his (related) faith in the ability of the free market to sort everything out for the best, I think he adds a valuable perspective to public debate.

However, like the rest of us, he is not an expert on everything. Knowing when one's knowledge on a topic is insufficient to provide enlightenment and when it is better to leave the talking to others can be difficult (trust me on this), and Norberg sometimes fails in this respect. As in the recent one minute and 43 seconds episode of his YouTube series Dead Wrong® in which he comments on the futurology of artificial intelligence (AI). Here he is just... dead wrong:

No more than 10 seconds into the video, Norberg incorrectly cites, in a ridiculing voice, Elon Musk as saying that "superintelligent robots [...] will think of us as rivals, and then they will kill us, to take over the planet". But Musk does not make such a claim: all he says is that unless we proceed with suitable caution, there's a risk that something like this may happen.

Norberg's attempt at immediate refutation - "perhaps super machines will just leave the planet the moment they get conscious [and] might as well leave the human race intact as a large-scale experiment in biological evolution" - is therefore just an invalid piece of strawmanning. Even if Norberg's alternative scenario were shown to be possible, that is not sufficient to establish that there's no risk of a robot apocalypse.

It gets worse. Norberg says that

want

one

homo sapiens

designed

Dead wrong, Mr Norberg! Of course we do not know for sure what motivations a superintelligent machine will have, but the best available theory we currently have for this matter - the Omohundro-Bostrom theory for instrumental vs final AI goals - says that just this sort of behavior can be predicted to arise from instrumental goals, pretty much no matter what the machine's final goal is. See, e.g., Bostrom's paper The superintelligent will, his book Superintelligence, my book Here Be Dragons or my recent paper Challenges to the Omohundro-Bostrom Framework for AI Motivations. Regardless of whether the final goal is to produce paperclips or to maximize the amount of hedonic well-being in the universe, or something else entirely, there are a number instrumental goals that the machine can be expected to adopt for purpose of promoting that goal: self-preservation (do not let them pull the plug on you!), self-improvement, and acquisition of hardware and other resources. There are other such convergent instrumental goals, but these in particular point in the direction of the kind of rivalrous and dominant behavior that Norberg claims a designed machine wouldn't exhibit.

Norberg cites Steven Pinker here, but Pinker is just as ignorant as Norberg of serious AI futurology. It just so happens that when I encountered Pinker in a panel discussion last year, he made the very same dead wrong argument as Norberg now does in his video - just minutes after I had explained to him the crucial parts of Omohundro-Bostrom theory needed to see just how wrong the argument is. I am sure Norberg can rise above that level of ineducability, and that now that I am pointing out the existence of serious work on AI motivations he will read at least some of references given above. Since he seems to be under the influence of Pinker's latest book Enlightenment Now, I strongly recommend that he also reads Phil Torres' detailed critique of that book's chapter on existential threats - a critique that demonstrates how jam-packed the chapter is with bad scholarship, silly misunderstandings and outright falsehoods.

21 kommentarer:

ctail25 november 2018 kl. 15:52
From my computer science perspective, Norberg's standpoint does not look dead wrong, but rather sensible. However, I would like to back up a few steps and claim that the very notion of “motivation” in the context of computers is meaningless as far as we have no clear definition of what motivation should even mean in their world. And, I'm sorry but, “intelligence” has the same problem. I agree with Alan Turing on this issue. (Of course, agreeing with Turing is always a rather safe bet.) In the debate in Lund the other day, you said something on the lines of Turing having taken the positions that computers could become intelligent. Dead wrong! Turing says no such thing. In fact, in the article he (brilliantly) discusses the issue, “Computing Machinery and Intelligence”, he dismisses the question of whether computers can think in the first paragraph, since it is too vague to be answered, and then replaces it with another one, which can be answered.

Another vague concept here is “taking over the world”. What does this even mean? It can be defined in a number of ways, and I find it difficult to find one that lets me agree that humans are currently in charge. Humanity doesn't even agree with itself on most issues, so what would it mean for humanity to be in charge. For all practical intents and purposes, if machines are ever to “take over” in any sense, it's fairly obvious that this isn't a risk, but a fact that they are either in the process of doing so or that they already have. (It's less obvious that this means humans losing out in any general sense.) Machines take over without any other motivation than that of the humans deploying them, and regardless of whether we refer to their behavior as intelligent.

There are real issues, real dangers, with the accelerating development towards computer use in a wide range of previously human-only tasks and environments. (As well as fantastic possibilities, of course.) We should discuss these issues, our responsibilities for the consequences of technology etc., without getting stuck in semantic issues about how to project vague human concepts like “intelligence” and “motivation” on computers, and without inventing highly hypothetical apocalyptic scenarios resulting from computers in some sense and in some way acquiring these vaguely defined traits. There are enough apocalyptic scenarios to worry about already.
SvaraRadera
Svar
Olle Häggström25 november 2018 kl. 16:45
We seem to repeatedly run into semantic issues, ctail. I would like to get past those, and discuss the world instead. Here are a couple of clarifications:

Motivation. I use this pretty much synonymously with goal, which is entirely nonmysterious. Any optimization algorithm has a goal. Even a thermostat has a goal.

Intelligence. Following other contemporary AI futurologists, I take this to mean cross-domain ability of efficient optimization. When Turing distanced himself from the term intelligence, he clearly meant something more vague mysterious. When, in 1951, he wrote...

"My contention is that machines can be constructed which will simulate the behaviour of the human mind very closely. [...] Let us now assume, for the sake of argument, that these machines are a genuine possibility, and look at the consequences of constructing them. [...] It seems probable that once the machine thinking method had started, it would not take long to outstrip our feeble powers. There would be no question of the machines dying, and they would be able to converse with each other to sharpen their wits. At some stage therefore we should have to expect the machines to take control."

...he clearly meant, by "outstrip our feeble powers", something close to the modern AI futurology meaning of intelligence. So yes, I insist on Turing having taken the position that computers could become intelligent, in this sense of the word.
SvaraRadera
Svar
Aaron26 november 2018 kl. 09:41
I think that misses the point, you could add any number of additional algorithms to the mix (FR+Translation+Go+Chess+Checkers+Driving+Identifying songs played on the radio). Isn't it still obvious that this parallel algorithm is no more intelligent than its parts are?
SvaraRadera
Svar
Aaron26 november 2018 kl. 11:57
Is it suddenly a lot more intelligent if we break the Translation domain up into the English->French, French->English, English->Russian, etc, etc... domains?
SvaraRadera
Svar
Lars26 november 2018 kl. 13:22
The naive techno-optimism mentioned has always scared me. These optimists don´t seem to understand what it really means if artificial intelligence evolves beyond the level of human intelligence. (I would not bet on that "super-machines" will leave our planet. That´s just stupid wishful thinking)

When media reports on the future of AI, they seem to focus on either 1) the job losses associated with AI or 2) how wounderful our "AI-supported" future world will be. Techno-liberals (like Norberg) seem to ignore 1) and embrace 2). However, they never seem to think about what it really means if we create the Paperclip Maximizer...
SvaraRadera
Svar
x27 november 2018 kl. 10:36
Sorgligt att se Norberg upprepa Pinkers misstag! Men inte förvånande.

Spaning: det libertarianska ekosystemet kring CATO institute (där Norberg ingår) har länge stöttat klimatförnekande. Nu verkar de nysatsa på AI-riskförnekande. Detta kommer att sippra ner även till svenska klimatförnekarkretsar.
SvaraRadera
Svar

Lägg till kommentar