Why I'm not afraid of superintelligent AI…

Nov 15, 2023

The world isn't a giant chess game.

78 Comments

Nov 16, 2023Edited

I appreciate you writing this article! I've been wondering what your thoughts are on AI risk ever since you started the blog.

As some background, I first encountered you on Full Stack Economics, and when you announced this blog I subscribed here as well. Thus far I've found it very well-written and informative. In particular I loved your deep dives into self-driving technology, and found them very useful for forming my own opinions in that arena. You're one of my primary sources for news on contemporary AI developments, and I really appreciate the blog.

With that context, I want to say that I found this article to be very disappointing. It barely engages with the arguments in favor of AI risk, either handwaving them away without justification or omitting them entirely. Several sections even contain relatively simple mathematical errors that have nothing to do with AI in particular.

I'm writing up this comment because I believe AI to be by far the most impactful technology on the horizon, and it's vital that we can make good predictions on its impacts. If AI is indeed a threat to humanity, that would eclipse the importance of nearly every other issue humanity faces, and would justify strong measures to prevent it. And if AI is *not* such a threat, it has the potential to end poverty and war, saving millions of lives. In the latter case, we have a responsibility to develop it as quickly as possible. Figuring out which prediction is correct is *really important*.

To address things one at a time:

Chess:

You say that people have been mislead by chess, because chess follows simple deterministic rules and can therefore be solved by algorithms, which doesn't apply to the real world. This is a category error; there's no sharp delineation between those two domains. The real-world, just like chess, follows a set of relatively simple deterministic rules called "physics". Each "move" leads to a known outcome, which can be brute-force searched.

The difference, of course, is that the real-world game tree is vastly larger. The average move in a game of chess has about 35 options, compared to the number of particles in the observable universe is about 10^80. However this is less relevant than you might think, since chess's game tree is *already* large enough to be intractable to brute-force searches deeper than just a few moves as in your computer science class. Chess-playing algorithms succeeded by doing aggressive tree-pruning to get the search space down to a manageable size, along with heuristic arguments hardcoded in by human experience.

The piece valuation you used in your program is exactly such a fuzzy heuristic; nothing in the rules of chess assigns a value of "5" to a rook, and the actual usefullness of a rook varies wildly based on the exact position. Humans played thousands of games of chess, learned via trial and error and intuition how useful each piece was relative to each other piece, and then hardcoded that into their computers. A chess-playing algorithm like yours is *already* doing exactly the sort of knowledge-based heuristic approach that you claim computers aren't good at.

Early chess-playing pioneers like Deep Blue did rely on humans to explicitly program in those heuristics; they weren't doing the foundational reasoning themselves. But that changed in 2017 with AlphaZero, which learns chess entirely from scratch via neural network. It trained by playing chess against itself for only 9 hours and was then pitted agains the best human-coded chess-playing program, StockFish. AlphaZero won 25 games to 3.

The sort of pure algorithmic approach to games that you describe can only be used on very simple games like tic-tac-toe, and most of the things that computers have recently started doing much better than humans at use fuzzy heuristics learned by trial and error, just like humans do. AlphaStar, for example, is a neural network that can play Starcraft better than almost all humans. (Starcraft has a vastly larger game tree than chess, being more akin to the real world in the precision with which different actions can differ, and is also a hidden-information game where the players have to reason probabilistically about what the opponents have access to or may do.) OpenAI Five does the same with Dota 2. And outside of video games, DALL-E has far surpassed human artists in generality and visual beauty and fidelity. (It's still very poor at understanding an English description and converting that to a conceptually corresponding image, but that's a different skill.)

Your understanding of the real world also seems quite simplistic in certain domains. You say "The simplicity and predictability of chess allow computers to “look ahead” and anticipate the likely consequences of any potential move. Most real-world problems are not like that." and talk about military planning as an example of this; much of military strategy is doing exactly what you claim they don't do! The field of mathematical game theory was developed largely as a way to predict the actions of other nation-states in response to possible decisions, just like one does in chess. As you point out, real-world planning is a partial information game rather than a perfect-information game like chess, but that doesn't really have anything to do with the ability to plan ahead. Planning ahead in a hidden-information game looks very much the same as in chess, except that you ascribe probabilities to each of your opponent's moves and calculate the move you can take with the highest expected value.

There's a reason why game theory and wargaming both have "game" in their names; there's no sharp delineation between "game" and "geopolitics"; they're both complicated systems of rules, agents, incentives, and payoffs. Geopolitics is the same kind of thing as board games, just a more complicated instance.

Knowledge vs. computation:

If I understand your argument correctly, it's that general artificial intelligence will require more training data than humans currently have available to give it, and that much of the data we do have is redundant.

I think you actually understate part of this argument. The first important question is whether neural networks are capable of general intelligence *at all*. Our understanding of the human brain is extremely poor, and while neural networks are similar to them in many ways, they're also different in many ways. It's entirely possible that no amount of training data could ever get a neural network to human-level intelligence. (For more on this I'd highly recommend the debate between Scott Alexander and Gary Marcus: https://www.astralcodexten.com/p/somewhat-contra-marcus-on-ai-scaling)

But assuming that neural networks are capable in theory of general intelligence, it seems unjustified to point to limited training data as a relevant constraint.

* You point to a paper that estimates we'll run out of training data by 2026. This may be true, but what about the ~2.5 years before that happens? We've already seen dramatic improvement from GPT-2 to GPT-4, and if there is some point at which the amount of training data becomes "enough", you haven't provided any estimate of where exactly that point is, and it's entirely possible that it's above GPT-4 but below the total amount of data we have to throw at GPT-5.

* Humans are generating data at a frantic rate that's only increasing as the internet plays a larger and larger part of our lives. We may "run out" of unused training data in 2026, but that would only limit growth in training dataset size to the amount of data that humanity produces in a year, which is... a lot. Even if the amount of data needed for GAI is above the 2026 threshold, we'll still get there eventually, potentially only a few years later.

* You focus on human-created data, such as English passages. This is presumably because current leading AI models are language models, which is because that's what people want. AIs that can predict human language would be very useful to humanity, so that's where most of the funding goes. But when we're talking about *general* intelligence, capable of reasoning about the world from first principles and learning in much the same way that a human baby does, why would it need to be training on human language to start out with? There's nothing fundamentally special about humans, we're just a particularly complicated part of physics. The Large Hadron Collider produces more than 1 petabyte of data *per day*. The Event Horizon Telescope collected 5.5 petabytes of data in April of 2018. What happens when someone pipes all of that into a massive AI model? Nobody's done it yet because anything short of general intelligence will be unhelpful to the physics community, so the funding just isn't there. But if the rapid pace of increasing interest in AI continues, someone will do it eventually, and an AI capable of predicting physics is also capable of predicting human behavior as a side effect, since humans run on physics.

(Continued in a reply, I ran into the comment length limit.)

Expand full comment

Reply (4)

Timothy B. Lee

Nov 21, 2023

Hi Issac! Thanks for the thoughtful and thorough comment. I'm not ignoring you, just want to give this the detailed response it deserves.

Expand full comment

Isaac King

Nov 16, 2023

Nanotechnology:

I don't know anything about nanotechnology, so the extent of my engagement with that subject will be to point out that prediction markets currently put a 70% chance on sci-fi-style rapidly self-replicating and world-altering nanotechnology being possible within the laws of physics. (https://manifold.markets/IsaacKing/is-it-physically-possible-to-design?r=SXNhYWNLaW5n)

The much more important point is this: If a superintelligent AI exists in the real world and wants humanity out of its way, *we have already lost*. Whether it uses nanotechnology or some other method is irrelevant; maybe it can't built nanobots and it has to do it the old fashioned way by spending a few years manipulating human pasties into positions of political power, so what? Talking about specific weapon technologies is completely missing the point that in almost any contest between a vastly smarter entity and a vastly dumber entity, the smarter one is going to win. The dumber entity going "I can't think of how the superintelligence would beat me, so it must be impossible" is just further evidence of that entity's dumbness. I have no idea how Magnus Carlson would beat me if we were to play chess, I could not predict his moves in advance or even begin to explain his strategy, but beat me he would.

In general, expecting that you've thought of all possible vectors for attack in a complex system is extreme overconfidence. About AlphaStar, a professional Starcraft player said: "AlphaStar is an intriguing and unorthodox player — one with the reflexes and speed of the best pros but strategies and a style that are entirely its own. The way AlphaStar was trained, with agents competing against each other in a league, has resulted in gameplay that’s unimaginably unusual; it really makes you question how much of StarCraft’s diverse possibilities pro players have really explored." Anyone who has worked in computer security can tell you about "security mindset"; the understanding that you're going up against adversaries who are equally or more intelligent than you, and that the slightest gap in any system *will* be exploited. Just being "kinda sure" is unacceptable in these sorts of environments.

This whole section sounds like your plan is "build the malevolent superintelligence and just trust that no matter how much it wants to kill us, it won't be able to figure out a way to do it", which is just... a really bad plan.

General epistemics:

Many of your arguments strike me as very odd, even ignoring the specifics. Several appear to be of the form "AI wouldn't post a risk for several years, therefore it isn't a risk at all". For example you point out that developing advanced nanotechnology would likely take several years, and present this as though it's supposed to be reassuring. Same for the growth of training datasets, as I addressed earlier. While I'll take a little solace in the fact that I probably won't be dying next year, I'd like to live a lot longer than that. Risks that are 5, 10, or 15 years out are still very much worth worrying about in my book!

If a CIA analyst discovered a plot by Russia to invade the US, do you think they'd present this as "Russia is intending to destroy the United States, but don't worry, it's going to take them several years to scale up their manufacturing capacity before they can execute their plan."? Or would they present it as "RUSSIA IS INTENDING TO DESTROY THE UNITED STATES AND WE ONLY HAVE A FEW YEARS TO PREPARE, THIS IS AN EMERGENCY WE NEED TO GET ON THIS RIGHT NOW!"?

Separately, most of these arguments also take the form of "eh, seems unlikely, so it's not a problem". This is an exceedingly strange approach to take when what's at stake is all of humanity. Even if we generously assume that these considerations bring the risk down to just 1%, a 1% chance of everyone dying is equivalent to about 80 million deaths in expectation. Asteroid impact avoidance, one of the few types of existential risk that the government spends significant funding on, uses a much stricter cutoff. The Jet Propulsion Laboratory's Sentry monitoring system, for example, tracks near-Earth objects down to a 0.00001% chance of impact any time within the next hundred years. Or take a look at this exercise that NASA ran on a hypothetical emergency situation where a large asteroid is discovered to have a 1% chance of impact in October of 2036: https://cneos.jpl.nasa.gov/pd/cs/pdc23/PDC23-ImpactRisk-Epoch1.pdf (And this is for an asteroid that would "only" destroy an area the size of a single US state!)

Heck, even when you just consider just a single individual, 1% is still super high. The average healthy young American has a much less than 1% chance of dying in 10 years; you only start getting risks that high doing skydiving and other crazy stuff. People avoid asbestos like the plague just because of a few percentage point increase in cancer risk 20+ years down the road. I don't understand why we'd treat a risk to every human alive as being less important than a risk to a single human.

Final thoughts:

My point here is not that AI has a >90% or even >50% chance of wiping out humanity; much lower numbers seem reasonable to me. But the arguments presented in this article are deeply flawed, and show no such thing.

I'd encourage you to engage with the AI safety community in more depth, as they've put years of work into this field and have much more sophisticated models of what facts and discoveries would indicate a higher or lower amount of risk. Here's a list of high quality introductions to the subject that present the arguments why AI might be dangerous: https://manifold.markets/Nikos/best-existing-short-form-introducti and here's a list of counterarguments arguing that AI is unlikely to be all that dangerous: https://www.reddit.com/r/singularity/comments/143qbk7/best_rebuttals_of_the_doomer_case_against_ai/ (Though notably almost all of those authors would put at least a 1% chance on AI causing human extinction. The lowest I'm aware of is this summary: https://arxiv.org/pdf/2306.02519.pdf, which places it at 0.4%; still alarmingly high compared to our threshold for worrying about asteroid impacts.)

Expand full comment

Jan Matusiewicz

Nov 26, 2023

It is a good question if feeding LHC data into a large neural network would force it to develop physical theory or learn to "feel" what would happen. Like a human "feels" when jumping or swimming without knowledge of underlying physics. But "AI capable of predicting physics is also capable of predicting human behavior as a side effect, since humans run on physics." is an overstretch. A cutting edge narrow AI can predict protein folding based on amino-acids sequence. Predicting even how a they interact to build a single cell is way beyond reach. Not to mention predicting human behavior based on particle physics.

Expand full comment

Understanding AI

Why I'm not afraid of superintelligent AI…