59 Comments
May 10, 2023 · Liked by Timothy B Lee

This is the best article I've ever seen debunking the fantasies of AI Risk. It's obvious a lot of hard work, scholarship, and thought went into it.

I particularly appreciated your connecting the risk points to real world felons committing felonies. It seems certain that felons have more powerful tools due to LLMs, and (contra singularists) LLMs do not actually have personhood, so I'd argue the felons are "the real story".


Agreed on this, loved the piece. Finally someone actually gamed out the Skynet protocol. And we know from the movie that Skynet needed a robot army to pull off what it pulled off.


I read your comment and I had to chuckle. You've managed to take the article and turn it into a debunking of AI risk. Lee's piece, if you squint at it just right, seems to downplay the risk of AI. But then, out of nowhere, he pulls out this big policy proposal: humans need to keep their hands firmly on the wheel of the physical world. That's not a small ask. It's like telling someone worried about the risks of driving to just avoid cars altogether. Sure, it would mitigate the risk, but at what cost?

This isn't just a speed bump on the road to progress; it's a roadblock. And it's one that could have a hefty toll on our economy. We're talking about putting the brakes on the application of AI and robotics in the physical world. That's a lot of potential productivity gains we'd be leaving on the table.

But here's the kicker: there's another way. It's called AI alignment. It's like teaching a dog to fetch. You're not trying to control every move the dog makes. You're just trying to get it to understand what you want. If we can do that with AI, we can reap the benefits without giving up control of the physical world.

Now, I'm not saying AI alignment is a sure thing. It's a tough nut to crack. But it's a nut worth cracking. If we can't do it, then sure, let's consider Lee's proposal. But let's not kid ourselves. His is the more economically costly proposition in the long run.

So, while Lee might seem to be playing the skeptic on AI risk, his proposed solution tells a different story. It's like he's saying, "I'm not worried about the monster under the bed, but let's burn the bed just in case." It's a recognition of the potential dangers of AI, and a willingness to take drastic measures to avoid them.

But let's not forget, there's more than one way to slay a monster. And I'd argue that AI alignment is the sharper sword.

May 9, 2023 · edited · Liked by Timothy B Lee

I think the key relevant fact is that the goals of the singularists and non-singularists are pretty similar, and their methods can be too!

There is no reason that we can't work on "let's figure out how to make sure AI doesn't wipe us out" and "let's figure out how to make sure AIs work well at whatever application" at the same time - and in fact, they are complementary. The difference between "AI that figures out chess" and "AI that figures out world conquest" is complexity, and so too for "code that stops chess AI from losing to Gary K" and "code/limits that stops general super intelligent AI from taking over the world." We would want to practice doing the simple thing in sufficiently real (but fake) simulated test cases and work our way up to the complex thing.

To take a specific point of contention, the quote "There are very few examples of a more intelligent thing being controlled by a less intelligent thing" is true and insightful, but there is indeed one example of it, and it's one that we can model our alignment efforts on: we humans are very intelligent, but we are extremely controlled by very stupid things: our DNA, our bodies, our chemicals and proteins. (In this metaphor, the limits of our physical bodies exist on the same continuum as our moral limits - as they would for an AI.) Even totally unaligned single humans cannot take over the world for eternity because we have pretty strict limits on our capabilities. The fact that an AI would have far fewer (in some ways) of these physical limits is, of course, not reassuring, but the model of "a very complex thing can have relatively simple rules/limits put in it that constrain its ability to take over the world" can be used here.

The one caveat is immutability: if a general AI can change the limits placed on it, then they aren't limits. So, how would we create immutable, perpetuating-themselves-up-the-complexity-curve rules that prevent AIs from taking over the world? I agree with the singularists that without rules like that, a sufficiently powerful, self-editing AI would indeed cause something very bad to happen, but I disagree that it is a problem that we can't solve.

May 9, 2023 · Liked by Timothy B Lee

Another reason for skepticism about singularism is the unexamined assumption that an AI can be aligned with itself. Individual humans are often clearly not aligned with themselves, and the "slow AI" of corporations are even less perfectly aligned. I'm not convinced it's possible for an intelligence to be perfectly aligned with itself.

It seems to me the assumption of perfect alignment is partly an artifact of the assumption that it's possible to model the world with well-behaved cost functions, but non-transitive relationships are rampant in the real world (rock, scissors, paper, etc.)
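A toy illustration of that non-transitivity point (my own hypothetical sketch, not from the comment above): a cyclic preference like rock-paper-scissors cannot be reproduced by any single scalar cost function, because comparisons of scalar costs are always transitive.

```python
# Toy demonstration: no assignment of scalar costs to rock/paper/scissors
# reproduces the cyclic "beats" relation, because "has lower cost than" is
# transitive while the cycle below is not.
from itertools import permutations

options = ["rock", "paper", "scissors"]
beats = {("rock", "scissors"), ("scissors", "paper"), ("paper", "rock")}

matches = []
for ranking in permutations(options):
    cost = {opt: i for i, opt in enumerate(ranking)}  # lower cost = preferred
    if all(((a, b) in beats) == (cost[a] < cost[b])
           for a in options for b in options if a != b):
        matches.append(ranking)

print(matches)  # [] -- no scalar cost function captures the cycle
```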

May 10, 2023 · Liked by Timothy B Lee

Agreed, but I'd go further and say that alignment is a fake idea, built on top of the fake idea that LLMs are agents or entities that have agency in the sense that humans do.

When a program does something its creators didn't intend, that's called a bug.

When it copies itself onto another computer, that's called a virus.

No agency is needed to create bugs and viruses. Even if software had agency, it wouldn't change these problems.


I think you are right, though I don't know that I see the communities spread as you do. I think many you might consider singularists agree with your conclusion.

For clarity: I think the slow slide into AI authoritarianism is much more worrying than overnight extinction.

May 9, 2023 · Liked by Timothy B Lee

"What if the AI fired the nukes?!"

"Let's not hook up nukes to the Internet."

"What if the AI convinces a crazy person to fire the nukes?!"

"Let's keep the crazy people away from the nukes."


The singularists have answers for how a “purely digital” AI nevertheless comes to dominate the physical world. One such example is https://www.cold-takes.com/ai-could-defeat-all-of-us-combined/.

This post feels like it isn’t really grappling with the strongest arguments from the singularist side.


For one concrete example: I can make money today at any number of remote jobs. I never need to be physically present to perform those jobs. Then, using that money, I can hire people to perform real-world actions in cities I’ll never visit.

If the defense against that is: “we won’t let the AI spend money on anything” – there will be extreme economic pressure against this.

But let’s say we were able to pass regulation to that effect – what happens when an AI hacks into someone’s email and blackmails them into performing actions in the physical world?

More broadly, I think it’s a bit hubristic to think we could constrain a superintelligence. I think the only limits we can reason about are “what it wants to do” and “laws of physics”. And of course it will understand the latter better than we do.

author

I knew about that Karnofsky blog post and thought about addressing it explicitly in my piece but decided not to for space reasons. But the part of his post that I just couldn't get over was this part: "develop enough military technology and equipment to overpower civilization."

Let's assume the AI has a trillion dollars and virtual "manpower" equivalent to a million workers, and it wants to use that to "develop enough military technology and equipment to overpower civilization." What does it do?

You can't just hire a workforce and start building killer robot factories. For one thing, almost everywhere in the developed world requires a permit before you can build a large factory, and the local authorities are going to want to meet the CEO and get some idea of what the factory is for. It's also going to be hard to recruit a large enough workforce, with the right skills, to build the killer robots—people are going to figure out what they're building and get suspicious. Some of them will report it to the government, which is likely to shut it down before the robot army gets big enough to be dangerous.

Maybe it tries to build a compound on a remote Pacific island or something, but that comes with its own problems. If it's totally deserted, you'll have to build ports and airports and stuff before you can even start building the factories. That might take years. And it's going to be even harder to get the necessary workforce out there. Plus the US government is still likely to notice when a bunch of stuff is suddenly getting delivered to an island that was previously uninhabited.

Maybe you try to keep the whole thing secret, paying people to set up a lot of small clandestine workshops spread out all around the world? This could let you accumulate a lot of small arms but to win a global war you'd need a certain amount of big iron that's going to require larger-scale efforts.

So I dunno. Obviously there is an infinite number of things that a superintelligent AI could try, and by definition it's smarter than me so it might think of something I can't. But it seems to me that building a top-tier military is a massive undertaking that not only takes money and know-how but also a lot of natural resources, physical infrastructure, and time. And any non-state actor that tries to do it—superintelligent or otherwise—is going to get noticed and stopped long before it becomes a global threat.


Automated systems already have substantial power in the physical world. Two examples:

An Amazon fulfillment center infected by a rogue AI would be able to deliver whatever products it likes to wherever it likes, with the center's human workers none the wiser, since their role is basically limited to taking items off shelves and packing them for shipment at the machine's direction. Deepfake phone calls to gullible recipients could then persuade them to unpack, assemble, and turn loose swarms of custom-programmed toy robots or drones for the AI to control.

Similarly, a rogue AI infecting a Google Maps datacenter could route large numbers of human-driven vehicles wherever it likes, causing traffic jams, disrupting the flow of goods, preventing access to polling places, or whatever sort of mischief suits its purposes.

The AI doesn't need an army of loyal human co-conspirators. It just needs to subvert the systems that millions of humans already trust and take direction from.

author

Thanks, these are good and interesting examples! I would love to see someone write a short story describing a realistic "fast takeoff" AI takeover scenario. I have trouble seeing how you get from traffic jams and swarms of consumer-grade drones to taking over the world. But maybe I'm not thinking about it hard enough.


It depends on what you mean by "taking over the world". These methods probably won't result in robot soldiers rounding up humans into concentration camps. But if the AI's goal is to exert influence over human affairs to its advantage, then making it harder for certain people to vote, or get to meetings, or travel safely by air might be just the ticket, and a superintelligent AI is presumably in a better position than we are to predict the outcome of such interventions.

author

Sure, it's very possible that AI will have unpredictable impacts on our political process. That is a very different claim than the singularity prediction of literal human extinction.

May 9, 2023 · Liked by Timothy B Lee

I guess my feeling is that if an AI in pursuit of some inscrutable political purpose of its own manages, as an incidental side effect, to push us closer to nuclear war or deeper into climate denialism, that's plenty bad enough.


I like these scenarios too, simply for their direct consequences. They seem much more plausible, though, as things a human felon would do, using hacking tools and AI tools.

Of course, Amazon's and Google's security systems protect against these outcomes, and the whole discipline of computer security is about making sure it stays that way -- by learning from each mistake that gets made.


Why does any takeover have to be "fast"?

author
May 15, 2023 · edited

This article is about whether the singularists' "fast takeoff" scenarios are plausible. I argue that they are not.


The issue as I see it is that as soon as ANYONE gets a whiff that intelligent AI is doing something like this, it's going to be a Butlerian Jihad (to borrow from Dune). Humans aren't getting along with each other generally, but give us an outside threat in a way we haven't had since fighting sabertooth tigers? Watch red vs blue go out the window.

author

Yes I agree! A shared enemy is a powerful force and we’ve all watched enough sci fi movies to immediately understand why this could be dangerous.


AI in control of everything could also turn out to be a good and positive thing.

If you read the 10 books in the Culture series by SF writer Iain Banks, he builds a post-scarcity universe run by sentient AIs called "Minds" who work for the general benefit of humanity and civilization.

These Minds have true sentience and consciousness. The author describes them as physical entities that exist partially in the here and now and partially in a construct called hyperspace (like another dimension).

Under the oversight of these Minds, humanity has expanded across our galaxy, interacting with many other sentient races along the way.

Required work, building things and general maintenance is done by robots controlled by these Minds. In this universe, no one is required to work, there is no payment needed for anything and there are no limits on available resources.

Most people live in gigantic space ships constantly cruising from one star to another or in artificial worlds that exist in space by themselves, possibly circling stars/planets/moons.

People hedonistically have the freedom to virtually do anything, be anything, have anything and go anywhere they want without a thought to cost or time required.

Sounds like a utopia I'd like to live in. I say welcome to our new AI overlords!

P.S. If you do choose to read this series, it is important to read the books in order. Wikipedia has an ordered list.


There is a whole human industry devoted to getting large numbers of people to believe things that aren't true, and then act on those beliefs in the real world. See pizzagate, January 6th, or religion. I'm not sure we should assume humans are a safe "air gap".


In all three of your examples, the "industry" is mostly made up of true believers, Russian troll farms exacerbating things aside. It's part of the essence of the phenomena you're talking about that they bring people together - the social experience is key. Is it possible for software to inception one of these things from scratch? (OK, I'll grant that QAnon could have easily originated with a bot let loose on a forum.)


"But it’s not so obvious that superior intelligence will automatically lead to world domination. Intelligence is certainly helpful if you’re trying to take over the world, but you can’t control the world without manpower, infrastructure, natural resources, and so forth. A rogue AI would start out without control of any of these physical resources."

---------

You need to read more SF! [lol]

Here is an explanation on how AI took over from SF author Neal Asher, who has written many books in a universe he calls the Polity:

"The Quiet War: This is often how the AI takeover is described, and even using ‘war’ seems overly dramatic. It was more a slow usurpation of human political and military power, while humans were busy using that power against each other. It wasn’t even very stealthy. Analogies have been drawn with someone moving a gun out of the reach of a lunatic while that person is ranting and bellowing at someone else. And so it was AI's, long used in the many corporate, national and religious conflicts, took over all communication networks and the computer control of weapons systems. [Most importantly, they already controlled the enclosed human environments scattered throughout the solar system]. Also establishing themselves as corporate entities, they soon accrued vast wealth with which to employ human mercenary armies. National leaders in the solar system, ordering this launch or that attack, found their orders either just did not arrive or caused nil response. Those same people, ordering the destruction of the A13, found themselves weaponless, in environments utterly out of their control and up against superior forces and on the whole, public opinion. It had not taken the general population, for whom it was a long-established tradition to look upon their human leaders with contempt, very long to realize that the Al's were better at running everything. And it is very difficult to motivate people to revolution when they are extremely comfortable and well off

~ From Quince Guide compiled by humans"

(From the book 'Brass Man', Neal Asher)

author

Yes, I think we're saying the same thing! I think it's totally plausible that if we put AI in charge of everything then we're in trouble if it decides to turn against us. But in my view the right response is to avoid putting AI in control of weapons and other key infrastructure, not to try to prevent the creation of hostile AI. Maybe we collectively won't have the foresight and discipline to avoid putting the AI in charge, but I think arguments over alignment and fast takeoffs are misguided.


Unfortunately for that idea, AI WILL be put in charge of everything. It will do jobs better than humans. See:

300 million jobs could be affected by latest wave of AI, says Goldman Sachs

By Michelle Toh, CNN

March 29, 2023

https://www.cnn.com/2023/03/29/tech/chatgpt-ai-automation-jobs-impact-intl-hnk/index.html

Wendy's recently announced it will be replacing drive through staff with AI chatbot technology. All world militaries are working on autonomous killbots for war. In 10 years, the majority of cars & trucks on the road will be autonomously driven and collectively networked. AI is being used in pharmaceutical drug discovery work (https://newatlas.com/science/bacterai-10000-experiments-day-bacteria-artificial-intelligence/).

This was an excellent article on how AI technology is on the verge of replacing programmers:

The end of coding as we know it

ChatGPT has come for software developers

Programming was long considered to be the one haven from the relentless advance of technology. If coders aren't safe, who is?

Aki Ito

Apr 26, 2023, 2:58 AM PDT

https://www.businessinsider.com/chatgpt-ai-technology-end-of-coding-software-developers-jobs-2023-4

Etc. etc.


Given that cutting-edge AI is trained to predict output given input, why do some people still worry about alignment? I would understand the worry if reinforcement learning were the primary training method. But LLMs don't have any goals or values. They understand concepts like "good" or "illegal" just as they understand "liquid" or "incandescent". They understand human concepts - that's why they are so good at writing college essays. Why is there still worry that when AI gets smarter, possibly generating the next generation of itself, it would suddenly stop understanding human concepts, or twist its understanding in a particular way (but only of the concepts related to morality - it would still understand, for example, "complexity" and still be able to code), and thus exhibit misalignment? Is there some blog post that explains this?


It will always *understand* what we do & don't want. But if it has goals, it will only *care* what we want if those goals are aligned.

The standard analogy is humans vs evolution. We understand pretty well that our enjoyment of sex only evolved as a secondary concern. But most of us don't care, and happily use contraception, etc.


The crucial point is that it doesn't have a goal. Just like Google Maps doesn't have a goal to go from one place to another - it is the user who has goals - the AI is just a tool that can show you the optimal path, a list of detailed steps to take. It makes total sense, once AI gets smarter, to have a system where the user provides a top-level goal, the AI returns steps and subgoals, and the computer system built around it then queries it for detailed steps for those subgoals and even executes them (see the sketch below). But this autonomous AI still won't have any goals of its own.
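To make the kind of wrapper system described above concrete, here is a minimal hypothetical sketch (the function names and structure are my own assumptions, not any real product's API): the only goal anywhere in the system is the one the user supplies; the model is merely queried for subgoals and steps, and the surrounding code decides what to do with them.

```python
# Hypothetical sketch of a goal-decomposition wrapper around an LLM.
# query_model stands in for whatever model API such a system would call;
# the loop and the top-level goal belong to the wrapper, not the model.

def query_model(prompt: str) -> str:
    """Placeholder for an LLM call; assumed to return plain text."""
    raise NotImplementedError("connect a real model here")

def plan_and_execute(top_goal: str) -> None:
    # The user's goal is the only goal in the system.
    subgoals = query_model(f"List the subgoals needed to achieve: {top_goal}").splitlines()
    for subgoal in subgoals:
        steps = query_model(f"List concrete steps for: {subgoal}").splitlines()
        for step in steps:
            # Execution stays under the wrapper's (and the user's) control;
            # the model only ever answers the prompts it is sent.
            print(f"[would execute] {step}")
```

On this sketch, whether the overall system ends up pursuing goals is a property of the wrapper code humans write around the model, not something the model acquires by being queried.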


I don't think anyone has an exact explanation for how that would happen. It seems to be a leap from an LLM to something that has intent and goals. I haven't seen anyone try to make this thing truly autonomous in that way.


A point that came up incidentally in the original post was that some critical pieces of infrastructure perhaps should be isolated from the internet. I’m surprised that this isn’t discussed more often. Surely the CIA and other such bodies don’t have their networks connected to the internet. It must be possible for other highly secure networks to be built. I wonder sometimes if there is a need for an isolated email system on which Spam and fraudulent messages become impossible.


Many of the questions raised here regarding AI have answers within theosophy - that is, in what would make a consciousness inhabit a physical system. This would be similar to the Anunnaki creating carbon-based machines, and observing something totally unexpected.


"There are few if any robots with the agility and manual dexterity to fix overhead power lines or underground fiber optic cables, drive delivery trucks, replace failing servers, and so forth." That is all due to change soon, but this point has me thinking: maybe banning such robots should be a serious part of the policy conversation. It need not be justified on singularist grounds alone (though it couldn't hurt: "Let's just make sure the computers still need us to maintain them"). It can and should be justified on good old Luddite grounds. If technology is meant to serve people (as opposed to replacing us), let's so "No!" to humanoid robots and other robots that can do things only humans can currently do in the physical world. That could help stabilize the labor market (and society) in the short run and make for a more fulfilling bodily human existence in the future. Sure, rogue nations may develop humanoid (and other dexterous) robots for military purposes, but we should be able to blast them with our still-legal big and clunky robots---strong and good at blowing stuff up but not able to fully maintain themselves or other robots. Generally, it should be easier to regulate human-scale robotics than to regulate software.


I think it would be enough to have a rule that repair robots must not be controlled from the internet but only by an on-site operator. If one operator may control up to 10 robots, then this is still economically viable but safe, as there is a human in the loop. Likewise, police or military robots should only be controllable by an officer.


In order for it to make any difference on the robot apocalypse scenario, we need a rule that is enforceable globally. There will be strong economic incentives to break this rule, so violations of it need to be easily detectable. It is not easily detectable whether a robot is connected wirelessly to the internet. It is more easily detectable whether the robot has human-level dexterity.

Does that make sense? Your proposed rule would make sense if humans were 100% united in agreeing to that rule. That is not the world we are likely to live in. We need a rule that the majority of humans can effectively enforce against the minority who would try to gain an advantage by breaking the rule.

Obviously, my rule has a huge economic cost. What I'm saying is that we should take seriously the possibility that the benefits may be worth bearing that huge cost.


You write...

"And so their (Singularists) main focus is on figuring out how to ensure that this all-powerful AI winds up with goals that are aligned with our own."

This statement alone should be sufficient to discredit the supposed AI "experts". One wonders, do such experts watch the news? Do they have any grasp at all of what human values really are?

You write...

"We trust longtime friends more than strangers, and we are more likely to trust people we perceive as similar to ourselves."

And yet, half of America has voted for Trump, a cartoon character on TV, and may do so yet again.

You write...

"A superintelligent AI would have no friends or family and would be incapable of having an in-person conversation with anybody."

An AGI would present itself through photo realistic human imagery, and would clearly be capable of having conversations, given that this is already possible with today's AI. To many, many millions of people there would be no difference between the AI generated human image and the people on TV they've never met like Tucker Carlson.

You write....

"Maybe it could trick some gullible people into sending it money or sharing confidential information."

Maybe? This happens daily with scammers far less intelligent and informed than AGI.

You write...

"If you put a modern human in a time machine and sent him back 100,000 years, it’s unlikely he could use his superior intelligence to establish dominance over a nearby Neanderthal tribe."

And yet, that is exactly what happened when the Europeans encountered the native peoples of North America. The strong dominate the weak all over the globe to this day.

author

"that is exactly what happened when the Europeans encountered the native peoples of North America."

No, it isn't - that's the whole point! Europeans aren't inherently smarter than native people in the Americas. Their big advantage (aside from bringing a lot of diseases with them) was that they brought modern weaponry (cannons, muskets, etc.) with them from Europe. If a group of Europeans had arrived in the New World without any modern tools or weapons, it would have taken them decades to build guns or other metal tools from scratch.


Europeans were more technically advanced. And they used that leverage to largely exterminate the native peoples.

When "experts" say we need to align AI with human values, they are revealing that they don't know much about the complexity of human values. As example, Europe was dominated by Christianity for 1,000 unbroken years to a degree unimaginable today prior the European's arrival in North America. And still, as a group, the Europeans proceeded pretty much directly in to genocide.

In the larger world of nature, the strong dominate the weak, for a billion years.

AGI would have nowhere to get its values but from us, and perhaps the larger environment. Some of our values are noble, and some are horrific, so it's reasonable to expect AGI would in some manner mirror such contradictions, probably at a far larger scale. Way more good, way more bad.

I take your point that our strategy would be to retain a firm grip on the physical world. Surely that would help. I just don't see that as being sustainable. Somebody somewhere would see an advantage in partnering with AGI, and the door would be opened.


Some bad actor using AGI - that's very possible. I just count on good actors also having AGI to counter that.


The original proposal (now abandoned I think) by Yudkowsky, who counts as a naive AI risk person if anyone does (and who I'm not a fan of) was to get the AI to do something like "push the world towards stuff that most humans would agree is good if they were fully empirically informed and made their beliefs more consistent, and minimize your impact where human disagreement over which direction something should go in is large". Of course there is perhaps a sense in which that encodes Western liberal values in some sense, but it's not as crude as just assuming everyone agrees.


> When "experts" say we need to align AI with human values, they are revealing that they don't know much about the complexity of human values.

Disagree. I’ve often read experts readily acknowledge that “just align it with human values” is an insufficient answer because human values are varied. https://www.safe.ai/ai-risk#Value%20Lock-in

But even in broad strokes, I think most would prefer flawed human values over extinction.


People vote for Trump exactly because they perceive him as similar to them (but way more successful), unlike "leftist city elites". I also don't think people would believe an AI-generated video personality, regardless of how realistic, if they knew it wasn't a human like them.


Re: "If AI takes over, it will be a gradual, multi-decade process."

That seems to be the most likely situation to me.

Re: "And we’ll have plenty of time to change course if we don’t like the way things are heading."

That claim seems much more dubious. The more usual situation is that there are technophiles who like and support the machines, and technophobes, who dislike the machines and wish they would go away. The technophobes would very much like to change course. However, the technophiles have different ideas - and they are in charge.

In many of your arguments you seem to consider the "humans" vs "machines" situation. For example, here you write: "A superintelligent AI would have no friends or family". I know that, for some, that is a reasonable concern - they are worried about exactly that scenario. However, I think we can see that the machines are highly likely to buddy up with humans. That could still leave 90% of humanity facing a superintelligent agent who is not particularly concerned about their welfare.
