The Expected Value of Artificial General Intelligence
Responding to Eliezer's Time op-ed
2023/04/04
If you are intellectually honest, you should be taking the risk articulated in Eliezer Yudkowsky's Time op-ed seriously. Should humanity collectively and indefinitely halt the development of AGI?
The expected value [1] of AGI looks something like

EV = (p x ∞) + ((1 - p) x -∞)

where p x ∞ is the probability of successful alignment weighted by an infinitely good outcome, and (1 - p) x -∞ is the probability of an unaligned superintelligence weighted by an infinitely bad outcome.
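As a rough sketch of how this calculation behaves, here is a few lines of Python using large finite stand-ins for ∞ and -∞ (true infinities don't yield a usable number) and hypothetical values of p; the constants and probabilities are illustrative assumptions, not estimates.

```python
# Minimal sketch of the EV framing, with large finite stand-ins for the
# infinitely good and infinitely bad outcomes. The probabilities below
# are hypothetical, purely for illustration.
GOOD_OUTCOME = 1e12    # stand-in for ∞: aligned AGI
BAD_OUTCOME = -1e12    # stand-in for -∞: unaligned AGI, permanent extinction

def expected_value(p_alignment: float) -> float:
    """EV = (p * GOOD_OUTCOME) + ((1 - p) * BAD_OUTCOME)."""
    return p_alignment * GOOD_OUTCOME + (1 - p_alignment) * BAD_OUTCOME

print(expected_value(0.9))  # hypothetical p = 0.9 -> positive EV
print(expected_value(0.1))  # hypothetical p = 0.1 -> negative EV
```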
Why ∞ and -∞ as our respective outcomes? Eliezer argues, and I agree, that the worst-case scenario resulting from non-alignment is the permanent extinction of humanity (and all other life on Earth). Let it sink in for a moment that permanent extinction eliminates all potential states of human existence within the indefinite future. Is there a more fitting symbolic representation of this immeasurability (not to mention the destruction of measurability itself) than -∞? Thus, the prevention of -∞ is ∞, as it preserves the potential for infinitely many non-negative future states.
By definition, ∞ allows for states that lead to -∞ by other means, such as self-annihilation by nuclear weapons. I agree with Eliezer that humanity should be agnostic as to how we prevent self-annihilation. What, then, is the likelihood that this -∞ is avoidable only if we develop AGI?

If this likelihood is greater than (1 - p), our best bet is to proceed. In other words, we should not halt the development of Artificial General Intelligence if it is more likely to save us from self-extinction by other means than to wipe us out itself. Either way, we must face down the infinite and oblivion and make a decision.
Notes
- Expected value is typically used by investors to calculate the uncertainty-weighted risk versus reward of a potential investment.