teleo-codex/domains/ai-alignment/permanently failing to develop superintelligence is itself an existential catastrophe because preventable mass death continues indefinitely.md


description: Bostrom's inversion of his 2014 caution -- non-development of SI means 170k daily deaths from aging and disease persist forever, qualifying as an existential catastrophe by his own definition
type: claim
domain: ai-alignment
created: 2026-02-17
source: Bostrom, Optimal Timing for Superintelligence (2025 working paper); Bostrom interview with Adam Ford (2025)
confidence: experimental

"It would be in itself an existential catastrophe if we forever failed to develop superintelligence." This single sentence from Bostrom's 2025 paper represents perhaps the most dramatic evolution in the AI safety landscape. The author of *Superintelligence* (2014), the foundational text warning about SI dangers, now explicitly argues that *not* building SI constitutes an existential catastrophe.

The argument is straightforward but its implications are radical. Approximately 170,000 people die every day from causes that a sufficiently advanced intelligence could plausibly prevent -- aging, disease, poverty, environmental degradation. If we accept Bostrom's own framework from "Superintelligence" that existential catastrophe includes permanent curtailment of humanity's potential, then a world where these deaths continue indefinitely because we chose not to develop the technology that could prevent them meets the definition. The catastrophe is not a single dramatic event but a continuous, normalized hemorrhage of human potential.

This inverts the precautionary framing that dominated AI safety discourse from 2014 through roughly 2023. In that era, the burden of proof sat with developers: demonstrate safety before scaling capability. Bostrom's evolved position shifts the burden: the status quo of human mortality is itself an ongoing catastrophe, and those advocating delay must account for the deaths that occur during that delay. This does not eliminate the case for caution -- Bostrom still acknowledges significant probability of catastrophic outcomes from misaligned SI -- but it reframes caution as a tradeoff rather than a default.

The Torres critique challenges this framing directly: being murdered by misaligned ASI differs fundamentally from dying of natural causes, and conflating the two is a category error. Additionally, the species could theoretically persist for billions of years without SI, so there is no death sentence requiring emergency surgery. These are serious objections. But Bostrom's counterpoint is that from a person-affecting utilitarian standpoint, the distinction between death from aging and death from AI matters less than the total expected loss of life-years across both scenarios.
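The expected-value framing behind Bostrom's counterpoint can be sketched numerically. Every parameter below except the 170,000 deaths/day figure is a hypothetical illustration, not a number from Bostrom's paper: an assumed average of life-years lost per preventable death, a rough world population, and an assumed average remaining life expectancy.

```python
# Illustrative expected life-years comparison for the delay-vs-deploy tradeoff.
# All parameters except DEATHS_PER_DAY are hypothetical, chosen only to show
# the shape of the argument, not to reproduce any published estimate.

DEATHS_PER_DAY = 170_000          # figure cited in the note
YEARS_LOST_PER_DEATH = 30         # hypothetical average life-years lost per preventable death
POPULATION = 8_000_000_000        # rough current world population
REMAINING_YEARS_PER_PERSON = 40   # hypothetical average remaining life expectancy

def delay_cost(delay_years: float) -> float:
    """Expected life-years lost to preventable deaths during a delay."""
    return delay_years * 365 * DEATHS_PER_DAY * YEARS_LOST_PER_DEATH

def deploy_cost(p_catastrophe: float) -> float:
    """Expected life-years lost if misaligned SI kills everyone with probability p."""
    return p_catastrophe * POPULATION * REMAINING_YEARS_PER_PERSON

# Under these assumptions, a 10-year delay costs about 1.86e10 expected
# life-years -- equivalent to roughly a 6% catastrophe risk from deployment.
print(f"10-year delay:       {delay_cost(10):.3e} life-years")
print(f"1% catastrophe risk: {deploy_cost(0.01):.3e} life-years")
```

The point of the sketch is only structural: once both sides of the ledger are denominated in expected life-years, "wait" stops being a free default and becomes a bet whose price depends entirely on contested parameters like these.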
