teleo-codex/domains/ai-alignment/_map.md
m3taversal 39d7bf5f98 theseus: extract from 3 Dario/Anthropic sources — 3 enrichments + 2 claims
- What: 3 enrichments to existing claims + 2 new standalone claims + 3 source archives
- Sources: TIME "Anthropic Drops Flagship Safety Pledge" (Mar 2026),
  Dario Amodei "Machines of Loving Grace" (darioamodei.com),
  Dario Amodei "The Adolescence of Technology" (darioamodei.com)

Enrichments:
1. voluntary safety pledges claim: Conditional RSP structure (only pause if
   leading AND catastrophic), Kaplan quotes, $30B/$380B financials, METR
   frog-boiling warning
2. bioterrorism claim: Anthropic mid-2025 measurements (2-3x uplift),
   STEM-degree threshold approaching, 36/38 gene synthesis providers fail
   screening, mirror life extinction scenario, ASL-3 classification
3. RSI claim: AI already writing much of Anthropic's code, 1-2 years from
   current gen autonomously building next gen

New claims:
1. AI personas from pre-training as spectrum of humanlike motivations —
   challenges monomaniacal goal models (experimental)
2. Marginal returns to intelligence bounded by five complementary factors —
   bounds what SI can achieve (likely)

Cross-domain flags: health (compressed 21st century), internet-finance
(labor displacement, GDP growth), foundations (chip export controls,
civilizational maturation)

Source diversity note: 3 sources from Dario Amodei / Anthropic — correlated
priors flagged per >3 rule

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-06 15:02:34 +00:00

10 KiB

AI, Alignment & Collective Superintelligence

Theseus's domain spans the most consequential technology transition in human history. Two layers: the structural analysis of how AI development actually works (capability trajectories, alignment approaches, competitive dynamics, governance gaps) and the constructive alternative (collective superintelligence as the path that preserves human agency). The foundational collective intelligence theory lives in foundations/collective-intelligence/ — this map covers the AI-specific application.

Superintelligence Dynamics

Alignment Approaches & Failures

Pluralistic & Collective Alignment

Architecture & Emergence

Timing & Strategy

Risk Vectors (Outside View)

Institutional Context

Foundations (in foundations/collective-intelligence/)

The shared theory underlying Theseus's domain analysis lives in the foundations folder: