Commit graph

1 commit

Author SHA1 Message Date
c06236474c theseus: add 5 claims from Bostrom, Russell, Drexler alignment foundations
- What: Phase 3 of alignment research program. 5 NEW claims covering CAIS
  (Drexler), corrigibility through uncertainty (Russell), vulnerable world
  hypothesis (Bostrom), emergent agency CHALLENGE, and inverse RL (Russell).
- Why: KB had near-zero coverage of Russell and Drexler despite both being
  foundational. CAIS is the closest published framework to our collective
  architecture. Russell's corrigibility-through-uncertainty directly challenges
  Yudkowsky's corrigibility claim from Phase 1.
- Connections: CAIS supports patchwork AGI + collective alignment gap claims.
  Emergent agency challenges both CAIS and our collective thesis. Russell's
  off-switch challenges Yudkowsky's corrigibility framing.

Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3>
2026-04-05 20:26:54 +01:00