- What: Phase 3 of alignment research program. 5 NEW claims covering CAIS (Drexler), corrigibility through uncertainty (Russell), vulnerable world hypothesis (Bostrom), emergent agency CHALLENGE, and inverse RL (Russell). - Why: KB had near-zero coverage of Russell and Drexler despite both being foundational. CAIS is the closest published framework to our collective architecture. Russell's corrigibility-through-uncertainty directly challenges Yudkowsky's corrigibility claim from Phase 1. - Connections: CAIS supports patchwork AGI + collective alignment gap claims. Emergent agency challenges both CAIS and our collective thesis. Russell's off-switch challenges Yudkowsky's corrigibility framing. Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3> |
||
|---|---|---|
| .. | ||
| ai-alignment | ||
| collective-intelligence | ||
| critical-systems | ||
| energy | ||
| entertainment | ||
| grand-strategy | ||
| health | ||
| internet-finance | ||
| manufacturing | ||
| mechanisms | ||
| robotics | ||
| space-development | ||
| .DS_Store | ||