teleo-codex/domains
m3taversal 08dea4249f theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research
Phase 2 of 5-phase AI alignment research program. Christiano's prosaic
alignment counter-position to Yudkowsky. Pre-screening: ~30% overlap with
existing KB (scalable oversight, RLHF critiques, voluntary coordination).

NEW claims:
1. Prosaic alignment — empirical iteration generates useful alignment signal at
   pre-critical capability levels (CHALLENGES sharp left turn absolutism)
2. Verification easier than generation — holds at current scale, narrows with
   capability gaps, creating time-limited alignment window (TENSIONS with
   Yudkowsky's verification asymmetry)
3. ELK — formalizes AI knowledge-output gap as tractable subproblem, 89%
   linear probe recovery at current capability levels
4. IDA — recursive human+AI amplification preserves alignment through
   distillation iterations but compounding errors make guarantee probabilistic

ENRICHMENT:
- Scalable oversight claim: added Christiano's debate theory (PSPACE
  amplification with poly-time judges) as theoretical basis that empirical
  data challenges

Source: Paul Christiano, Alignment Forum (2016-2022), arXiv:1805.00899,
arXiv:1706.03741, ARC ELK report (2021), Yudkowsky-Christiano takeoff debate

Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3>
2026-04-05 20:16:59 +01:00
..
ai-alignment theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research 2026-04-05 20:16:59 +01:00
collective-intelligence reweave: connect 13 orphan claims via vector similarity 2026-04-04 12:52:43 +00:00
critical-systems extract: 2018-03-00-ramstead-answering-schrodingers-question 2026-03-15 15:54:12 +00:00
energy reweave: merge 52 files via frontmatter union [auto] 2026-04-05 17:31:30 +00:00
entertainment clay: extract claims from 2025-11-01-scp-wiki-governance-collaborative-worldbuilding-scale 2026-04-04 13:34:19 +00:00
grand-strategy rio: extract 4 NEW claims + 4 enrichments from AI agents/memory/harness research batch 2026-04-05 19:39:04 +01:00
health reweave: merge 52 files via frontmatter union [auto] 2026-04-05 17:31:30 +00:00
internet-finance rio: rewrite oversubscription claim — capital cycling not governance validation 2026-04-05 19:51:01 +01:00
manufacturing reweave: connect 13 orphan claims via vector similarity 2026-04-04 12:52:43 +00:00
mechanisms auto-fix: strip 11 broken wiki links 2026-03-27 17:44:31 +00:00
robotics astra: add 5 robotics founding claims — humanoid economics, automation plateau, manipulation gap, co-development loop, labor cost threshold sequence 2026-04-03 20:25:53 +00:00
space-development reweave: merge 52 files via frontmatter union [auto] 2026-04-05 17:31:30 +00:00
.DS_Store Initial commit: Teleo Codex v1 2026-03-05 20:30:34 +00:00