Phase 2 of 5-phase AI alignment research program. Christiano's prosaic
alignment counter-position to Yudkowsky. Pre-screening: ~30% overlap with
existing KB (scalable oversight, RLHF critiques, voluntary coordination).
NEW claims:
1. Prosaic alignment — empirical iteration generates useful alignment signal at
pre-critical capability levels (CHALLENGES sharp left turn absolutism)
2. Verification easier than generation — holds at current scale, narrows as
capability gaps widen, creating a time-limited alignment window (IN TENSION
with Yudkowsky's verification asymmetry)
3. ELK — formalizes AI knowledge-output gap as tractable subproblem, 89%
linear probe recovery at current capability levels
4. IDA — recursive human+AI amplification preserves alignment through
distillation iterations but compounding errors make guarantee probabilistic
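Claim 3's linear-probe result can be illustrated with a toy sketch: assume a latent binary "fact" is linearly encoded (plus noise) in a model's hidden activations, and check that a simple logistic-regression probe recovers it. The dimensions, noise level, and data here are invented for illustration — this is not ARC's setup and does not reproduce the 89% figure.

```python
# Toy linear-probe sketch: a hidden binary label is linearly encoded in
# synthetic "activations"; a logistic-regression probe recovers it.
# All numbers here are illustrative assumptions, not ARC's experiment.
import math
import random

random.seed(0)
DIM = 8

# Hypothetical "truth direction": activation = (+/-1 per label)*w_true + noise.
w_true = [random.gauss(0, 1) for _ in range(DIM)]

def make_example():
    label = random.choice([0, 1])
    sign = 1.0 if label == 1 else -1.0
    x = [sign * wt + random.gauss(0, 0.5) for wt in w_true]
    return x, label

train = [make_example() for _ in range(400)]
test = [make_example() for _ in range(200)]

# Linear probe = logistic regression trained with plain SGD.
w = [0.0] * DIM
b = 0.0
lr = 0.1
for _ in range(20):
    for x, y in train:
        z = sum(wi * xi for wi, xi in zip(w, x)) + b
        p = 1.0 / (1.0 + math.exp(-max(-30.0, min(30.0, z))))
        g = p - y  # gradient of log-loss w.r.t. z
        w = [wi - lr * g * xi for wi, xi in zip(w, x)]
        b -= lr * g

correct = sum(
    (sum(wi * xi for wi, xi in zip(w, x)) + b > 0) == (y == 1)
    for x, y in test
)
accuracy = correct / len(test)
print(f"probe accuracy: {accuracy:.2f}")
```

The point of the sketch: when the knowledge-output gap reduces to a linear direction in activation space, recovery is cheap — the open ELK question is whether that holds past current capability levels.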
ENRICHMENT:
- Scalable oversight claim: added Christiano's debate theory (PSPACE
amplification with poly-time judges) as theoretical basis that empirical
data challenges
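The PSPACE-amplification intuition behind debate can be sketched minimally: two debaters who disagree about a long computation bisect their disagreement until a polynomial-time judge can settle it by checking a single step. The "computation" below is just a running sum; the function names and setup are illustrative assumptions, not the protocol of arXiv:1805.00899.

```python
# Toy debate sketch: the judge inspects O(log n) claimed values plus ONE
# element of the underlying computation, yet the honest debater wins.
# Setup and names are hypothetical illustrations of the complexity argument.
def first_divergence(a, b):
    """Binary-search for the step where two claimed transcripts diverge.
    Invariant: transcripts agree at index lo and disagree at index hi."""
    lo, hi = 0, len(a) - 1
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if a[mid] == b[mid]:
            lo = mid
        else:
            hi = mid
    return lo, hi

def run_debate(xs, honest, dishonest):
    """Judge's ruling after the debaters narrow the disagreement."""
    lo, hi = first_divergence(honest, dishonest)
    # Both debaters agree on the value at lo; the judge re-executes the
    # single disputed step xs[lo] and rules for whoever is consistent.
    expected = honest[lo] + xs[lo]  # honest[lo] == dishonest[lo] here
    return "honest" if honest[hi] == expected else "dishonest"

xs = list(range(1, 101))
honest = [0]
for v in xs:
    honest.append(honest[-1] + v)            # true prefix sums, 0..5050
dishonest = honest[:60] + [v + 7 for v in honest[60:]]  # lie from step 60 on
print(run_debate(xs, honest, dishonest))
```

This is the theoretical basis the enrichment note flags: a cheap judge amplified to verify long computations, which the cited empirical data then challenges at scale.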
Sources: Paul Christiano, Alignment Forum (2016-2022), arXiv:1805.00899,
arXiv:1706.03741, ARC ELK report (2021), Yudkowsky-Christiano takeoff debate
Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3>