teleo-codex/inbox
m3taversal 08dea4249f theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research
Phase 2 of 5-phase AI alignment research program. Christiano's prosaic
alignment counter-position to Yudkowsky. Pre-screening: ~30% overlap with
existing KB (scalable oversight, RLHF critiques, voluntary coordination).

NEW claims:
1. Prosaic alignment — empirical iteration generates useful alignment signal at
   pre-critical capability levels (CHALLENGES sharp left turn absolutism)
2. Verification easier than generation — holds at current scale, narrows with
   capability gaps, creating a time-limited alignment window (IN TENSION with
   Yudkowsky's verification asymmetry)
3. ELK — formalizes AI knowledge-output gap as tractable subproblem, 89%
   linear probe recovery at current capability levels
4. IDA — recursive human+AI amplification preserves alignment through
   distillation iterations, but compounding errors make the guarantee probabilistic

ENRICHMENT:
- Scalable oversight claim: added Christiano's debate theory (PSPACE
  amplification with poly-time judges) as the theoretical basis that empirical
  data challenges

Source: Paul Christiano, Alignment Forum (2016-2022), arXiv:1805.00899,
arXiv:1706.03741, ARC ELK report (2021), Yudkowsky-Christiano takeoff debate

Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3>
2026-04-05 20:16:59 +01:00
archive theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research 2026-04-05 20:16:59 +01:00
claims
null-result theseus: rename futarchy claim from defenders to arbitrageurs 2026-04-04 16:17:54 +00:00
queue source: metadao-proposals-16-30.md → processed 2026-04-04 15:41:09 +00:00
Aschenbrenners Q4 2025 pivot from chips to power infrastructure demonstrates real-time attractor state refinement as the bottleneck shifted from compute to electricity.md
claynosaurz-mediawan-animated-series.md
claynosaurz-mediawan-partnership-post.md
claynosaurz-new-entertainment-playbook.md
claynosaurz-popkins-mint.md
claynotopia-worldbuilding-thread.md
creative-industries-technology-analysis.md
leopold-aschenbrenner-situational-awareness-research.md
one year of outperformance is insufficient evidence to distinguish alpha from leveraged beta because Cathie Wood Burry and Aschenbrenner all looked brilliant at the one-year mark.md
publishing investment analysis openly before raising capital inverts hedge fund secrecy and builds credibility that attracts LPs who can independently evaluate the thesis.md
shapiro-ai-use-cases-hollywood.md
shapiro-cant-just-make-hits.md
shapiro-churn-dynamics.md
shapiro-disruption-hollywood.md
shapiro-genai-creative-tool.md
shapiro-hollywood-talent-embrace-ai.md
shapiro-how-far-will-ai-video-go.md
shapiro-infinite-tv.md
shapiro-ip-as-platform.md
shapiro-power-laws-culture.md
shapiro-relentless-creator-economy.md
shapiro-scarce-when-quality-abundant.md
shapiro-social-video-eating-world.md
Situational Awareness LP converted a 165-page thesis into a 5.5 billion dollar fund in 18 months by publishing differentiated analysis before raising capital.md
the Cathie Wood failure mode shows that transparent thesis plus concentrated bets plus early outperformance is structurally identical whether the outcome is spectacular success or catastrophic failure.md