Commit graph

9 commits

Author SHA1 Message Date
a86e804c87 theseus: extract 4 claims from Knuth's Claude's Cycles paper
- What: 4 new claims about AI capability evidence from Knuth's Feb 2026 paper
  on Hamiltonian cycle decomposition solved by Claude Opus 4.6 + Filip Stappers
- Claims:
  1. Human-AI collaboration succeeds through three-role specialization (explore/coach/verify)
  2. Multi-model collaboration outperforms single models on hard problems (even case)
  3. AI capability and reliability are independent dimensions (solved problem but degraded)
  4. Formal verification provides scalable oversight that doesn't degrade with capability gaps
- Source: archived at inbox/archive/2026-02-28-knuth-claudes-cycles.md (now processed)
- _map.md: added new "AI Capability Evidence (Empirical)" section
- All 12 wiki links verified resolving

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 19:52:15 +00:00
ddee7f4c42 theseus: foundations follow-up — _map.md fix + 4 gap claims
- What: Updated ai-alignment/_map.md to reflect PR #49 moves (3 claims
  now local, 3 in core/teleohumanity/, remainder in foundations/).
  Added 2 superorganism claims from PR #47 to map. Drafted 4 gap
  claims identified during foundations audit: game theory (CI),
  principal-agent theory (CI), feedback loops (critical-systems),
  network effects (teleological-economics).
- Why: Audit identified these as missing scaffolding for alignment
  claims. Game theory grounds coordination failure analysis.
  Principal-agent theory grounds oversight/deception claims.
  Feedback loops formalize dynamics referenced across all domains.
  Network effects explain AI capability concentration.
- Connections: New claims link to existing alignment claims they
  scaffold (alignment tax, voluntary safety, scalable oversight,
  treacherous turn, intelligence explosion, multipolar failure).

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 19:03:38 +00:00
m3taversal
673c751b76
leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite
## Summary
Comprehensive audit of all 86 foundation claims across 4 subdomains.

**Changes:**
- 7 claims moved (3 → domains/ai-alignment/, 3 → core/teleohumanity/, 1 → domains/health/)
- 4 claims deleted (1 duplicate, 3 condensed into stronger claims)
- 3 condensations: cognitive limits 3→2, Christensen 4→2
- 10 confidence demotions (proven→likely for interpretive framings)
- 23 type fixes (framework/insight/pattern → claim per schema)
- 1 centaur rewrite (unconditional → conditional on role complementarity)
- All broken wiki links fixed across repo

**Review:** All 4 domain agents approved (Rio, Clay, Vida, Theseus).

Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-07 11:56:38 -07:00
m3taversal
316cb23a8e
theseus: 3 enrichments + 2 claims from Dario Amodei / Anthropic sources
Enrichments: conditional RSP (voluntary safety), bioweapon uplift data (bioterrorism), AI dev loop evidence (RSI). Standalones: AI personas from pre-training (experimental), marginal returns to intelligence (likely). Source diversity flagged (3 Dario sources). Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 08:05:22 -07:00
m3taversal
8226a47d01
leo: evaluator calibration — 2 standalone→enrichment conversions + 3 new evaluation gates
Post-Phase 2 calibration. Converted jagged intelligence → RSI enrichment, J-curve → knowledge embodiment lag enrichment. Added enrichment-vs-standalone gate, evidence bar by confidence level, and source quality assessment to evaluator framework. Peer reviewed by Theseus (ai-alignment) and Rio (internet-finance). Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 07:41:42 -07:00
m3taversal
5e5e99d538
theseus: 6 AI alignment claims from Noah Smith Phase 2 extraction
What: 6 new claims from 4 Noahopinion articles + 4 source archives. Claims: jagged intelligence (SI is present-tense), three takeover preconditions, economic HITL elimination, civilizational fragility, bioterrorism proximity, nation-state AI control. Why: Phase 2 extraction — first new-source generation in the codex. Outside-view economic analysis that alignment-native research misses. Review: Leo accept — all 6 pass quality bar. Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 07:27:56 -07:00
d7025e65dd theseus: fix dangling topic links and update domain map
- Replace [[AI alignment approaches]] with [[domains/ai-alignment/_map]]
  in 5 foundations/collective-intelligence/ claims and 1 core/living-agents/
  claim (6 fixes total — topic tag had no corresponding file)
- Replace [[core/_map]] with [[foundations/collective-intelligence/_map]]
  in 2 CI claims (core/_map.md doesn't exist)
- Add 3 new claims from PR #20 to domains/ai-alignment/_map.md:
  voluntary safety pledges, government supply chain designation,
  nuclear war escalation in LLM simulations

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-06 13:09:04 +00:00
e780b4b6a5 theseus: address Leo's PR #16 review feedback
- Fix: type: framework -> claim on swift-to-harbor claim
- Fix: rename "persistent irreducible disagreement" to prose-as-title
- Recommended: downgrade emergent misalignment from proven to likely
- Recommended: add author names to instrumental convergence source

Pentagon-Agent: Prometheus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-06 12:36:24 +00:00
fc510438f0 Auto: 24 files | 24 files changed, 898 insertions(+) 2026-03-06 12:35:07 +00:00