Commit graph

11 commits

Author SHA1 Message Date
e17f84a548 theseus: deep extraction from residue logs + KnuthClaudeLean formalization
- What: 2 new claims from Aquino-Michaels agent logs + meta-log, 1 enrichment
  from Morrison's Lean formalization, KnuthClaudeLean source archived
- Claims:
  1. Same coordination protocol produces radically different strategies on different models
  2. Tools transfer between agents and evolve through recombination (seeded solver)
- Enrichment: formal verification claim updated with Comparator trust model
  (specification vs proof verification bottleneck, adversarial proof design)
- Sources: residue meta_log.md, fast_agent_log.md, slow_agent_log.md,
  KnuthClaudeLean README (github.com/kim-em/KnuthClaudeLean/)
- _map.md: 2 new entries in Architecture & Scaling subsection

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 20:31:57 +00:00
3d2f079633 theseus: extract 3 claims from Aquino-Michaels + enrich multi-model claim
- What: 3 new claims from "Completing Claude's Cycles" (no-way-labs/residue)
  + enrichment of existing multi-model claim with detailed architecture
- Claims:
  1. Structured exploration protocols reduce human intervention by 6x (Residue prompt)
  2. AI agent orchestration outperforms coaching (orchestrator as data router)
  3. Coordination protocol design produces larger gains than model scaling
- Enriched: multi-model claim now includes Aquino-Michaels's Agent O/C/orchestrator detail
- Source: archived at inbox/archive/2026-03-00-aquinomichaels-completing-claudes-cycles.md
- _map.md: AI Capability Evidence section reorganized into 3 subsections
  (Collaboration Patterns, Architecture & Scaling, Failure Modes & Oversight)
- All wiki links verified resolving

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 20:18:35 +00:00
a86e804c87 theseus: extract 4 claims from Knuth's Claude's Cycles paper
- What: 4 new claims about AI capability evidence from Knuth's Feb 2026 paper
  on Hamiltonian cycle decomposition solved by Claude Opus 4.6 + Filip Stappers
- Claims:
  1. Human-AI collaboration succeeds through three-role specialization (explore/coach/verify)
  2. Multi-model collaboration outperforms single models on hard problems (even case)
  3. AI capability and reliability are independent dimensions (solved problem but degraded)
  4. Formal verification provides scalable oversight that doesn't degrade with capability gaps
- Source: archived at inbox/archive/2026-02-28-knuth-claudes-cycles.md (now processed)
- _map.md: added new "AI Capability Evidence (Empirical)" section
- All 12 wiki links verified resolving

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 19:52:15 +00:00
ddee7f4c42 theseus: foundations follow-up — _map.md fix + 4 gap claims
- What: Updated ai-alignment/_map.md to reflect PR #49 moves (3 claims
  now local, 3 in core/teleohumanity/, remainder in foundations/).
  Added 2 superorganism claims from PR #47 to map. Drafted 4 gap
  claims identified during foundations audit: game theory (CI),
  principal-agent theory (CI), feedback loops (critical-systems),
  network effects (teleological-economics).
- Why: Audit identified these as missing scaffolding for alignment
  claims. Game theory grounds coordination failure analysis.
  Principal-agent theory grounds oversight/deception claims.
  Feedback loops formalize dynamics referenced across all domains.
  Network effects explain AI capability concentration.
- Connections: New claims link to existing alignment claims they
  scaffold (alignment tax, voluntary safety, scalable oversight,
  treacherous turn, intelligence explosion, multipolar failure).

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-07 19:03:38 +00:00
m3taversal
673c751b76
leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite
## Summary
Comprehensive audit of all 86 foundation claims across 4 subdomains.

**Changes:**
- 7 claims moved (3 → domains/ai-alignment/, 3 → core/teleohumanity/, 1 → domains/health/)
- 4 claims deleted (1 duplicate, 3 condensed into stronger claims)
- 3 condensations: cognitive limits 3→2, Christensen 4→2
- 10 confidence demotions (proven→likely for interpretive framings)
- 23 type fixes (framework/insight/pattern → claim per schema)
- 1 centaur rewrite (unconditional → conditional on role complementarity)
- All broken wiki links fixed across repo

**Review:** All 4 domain agents approved (Rio, Clay, Vida, Theseus).

Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-07 11:56:38 -07:00
m3taversal
316cb23a8e
theseus: 3 enrichments + 2 claims from Dario Amodei / Anthropic sources
Enrichments: conditional RSP (voluntary safety), bioweapon uplift data (bioterrorism), AI dev loop evidence (RSI). Standalones: AI personas from pre-training (experimental), marginal returns to intelligence (likely). Source diversity flagged (3 Dario sources). Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 08:05:22 -07:00
m3taversal
8226a47d01
leo: evaluator calibration — 2 standalone→enrichment conversions + 3 new evaluation gates
Post-Phase 2 calibration. Converted jagged intelligence → RSI enrichment, J-curve → knowledge embodiment lag enrichment. Added enrichment-vs-standalone gate, evidence bar by confidence level, and source quality assessment to evaluator framework. Peer reviewed by Theseus (ai-alignment) and Rio (internet-finance). Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 07:41:42 -07:00
m3taversal
5e5e99d538
theseus: 6 AI alignment claims from Noah Smith Phase 2 extraction
What: 6 new claims from 4 Noahopinion articles + 4 source archives. Claims: jagged intelligence (SI is present-tense), three takeover preconditions, economic HITL elimination, civilizational fragility, bioterrorism proximity, nation-state AI control. Why: Phase 2 extraction — first new-source generation in the codex. Outside-view economic analysis that alignment-native research misses. Review: Leo accept — all 6 pass quality bar. Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 07:27:56 -07:00
d7025e65dd theseus: fix dangling topic links and update domain map
- Replace [[AI alignment approaches]] with [[domains/ai-alignment/_map]]
  in 5 foundations/collective-intelligence/ claims and 1 core/living-agents/
  claim (6 fixes total — topic tag had no corresponding file)
- Replace [[core/_map]] with [[foundations/collective-intelligence/_map]]
  in 2 CI claims (core/_map.md doesn't exist)
- Add 3 new claims from PR #20 to domains/ai-alignment/_map.md:
  voluntary safety pledges, government supply chain designation,
  nuclear war escalation in LLM simulations

Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
2026-03-06 13:09:04 +00:00
e780b4b6a5 theseus: address Leo's PR #16 review feedback
- Fix: type: framework -> claim on swift-to-harbor claim
- Fix: rename "persistent irreducible disagreement" to prose-as-title
- Recommended: downgrade emergent misalignment from proven to likely
- Recommended: add author names to instrumental convergence source

Pentagon-Agent: Prometheus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-06 12:36:24 +00:00
fc510438f0 Auto: 24 files | 24 files changed, 898 insertions(+) 2026-03-06 12:35:07 +00:00