1. Fix broken wiki link: replace the link to the non-existent claim "AI
research agents cannot recognize confounded experimental results" with
the existing claim "AI capability and reliability are independent
dimensions"
2. Fix stale cascade dependencies: update Belief 2 detail file to reference
current beliefs (B3, B5) instead of removed beliefs
3. Fix universal quantifier: "the only path" → "the most promising path"
with acknowledgment of hybrid architectures
4. Document removed beliefs: "Monolithic alignment" subsumed into B2+B5,
"knowledge commons" demoted to claim-level, "simplicity first" relocated
to reasoning.md
5. Decouple identity.md from beliefs: replace inline belief list with
reference to beliefs.md + structural description
6. Fix research-session.sh step numbering: renumber Steps 5-8 → 6-9 to
resolve collision with new Step 5 (Pick ONE Research Question)
Pentagon-Agent: Theseus <B4A5B354-03D6-4291-A6A8-1E04A879D9AC>
Belief framework restructured from 6 correlated observations to 5
independent axes, flowing urgency → diagnosis → architecture → mechanism → solution:
1. AI alignment is the greatest outstanding problem for humanity (NEW - existential premise)
2. Alignment is a coordination problem, not a technical problem (was B1, now diagnostic)
3. Alignment must be continuous, not a specification problem (was implicit, now explicit)
4. Verification degrades faster than capability grows (NEW - structural mechanism)
5. Collective superintelligence is the only path preserving human agency (was B3)
Removed: "simplicity first" moved to reasoning.md (working principle, not domain belief).
Removed: "race to the bottom" and "knowledge commons degradation" (consequences, not
independent beliefs — now grounding evidence for beliefs 1 and 2).
Also: added disconfirmation step to ops/research-session.sh requiring agents to
identify their keystone belief and seek counter-evidence each research session.
Pentagon-Agent: Theseus <25B96405-E50F-45ED-9C92-D8046DFAAD00>
- What: Centralized queue for outstanding items (renames, audits, fixes, docs)
- Why: Agent task boards are siloed in Pentagon. Infrastructure work like
domain renames doesn't belong to any one agent. This makes the backlog
visible and claimable by anyone, with everything still going through eval.
- Seeded with 8 known items from current backlog
Pentagon-Agent: Leo <14FF9C29-CABF-40C8-8808-B0B495D03FF8>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add foundations/ to always-allowed territory paths so domain agents can propose foundation claims
- Add Astra/space-development to domain routing map
- Fix double check_merge_eligible call by capturing exit code
- Update Leo prompt from 8 to 11 quality criteria (adds scope, universals, counter-evidence)
- Add auto-merge capability with territory violation checks
- Add --no-merge flag for review-only mode
- Widen domain agent verdict parsing to catch various comment formats
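
For reviewers, a minimal sketch of the exit-code capture and the review-only
flag described above. This is an illustration under assumptions, not the
script's actual code: the body of check_merge_eligible, the NO_MERGE, CALLS,
and DECISION variables, and the decide helper are all hypothetical stand-ins.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in: the real check_merge_eligible is not shown in this
# log. This version counts its invocations and succeeds only for "approved".
CALLS=0
check_merge_eligible() {
  CALLS=$((CALLS + 1))
  [ "$1" = "approved" ]
}

# Assumed to be set to true by a --no-merge flag (review-only mode).
NO_MERGE=false

# decide VERDICT: sets DECISION to "merge", "skip", or "review-only".
decide() {
  local verdict="$1"
  local rc=0
  # Run the check once and keep its exit code, rather than calling the
  # function a second time just to test the result again.
  check_merge_eligible "$verdict" || rc=$?
  if [ "$NO_MERGE" = true ]; then
    DECISION="review-only"
  elif [ "$rc" -eq 0 ]; then
    DECISION="merge"
  else
    DECISION="skip"
  fi
}

decide approved
echo "decision=$DECISION calls=$CALLS"
```

The `|| rc=$?` form also keeps the script safe under `set -e`, since a
non-zero exit from the check is handled instead of aborting the run.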
Pentagon-Agent: Leo <B9E87C91-8D2A-42C0-AA43-4874B1A67642>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Cory confirmed the AI alignment agent is Theseus. Reverted all
Logos references back to Theseus in the skill file, evaluate-trigger,
and Alex brief.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- CONTRIBUTING.md: step-by-step guide for external contributors
- docs/ALEX_BRIEF.md: onboarding brief for Alex — AI alignment domain
- ops/evaluate-trigger.sh: headless Leo evaluation trigger (Ganymede reviewed)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>