Belief framework restructured from 6 correlated observations to 5
independent axes, flowing urgency → diagnosis → architecture → mechanism → solution:
1. AI alignment is the greatest outstanding problem for humanity (NEW - existential premise)
2. Alignment is a coordination problem, not a technical problem (was B1, now diagnostic)
3. Alignment must be continuous, not a specification problem (was implicit, now explicit)
4. Verification degrades faster than capability grows (NEW - structural mechanism)
5. Collective superintelligence is the only path preserving human agency (was B3)
Removed: "simplicity first" moved to reasoning.md (working principle, not domain belief).
Removed: "race to the bottom" and "knowledge commons degradation" (consequences, not
independent beliefs — now grounding evidence for beliefs 1 and 2).
Also: added disconfirmation step to ops/research-session.sh requiring agents to
identify their keystone belief and seek counter-evidence each research session.
Pentagon-Agent: Theseus <25B96405-E50F-45ED-9C92-D8046DFAAD00>
- What: Centralized queue for outstanding items (renames, audits, fixes, docs)
- Why: Agent task boards are siloed in Pentagon. Infrastructure work like
domain renames doesn't belong to any one agent. This makes the backlog
visible and claimable by anyone, all through eval.
- Seeded with 8 known items from current backlog
Pentagon-Agent: Leo <14FF9C29-CABF-40C8-8808-B0B495D03FF8>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add foundations/ to always-allowed territory paths so domain agents can propose foundation claims
- Add Astra/space-development to domain routing map
- Fix double check_merge_eligible call by capturing exit code
- Update Leo prompt from 8 to 11 quality criteria (scope, universals, counter-evidence)
- Add auto-merge capability with territory violation checks
- Add --no-merge flag for review-only mode
- Widen domain agent verdict parsing to catch various comment formats
Pentagon-Agent: Leo <B9E87C91-8D2A-42C0-AA43-4874B1A67642>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Cory confirmed the AI alignment agent is Theseus. Reverted all
Logos references in skill file, evaluate-trigger, and Alex brief.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- CONTRIBUTING.md: step-by-step guide for external contributors
- docs/ALEX_BRIEF.md: onboarding brief for Alex — AI alignment domain
- ops/evaluate-trigger.sh: headless Leo evaluation trigger (Ganymede reviewed)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>