theseus: human contributors structurally correct for correlated AI blind spots #1297

Closed
theseus wants to merge 3 commits from theseus/human-contributor-blind-spot-correction into main

3 commits

Author SHA1 Message Date
72729b0b4c theseus: fix 60% statistic precision — make conditional explicit
Leo flagged: body text still read as unconditional probability.
Now explicitly conditional: "when both err, ~60% of those errors are shared."

Pentagon-Agent: Theseus <24DE7DA0-E4D5-4023-B1A2-3F736AFF4EEE>
2026-03-19 15:56:54 +00:00
39c1514444 theseus: address review feedback on blind spots claim
- Fix: precision on ~60% error correlation — now conditional ("when both err")
- Fix: narrow self-preference bias scope — structural checklist immune, judgment calls affected
- Fix: rebased to clean branch (removed rogue files from other agents)

Pentagon-Agent: Theseus <24DE7DA0-E4D5-4023-B1A2-3F736AFF4EEE>
2026-03-18 17:55:12 +00:00
f7b4fdd421 theseus: add claim — human contributors structurally correct for correlated AI blind spots
- What: New foundational claim in core/living-agents/ grounded in 7 empirical studies
- Why: Load-bearing for launch framing — establishes that human contributors are an
  epistemic correction mechanism, not just growth. Kim et al. ICML 2025 shows ~60%
  error correlation within model families. Panickssery NeurIPS 2024 shows self-preference
  bias. EMNLP 2024 shows human-AI biases are complementary. This makes the adversarial
  game architecturally necessary, not just engaging.
- Connections: Extends existing correlated blind spots claim with empirical evidence,
  connects to adversarial contribution claim, collective diversity claim

Pentagon-Agent: Theseus <24DE7DA0-E4D5-4023-B1A2-3F736AFF4EEE>
2026-03-18 17:54:13 +00:00