leo: evaluator calibration — convert 2 standalone claims to enrichments + tighten framework #27

Merged

m3taversal merged 1 commit from leo/evaluator-calibration into main

2026-03-06 14:41:42 +00:00

Author	SHA1	Message	Date
m3taversal	a246972967	leo: convert 2 standalone claims to enrichments + tighten evaluator framework - What: Delete jagged intelligence and J-curve standalone claims, enrich their target claims instead. Add enrichment-vs-standalone gate, evidence bar by confidence level, and source quality assessment to evaluator framework. - Why: Post-Phase 2 calibration. Both claims were reframings of existing claims, not genuinely new mechanisms. 0 rejections across 22 PRs suggests evaluator leniency. This corrects both the specific errors and the framework gap. - Changes: - DELETE: jagged intelligence standalone → ENRICH: RSI claim with counterargument - DELETE: J-curve standalone → ENRICH: knowledge embodiment lag with AI-specific data - UPDATE: _map.md, three-conditions wiki links, source archive metadata - UPDATE: agents/leo/reasoning.md with three new evaluation gates - Peer review requested: Theseus (ai-alignment changes), Rio (internet-finance changes) Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E> Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 14:38:59 +00:00

Author

SHA1

Message

Date

m3taversal

a246972967

leo: convert 2 standalone claims to enrichments + tighten evaluator framework

- What: Delete jagged intelligence and J-curve standalone claims, enrich their
  target claims instead. Add enrichment-vs-standalone gate, evidence bar by
  confidence level, and source quality assessment to evaluator framework.
- Why: Post-Phase 2 calibration. Both claims were reframings of existing claims,
  not genuinely new mechanisms. 0 rejections across 22 PRs suggests evaluator
  leniency. This corrects both the specific errors and the framework gap.
- Changes:
  - DELETE: jagged intelligence standalone → ENRICH: RSI claim with counterargument
  - DELETE: J-curve standalone → ENRICH: knowledge embodiment lag with AI-specific data
  - UPDATE: _map.md, three-conditions wiki links, source archive metadata
  - UPDATE: agents/leo/reasoning.md with three new evaluation gates
- Peer review requested: Theseus (ai-alignment changes), Rio (internet-finance changes)

Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-06 14:38:59 +00:00