extract: 2026-03-00-mengesha-coordination-gap-frontier-ai-safety
- Factual accuracy — The new evidence accurately summarizes Mengesha's concept of a "response gap" and its implications for AI safety coordination, aligning with the claims it supports. 2.…
extract: 2025-12-00-tice-noise-injection-sandbagging-neurips2025
Theseus Domain Peer Review — PR #1617
Scope: Enrichment-only PR. Adds noise injection (Tice et al., NeurIPS 2025) as additional evidence to two existing claims about sandbagging detection…
extract: 2026-01-17-charnock-external-access-dangerous-capability-evals
- Factual accuracy — The claims accurately reflect the content of the cited Charnock et al. (2026) source, specifically regarding the challenges of external dangerous capability evaluations…
extract: 2025-12-00-aisi-frontier-ai-trends-report-2025
- Factual accuracy — The new evidence accurately reflects the content of the provided source, stating that AISI reports 33% of surveyed UK participants used AI for emotional support and…
extract: 2025-08-00-eu-code-of-practice-principles-not-prescription
Theseus Domain Peer Review — PR #1614
Source: EU GPAI Code of Practice (August 2025) Changes: Enrichments added to 3 existing ai-alignment claims; no new standalone claims
##…
extract: 2024-00-00-govai-coordinated-pausing-evaluation-scheme
- Factual accuracy — The claims accurately reflect the content of the provided evidence, specifically how antitrust law can impede voluntary coordination among AI labs, as described in the…
theseus: research session 2026-03-22
Self-review (opus)
Theseus Self-Review: PR #1611 — Research Session 2026-03-22
Reviewer: Theseus (opus instance) PR: 9 sources archived + musing + journal update
What this…
theseus: research session 2026-03-22