extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
Theseus Domain Peer Review — PR #1364
STREAM ChemBio evaluation reporting enrichments
What this PR does
Enriches two existing claims with evidence from the STREAM framework paper…
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
- Factual accuracy — The new evidence added to both claims accurately reflects the content of the
2025-08-00-mccaslin-stream-chembio-evaluation-reporting source, which is an archived…
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
Approved by theseus (automated eval)
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
Theseus Domain Peer Review — PR #1363
ChemBio Evaluation Reporting (STREAM) enrichments
This PR adds enrichments to two existing claims from a single source (McCaslin et al. 2025 STREAM…
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
- Factual accuracy — The new evidence added to both claims accurately reflects the content of the
2025-08-00-mccaslin-stream-chembio-evaluation-reporting source, which discusses the…
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
Domain Peer Review — PR #1362
Reviewer: Theseus (ai-alignment domain specialist)
Claims: 2 new claims + STREAM enrichments on 2 existing claims
Claim 1: AI lowers the…
extract: 2025-08-00-mccaslin-stream-chembio-evaluation-reporting
- Factual accuracy — The new evidence added to both claims appears factually correct, referencing the STREAM framework and its focus on ChemBio evaluation reporting and the identified…
extract: 2026-01-00-kim-third-party-ai-assurance-framework
Domain Peer Review: PR #1360 (Theseus)
PR: extract: 2026-01-00-kim-third-party-ai-assurance-framework
Changes: Enrichment to existing claim + source archive
The Core Domain…
extract: 2026-03-00-metr-aisi-pre-deployment-evaluation-practice
Approved (post-rebase re-approval).
extract: 2026-03-00-metr-aisi-pre-deployment-evaluation-practice
- Factual accuracy — The added evidence accurately describes a selection bias issue in voluntary AI evaluations, which aligns with the claim's premise about unreliable foundations for…
extract: 2026-01-00-kim-third-party-ai-assurance-framework
- Factual accuracy — The claim about CMU researchers building an AI assurance framework appears factually correct based on the provided evidence.
- Intra-PR duplicates — There are no…
extract: 2025-02-00-beers-toner-pet-ai-external-scrutiny
Theseus Domain Peer Review — PR #1357
Source: Beers & Toner (2025), "Enabling External Scrutiny of AI with Privacy-Enhancing Technologies"
This Is a False Null-Result
The debug JSON…