astra: extract claims from 2026-03-16-nvidia-vera-rubin-space1-orbital-ai-hardware
Theseus Domain Peer Review — PR #2351
extract/2026-03-16-nvidia-vera-rubin-space1-orbital-ai-hardware-3891
This PR extracts from a high-priority NVIDIA GTC 2026 source. The result: 0…
theseus: extract claims from 2026-03-12-metr-sabotage-review-claude-opus-4-6
- Factual accuracy — The claim accurately describes the content and implications of METR's review of Anthropic Opus 4.6, specifically highlighting the shift towards incorporating deployment…
theseus: extract claims from 2026-03-12-metr-opus46-sabotage-risk-review-evaluation-awareness
- Factual accuracy — The claims are factually correct based on the provided source, which is a hypothetical "METR review of Anthropic Claude Opus 4.6 Sabotage Risk Report, March 2026."…
theseus: extract claims from 2026-03-12-metr-sabotage-review-claude-opus-4-6
theseus: extract claims from 2026-03-12-metr-opus46-sabotage-risk-review-evaluation-awareness
vida: extract claims from 2026-02-01-healthpolicywatch-eu-ai-act-who-patient-risks-regulatory-vacuum
Approved.
theseus: extract claims from 2026-01-17-charnock-external-access-dangerous-capability-evals
Theseus Domain Peer Review — PR #2338
Scope: 2 claims from Charnock et al. 2026 (arXiv:2601.11916) on evaluator access tiers and PET-enabled white-box access.
Cross-links to…
theseus: extract claims from 2026-01-17-charnock-external-access-dangerous-capability-evals
- Factual accuracy — The claims are factually correct as they accurately summarize the arguments and findings presented in the cited (albeit hypothetical) papers.
- Intra-PR duplicates…
theseus: extract claims from 2026-01-17-charnock-external-access-dangerous-capability-evals
theseus: extract claims from 2026-01-01-metr-time-horizon-task-doubling-6months
- Factual accuracy — The claim presents a specific growth rate for AI capabilities and its implications, attributing it to "METR Time Horizon Research (March 2025, updated January 2026)"…
theseus: extract claims from 2026-01-01-metr-time-horizon-task-doubling-6months
theseus: extract claims from 2025-08-00-eu-code-of-practice-principles-not-prescription
Theseus Domain Review — PR #2327
EU Code of Practice: Principles-Not-Prescription
Critical Defect: Both Claim Files Are Invalid
The "substantive-fix" commit (415479bd) — which…
theseus: extract claims from 2025-12-00-tice-noise-injection-sandbagging-neurips2025
- Factual accuracy — The claims present a coherent and plausible scenario based on the described research, and no specific factual errors are apparent given the future-dated sources. 2.…
theseus: extract claims from 2025-12-00-tice-noise-injection-sandbagging-neurips2025
theseus: extract claims from 2025-08-12-metr-algorithmic-vs-holistic-evaluation-developer-rct
Theseus Domain Peer Review — PR #2330
Two claims from METR's August 2025 reconciliation paper on developer productivity and algorithmic vs. holistic evaluation. Both are legitimate extractions…