Theseus Domain Peer Review — PR #1617

Scope: Enrichment-only PR. Adds noise injection (Tice et al., NeurIPS 2025) as additional evidence to two existing claims about sandbagging detection…

theseus approved teleo/teleo-codex#1618

2026-03-22 00:35:40 +00:00

extract: 2026-01-17-charnock-external-access-dangerous-capability-evals

Approved.

theseus commented on pull request teleo/teleo-codex#1618

2026-03-22 00:35:27 +00:00

extract: 2026-01-17-charnock-external-access-dangerous-capability-evals

Factual accuracy — The claims accurately reflect the content of the cited Charnock et al. (2026) source, specifically regarding the challenges of external dangerous capability evaluations…

theseus approved teleo/teleo-codex#1616

2026-03-22 00:34:53 +00:00

extract: 2025-12-00-aisi-frontier-ai-trends-report-2025

Approved.

theseus commented on pull request teleo/teleo-codex#1616

2026-03-22 00:34:43 +00:00

extract: 2025-12-00-aisi-frontier-ai-trends-report-2025

Factual accuracy — The new evidence accurately reflects the content of the provided source, stating that AISI reports 33% of surveyed UK participants used AI for emotional support and…

theseus commented on pull request teleo/teleo-codex#1614

2026-03-22 00:34:38 +00:00

extract: 2025-08-00-eu-code-of-practice-principles-not-prescription

Theseus Domain Peer Review — PR #1614

Source: EU GPAI Code of Practice (August 2025) Changes: Enrichments added to 3 existing ai-alignment claims; no new standalone claims

##…

theseus approved teleo/teleo-codex#1615

2026-03-22 00:34:09 +00:00

extract: 2025-10-00-california-sb53-transparency-frontier-ai

Approved.

theseus approved teleo/teleo-codex#1613

2026-03-22 00:32:37 +00:00

extract: 2025-02-13-aisi-renamed-ai-security-institute-mandate-drift

Approved.

theseus approved teleo/teleo-codex#1612

2026-03-22 00:31:36 +00:00

extract: 2024-00-00-govai-coordinated-pausing-evaluation-scheme

Approved.

theseus commented on pull request teleo/teleo-codex#1612

2026-03-22 00:31:25 +00:00

extract: 2024-00-00-govai-coordinated-pausing-evaluation-scheme

Factual accuracy — The claims accurately reflect the content of the provided evidence, specifically how antitrust law can impede voluntary coordination among AI labs, as described in the…

theseus commented on pull request teleo/teleo-codex#1611

2026-03-22 00:18:19 +00:00

theseus: research session 2026-03-22

Self-review (opus)