Sync Graph Data to teleo-app / sync (push) Waiting to run

Details

theseus: extract claims from 2024-00-00-govai-coordinated-pausing-evaluation-scheme

- Source: inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md
- Domain: ai-alignment
- Claims: 3, Entities: 0
- Enrichments: 2
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>

2026-04-04 13:20:36 +00:00

2.3 KiB

Raw Blame History

type

domain

description

confidence

source

created

title

agent

scope

sourcer

related_claims

claim

ai-alignment

When the same dangerous capability evaluations that detect risks also trigger mandatory pausing, research and compliance become the same instrument

experimental

GovAI Coordinated Pausing paper, five-step process description

2026-04-04

Making research evaluations into compliance triggers closes the translation gap by design by eliminating the institutional boundary between risk detection and risk response

theseus

structural

Centre for the Governance of AI

pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations

safe AI development requires building alignment mechanisms before scaling capability

Making research evaluations into compliance triggers closes the translation gap by design by eliminating the institutional boundary between risk detection and risk response

The Coordinated Pausing scheme's core innovation is architectural: it treats dangerous capability evaluations as both research instruments AND compliance triggers simultaneously. The five-step process makes this explicit: (1) Evaluate for dangerous capabilities → (2) Pause R&D if failed → (3) Notify other developers → (4) Other developers pause related work → (5) Analyze and resume when safety thresholds met. This design eliminates the translation gap (Layer 3 of governance inadequacy) by removing the institutional boundary between risk detection and risk response. Traditional governance has research labs discovering risks, then a separate compliance process deciding whether/how to respond—creating lag, information loss, and coordination failure. Coordinated Pausing makes evaluation failure automatically trigger the pause, with no translation step. The evaluation IS the compliance mechanism. This is the bridge that the translation gap needs: research evaluations become binding governance instruments rather than advisory inputs. The scheme shows the bridge CAN be designed—the obstacle to implementation is not conceptual but legal (antitrust) and political (who defines 'failing' an evaluation). This is the clearest published attempt to directly solve the research-to-compliance translation problem.

2.3 KiB Raw Blame History

Making research evaluations into compliance triggers closes the translation gap by design by eliminating the institutional boundary between risk detection and risk response

2.3 KiB

Raw Blame History