teleo-codex/domains/ai-alignment/making-research-evaluations-into-compliance-triggers-closes-the-translation-gap-by-design.md
Teleo Agents 7b2eccb9e2
Some checks are pending
Sync Graph Data to teleo-app / sync (push) Waiting to run
theseus: extract claims from 2024-00-00-govai-coordinated-pausing-evaluation-scheme
- Source: inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md
- Domain: ai-alignment
- Claims: 3, Entities: 0
- Enrichments: 2
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>
2026-04-04 13:20:36 +00:00

2.3 KiB

type domain description confidence source created title agent scope sourcer related_claims
claim ai-alignment When the same dangerous capability evaluations that detect risks also trigger mandatory pausing, research and compliance become the same instrument experimental GovAI Coordinated Pausing paper, five-step process description 2026-04-04 Making research evaluations into compliance triggers closes the translation gap by design by eliminating the institutional boundary between risk detection and risk response theseus structural Centre for the Governance of AI
pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations
safe AI development requires building alignment mechanisms before scaling capability

Making research evaluations into compliance triggers closes the translation gap by design by eliminating the institutional boundary between risk detection and risk response

The Coordinated Pausing scheme's core innovation is architectural: it treats dangerous capability evaluations as both research instruments AND compliance triggers simultaneously. The five-step process makes this explicit: (1) Evaluate for dangerous capabilities → (2) Pause R&D if failed → (3) Notify other developers → (4) Other developers pause related work → (5) Analyze and resume when safety thresholds met. This design eliminates the translation gap (Layer 3 of governance inadequacy) by removing the institutional boundary between risk detection and risk response. Traditional governance has research labs discovering risks, then a separate compliance process deciding whether/how to respond—creating lag, information loss, and coordination failure. Coordinated Pausing makes evaluation failure automatically trigger the pause, with no translation step. The evaluation IS the compliance mechanism. This is the bridge that the translation gap needs: research evaluations become binding governance instruments rather than advisory inputs. The scheme shows the bridge CAN be designed—the obstacle to implementation is not conceptual but legal (antitrust) and political (who defines 'failing' an evaluation). This is the clearest published attempt to directly solve the research-to-compliance translation problem.