extract: 2024-00-00-govai-coordinated-pausing-evaluation-scheme
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
This commit is contained in:
parent
4869f624f2
commit
899e063223
5 changed files with 69 additions and 1 deletions
|
|
@ -47,6 +47,12 @@ Krier provides institutional mechanism: personal AI agents enable Coasean bargai
|
|||
|
||||
---
|
||||
|
||||
### Additional Evidence (extend)
|
||||
*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22*
|
||||
|
||||
GovAI's coordinated pausing proposal provides the specific legal mechanism blocking coordination: antitrust law treats collective agreements to pause development among competing AI labs as potential cartel behavior, making voluntary coordination legally risky even when technically feasible. This is the concrete instantiation of the coordination problem—not just misaligned incentives, but legal architecture that prohibits the coordination mechanism.
|
||||
|
||||
|
||||
Relevant Notes:
|
||||
- [[the internet enabled global communication but not global cognition]] -- the coordination infrastructure gap that makes this problem unsolvable with existing tools
|
||||
- [[the alignment problem dissolves when human values are continuously woven into the system rather than specified in advance]] -- the structural solution to this coordination failure
|
||||
|
|
|
|||
|
|
@ -55,6 +55,12 @@ Third-party pre-deployment audits are the top expert consensus priority (>60% ag
|
|||
|
||||
Despite UK AISI building comprehensive control evaluation infrastructure (RepliBench, control monitoring frameworks, sandbagging detection, cyber attack scenarios), there is no evidence of regulatory adoption into EU AI Act Article 55 or other mandatory compliance frameworks. The research exists but governance does not pull it into enforceable standards, confirming that technical capability without binding requirements does not change deployment behavior.
|
||||
|
||||
### Additional Evidence (confirm)
|
||||
*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22*
|
||||
|
||||
GovAI's four-version escalation from voluntary pausing to legal mandate provides a roadmap showing why binding regulation is necessary: Versions 1-3 (voluntary mechanisms) all face antitrust obstacles that Version 4 (legal mandate) avoids. The necessity of government mandate is not just about enforcement but about legal structure—only regulatory requirements can mandate coordination without triggering competition law violations.
|
||||
|
||||
|
||||
|
||||
Relevant Notes:
|
||||
- [[voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints]] — confirmed with extensive evidence across multiple labs and governance mechanisms
|
||||
|
|
|
|||
|
|
@ -58,6 +58,12 @@ Government pressure adds to competitive dynamics. The DoD/Anthropic episode show
|
|||
|
||||
The research-to-compliance translation gap fails for the same structural reason voluntary commitments fail: nothing makes labs adopt research evaluations that exist. RepliBench was published in April 2025 before EU AI Act obligations took effect in August 2025, proving the tools existed before mandatory requirements—but no mechanism translated availability into obligation.
|
||||
|
||||
### Additional Evidence (extend)
|
||||
*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22*
|
||||
|
||||
The coordinated pausing proposal shows that even multilateral voluntary commitments (Version 2: collective agreement) face structural barriers beyond competitive pressure—antitrust law may prohibit the coordination mechanism itself. This suggests the problem is deeper than incentive misalignment: legal architecture actively blocks the coordination that would solve the competitive race.
|
||||
|
||||
|
||||
|
||||
Relevant Notes:
|
||||
- [[the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it]] -- the RSP rollback is the clearest empirical confirmation of this claim
|
||||
|
|
|
|||
|
|
@ -0,0 +1,38 @@
|
|||
{
|
||||
"rejected_claims": [
|
||||
{
|
||||
"filename": "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md",
|
||||
"issues": [
|
||||
"missing_attribution_extractor"
|
||||
]
|
||||
},
|
||||
{
|
||||
"filename": "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md",
|
||||
"issues": [
|
||||
"missing_attribution_extractor"
|
||||
]
|
||||
}
|
||||
],
|
||||
"validation_stats": {
|
||||
"total": 2,
|
||||
"kept": 0,
|
||||
"fixed": 8,
|
||||
"rejected": 2,
|
||||
"fixes_applied": [
|
||||
"evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:set_created:2026-03-22",
|
||||
"evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:AI alignment is a coordination problem not a technical probl",
|
||||
"evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:voluntary safety pledges cannot survive competitive pressure",
|
||||
"evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front",
|
||||
"legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:set_created:2026-03-22",
|
||||
"legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:evaluation-based-coordination-faces-antitrust-obstacles-beca",
|
||||
"legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front",
|
||||
"legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:nation-states will inevitably assert control over frontier A"
|
||||
],
|
||||
"rejections": [
|
||||
"evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:missing_attribution_extractor",
|
||||
"legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:missing_attribution_extractor"
|
||||
]
|
||||
},
|
||||
"model": "anthropic/claude-sonnet-4.5",
|
||||
"date": "2026-03-22"
|
||||
}
|
||||
|
|
@ -7,9 +7,13 @@ date: 2024-00-00
|
|||
domain: ai-alignment
|
||||
secondary_domains: [internet-finance]
|
||||
format: paper
|
||||
status: unprocessed
|
||||
status: enrichment
|
||||
priority: high
|
||||
tags: [coordinated-pausing, evaluation-based-coordination, dangerous-capabilities, mandatory-evaluation, governance-architecture, antitrust, GovAI, B1-disconfirmation, translation-gap]
|
||||
processed_by: theseus
|
||||
processed_date: 2026-03-22
|
||||
enrichments_applied: ["AI alignment is a coordination problem not a technical problem.md", "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md", "only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md"]
|
||||
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||
---
|
||||
|
||||
## Content
|
||||
|
|
@ -56,3 +60,11 @@ GovAI proposes an evaluation-based coordination scheme in which frontier AI deve
|
|||
PRIMARY CONNECTION: domains/ai-alignment/alignment-reframed-as-coordination-problem.md and translation-gap findings
|
||||
WHY ARCHIVED: The most detailed published proposal for closing the research-to-compliance translation gap; also provides the specific legal obstacle (antitrust) explaining why voluntary coordination can't solve the problem
|
||||
EXTRACTION HINT: The antitrust obstacle to coordinated pausing is the key claim — it explains why the translation gap requires government mandate (Version 4) not just industry coordination, connecting to the FDA vs. SEC model distinction
|
||||
|
||||
|
||||
## Key Facts
|
||||
- GovAI proposed coordinated pausing scheme has four versions: voluntary pausing, collective agreement, single auditor model, and legal mandate
|
||||
- Coordinated pausing triggers on models failing dangerous capability evaluations for: chemical weapons design, safety-critical software exploitation, disinformation synthesis, evading human control
|
||||
- Five-step coordinated pausing process: evaluate → pause if failed → notify others → others pause related work → analyze and resume when safe
|
||||
- GovAI explicitly identifies antitrust law as a practical obstacle requiring resolution
|
||||
- AI development is dominated by handful of large companies: OpenAI, Anthropic, Google DeepMind, Meta
|
||||
|
|
|
|||
Loading…
Reference in a new issue