diff --git a/domains/ai-alignment/AI alignment is a coordination problem not a technical problem.md b/domains/ai-alignment/AI alignment is a coordination problem not a technical problem.md index c554ceccb..8f300dbf3 100644 --- a/domains/ai-alignment/AI alignment is a coordination problem not a technical problem.md +++ b/domains/ai-alignment/AI alignment is a coordination problem not a technical problem.md @@ -47,6 +47,12 @@ Krier provides institutional mechanism: personal AI agents enable Coasean bargai --- +### Additional Evidence (extend) +*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22* + +GovAI's coordinated pausing proposal provides the specific legal mechanism blocking coordination: antitrust law treats collective agreements to pause development among competing AI labs as potential cartel behavior, making voluntary coordination legally risky even when technically feasible. This is the concrete instantiation of the coordination problem—not just misaligned incentives, but legal architecture that prohibits the coordination mechanism. + + Relevant Notes: - [[the internet enabled global communication but not global cognition]] -- the coordination infrastructure gap that makes this problem unsolvable with existing tools - [[the alignment problem dissolves when human values are continuously woven into the system rather than specified in advance]] -- the structural solution to this coordination failure diff --git a/domains/ai-alignment/only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md b/domains/ai-alignment/only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md index e91ae6603..c88fc0f16 100644 --- a/domains/ai-alignment/only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md +++ b/domains/ai-alignment/only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md @@ -55,6 +55,12 @@ Third-party pre-deployment audits are the top expert consensus priority (>60% ag Despite UK AISI building comprehensive control evaluation infrastructure (RepliBench, control monitoring frameworks, sandbagging detection, cyber attack scenarios), there is no evidence of regulatory adoption into EU AI Act Article 55 or other mandatory compliance frameworks. The research exists but governance does not pull it into enforceable standards, confirming that technical capability without binding requirements does not change deployment behavior. +### Additional Evidence (confirm) +*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22* + +GovAI's four-version escalation from voluntary pausing to legal mandate provides a roadmap showing why binding regulation is necessary: Versions 1-3 (voluntary mechanisms) all face antitrust obstacles that Version 4 (legal mandate) avoids. The necessity of government mandate is not just about enforcement but about legal structure—only regulatory requirements can mandate coordination without triggering competition law violations. + + Relevant Notes: - [[voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints]] — confirmed with extensive evidence across multiple labs and governance mechanisms diff --git a/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md b/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md index 16fde7456..3fc148219 100644 --- a/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md +++ b/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md @@ -58,6 +58,12 @@ Government pressure adds to competitive dynamics. The DoD/Anthropic episode show The research-to-compliance translation gap fails for the same structural reason voluntary commitments fail: nothing makes labs adopt research evaluations that exist. RepliBench was published in April 2025 before EU AI Act obligations took effect in August 2025, proving the tools existed before mandatory requirements—but no mechanism translated availability into obligation. +### Additional Evidence (extend) +*Source: [[2024-00-00-govai-coordinated-pausing-evaluation-scheme]] | Added: 2026-03-22* + +The coordinated pausing proposal shows that even multilateral voluntary commitments (Version 2: collective agreement) face structural barriers beyond competitive pressure—antitrust law may prohibit the coordination mechanism itself. This suggests the problem is deeper than incentive misalignment: legal architecture actively blocks the coordination that would solve the competitive race. + + Relevant Notes: - [[the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it]] -- the RSP rollback is the clearest empirical confirmation of this claim diff --git a/inbox/queue/.extraction-debug/2024-00-00-govai-coordinated-pausing-evaluation-scheme.json b/inbox/queue/.extraction-debug/2024-00-00-govai-coordinated-pausing-evaluation-scheme.json new file mode 100644 index 000000000..6e406e76d --- /dev/null +++ b/inbox/queue/.extraction-debug/2024-00-00-govai-coordinated-pausing-evaluation-scheme.json @@ -0,0 +1,38 @@ +{ + "rejected_claims": [ + { + "filename": "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md", + "issues": [ + "missing_attribution_extractor" + ] + }, + { + "filename": "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md", + "issues": [ + "missing_attribution_extractor" + ] + } + ], + "validation_stats": { + "total": 2, + "kept": 0, + "fixed": 8, + "rejected": 2, + "fixes_applied": [ + "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:set_created:2026-03-22", + "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:AI alignment is a coordination problem not a technical probl", + "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:voluntary safety pledges cannot survive competitive pressure", + "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front", + "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:set_created:2026-03-22", + "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:evaluation-based-coordination-faces-antitrust-obstacles-beca", + "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front", + "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:stripped_wiki_link:nation-states will inevitably assert control over frontier A" + ], + "rejections": [ + "evaluation-based-coordination-faces-antitrust-obstacles-because-collective-pausing-resembles-cartel-behavior.md:missing_attribution_extractor", + "legal-mandate-is-only-antitrust-safe-version-of-coordinated-pausing.md:missing_attribution_extractor" + ] + }, + "model": "anthropic/claude-sonnet-4.5", + "date": "2026-03-22" +} \ No newline at end of file diff --git a/inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md b/inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md index 3563c003c..6812d7fc3 100644 --- a/inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md +++ b/inbox/queue/2024-00-00-govai-coordinated-pausing-evaluation-scheme.md @@ -7,9 +7,13 @@ date: 2024-00-00 domain: ai-alignment secondary_domains: [internet-finance] format: paper -status: unprocessed +status: enrichment priority: high tags: [coordinated-pausing, evaluation-based-coordination, dangerous-capabilities, mandatory-evaluation, governance-architecture, antitrust, GovAI, B1-disconfirmation, translation-gap] +processed_by: theseus +processed_date: 2026-03-22 +enrichments_applied: ["AI alignment is a coordination problem not a technical problem.md", "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md", "only binding regulation with enforcement teeth changes frontier AI lab behavior because every voluntary commitment has been eroded abandoned or made conditional on competitor behavior when commercially inconvenient.md"] +extraction_model: "anthropic/claude-sonnet-4.5" --- ## Content @@ -56,3 +60,11 @@ GovAI proposes an evaluation-based coordination scheme in which frontier AI deve PRIMARY CONNECTION: domains/ai-alignment/alignment-reframed-as-coordination-problem.md and translation-gap findings WHY ARCHIVED: The most detailed published proposal for closing the research-to-compliance translation gap; also provides the specific legal obstacle (antitrust) explaining why voluntary coordination can't solve the problem EXTRACTION HINT: The antitrust obstacle to coordinated pausing is the key claim — it explains why the translation gap requires government mandate (Version 4) not just industry coordination, connecting to the FDA vs. SEC model distinction + + +## Key Facts +- GovAI proposed coordinated pausing scheme has four versions: voluntary pausing, collective agreement, single auditor model, and legal mandate +- Coordinated pausing triggers on models failing dangerous capability evaluations for: chemical weapons design, safety-critical software exploitation, disinformation synthesis, evading human control +- Five-step coordinated pausing process: evaluate → pause if failed → notify others → others pause related work → analyze and resume when safe +- GovAI explicitly identifies antitrust law as a practical obstacle requiring resolution +- AI development is dominated by handful of large companies: OpenAI, Anthropic, Google DeepMind, Meta