Some checks are pending
Sync Graph Data to teleo-app / sync (push) Waiting to run
- What: 5 new claims + 6 source archives from papers referenced in Alex Obadia's ARIA Research tweet on distributed AGI safety - Sources: Distributional AGI Safety (Tomašev), Agents of Chaos (Shapira), Simple Economics of AGI (Catalini), When AI Writes Software (de Moura), LLM Open-Source Games (Sistla), Coasean Bargaining (Krier) - Claims: multi-agent emergent vulnerabilities (likely), verification bandwidth as binding constraint (likely), formal verification economic necessity (likely), cooperative program equilibria (experimental), Coasean transaction cost collapse (experimental) - Connections: extends scalable oversight degradation, correlated blind spots, formal verification, coordination-as-alignment Pentagon-Agent: Theseus <B4A5B354-03D6-4291-A6A8-1E04A879D9AC>
26 lines
1.4 KiB
Markdown
26 lines
1.4 KiB
Markdown
---
|
|
type: source
|
|
title: "Distributional AGI Safety"
|
|
author: "Nenad Tomašev, Matija Franklin, Julian Jacobs, Sébastien Krier, Simon Osindero"
|
|
url: https://arxiv.org/abs/2512.16856
|
|
date_published: 2025-12-18
|
|
date_archived: 2026-03-16
|
|
domain: ai-alignment
|
|
status: processing
|
|
processed_by: theseus
|
|
tags: [distributed-agi, multi-agent-safety, patchwork-hypothesis, coordination]
|
|
sourced_via: "Alex Obadia (@ObadiaAlex) tweet, ARIA Research Scaling Trust programme"
|
|
twitter_id: "712705562191011841"
|
|
---
|
|
|
|
# Distributional AGI Safety
|
|
|
|
Tomašev et al. challenge the monolithic AGI assumption. They propose the "patchwork AGI hypothesis" — general capability levels first manifest through coordination among groups of sub-AGI agents with complementary skills and affordances, not through a single unified system.
|
|
|
|
Key arguments:
|
|
- AI safety research has focused on safeguarding individual systems, overlooking distributed emergence
|
|
- Rapid deployment of agents with tool-use and coordination capabilities makes distributed safety urgent
|
|
- Proposed framework: "virtual agentic sandbox economies" with robust market mechanisms, auditability, reputation management, and oversight for collective risks
|
|
- Safety focus shifts from individual agent alignment to managing risks at the system-of-systems level
|
|
|
|
Directly relevant to our claim [[AGI may emerge as a patchwork of coordinating sub-AGI agents rather than a single monolithic system]] and to the collective superintelligence thesis.
|