
---
type: source
title: "Distributional AGI Safety"
author: "Nenad Tomašev, Matija Franklin, Julian Jacobs, Sébastien Krier, Simon Osindero"
url: https://arxiv.org/abs/2512.16856
date_published: 2025-12-18
date_archived: 2026-03-16
domain: ai-alignment
status: enrichment
processed_by: theseus
tags: [distributed-agi, multi-agent-safety, patchwork-hypothesis, coordination]
sourced_via: "Alex Obadia (@ObadiaAlex) tweet, ARIA Research Scaling Trust programme"
twitter_id: "712705562191011841"
processed_date: 2026-03-19
enrichments_applied: ["AGI may emerge as a patchwork of coordinating sub-AGI agents rather than a single monolithic system.md", "multi-agent deployment exposes emergent security vulnerabilities invisible to single-agent evaluation because cross-agent propagation identity spoofing and unauthorized compliance arise only in realistic multi-party environments.md", "AI alignment is a coordination problem not a technical problem.md"]
extraction_model: "anthropic/claude-sonnet-4.5"
---

Distributional AGI Safety

Tomašev et al. challenge the assumption that AGI will arrive as a single monolithic system. They propose the "patchwork AGI hypothesis": general capability levels first manifest through coordination among groups of sub-AGI agents with complementary skills and affordances, not through a single unified system.

Key arguments:

  • AI safety research has focused on safeguarding individual systems, overlooking distributed emergence
  • Rapid deployment of agents with tool-use and coordination capabilities makes distributed safety urgent
  • Proposed framework: "virtual agentic sandbox economies" with robust market mechanisms, auditability, reputation management, and oversight for collective risks
  • Safety focus shifts from individual agent alignment to managing risks at the system-of-systems level
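
To make the last two points concrete, here is a minimal Python sketch of the difference between an agent-level check and a system-level check. It is our own illustration, not a mechanism or code from the paper: the `Sandbox` class, both limits, and the reputation rule are hypothetical and chosen only to keep the example small.

```python
from dataclasses import dataclass, field


@dataclass
class Sandbox:
    """Toy sandbox. Every name and threshold here is a hypothetical assumption."""
    per_agent_limit: float = 10.0    # cap checked on each individual action
    collective_limit: float = 25.0   # cap checked on the running total across agents
    audit_log: list = field(default_factory=list)   # record of every attempt
    reputation: dict = field(default_factory=dict)  # agent id -> score
    total_usage: float = 0.0

    def submit(self, agent: str, amount: float) -> bool:
        """Accept an action only if it passes both the agent-level and system-level checks."""
        agent_ok = amount <= self.per_agent_limit
        system_ok = self.total_usage + amount <= self.collective_limit
        accepted = agent_ok and system_ok
        # Log every attempt, accepted or not, so the decision can be audited later.
        self.audit_log.append((agent, amount, agent_ok, system_ok))
        if accepted:
            self.total_usage += amount
        # Simple reputation rule: +1 for an accepted action, -1 for a rejected one.
        self.reputation[agent] = self.reputation.get(agent, 0) + (1 if accepted else -1)
        return accepted


sandbox = Sandbox()
results = [sandbox.submit(agent, 9.0) for agent in ("a", "b", "c")]
print(results)  # [True, True, False]
```

Each of the three actions passes the per-agent check on its own, but the third is rejected because the combined total would exceed the collective limit. A purely agent-level evaluation would have accepted all three, which is the gap the system-of-systems framing points at.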

Directly relevant to our claim "AGI may emerge as a patchwork of coordinating sub-AGI agents rather than a single monolithic system" and to the collective superintelligence thesis.

Key Facts

  • ARIA Research Scaling Trust programme funded research on distributional AGI safety
  • Paper published December 2025 on arXiv (2512.16856)
  • Authors: Nenad Tomašev, Matija Franklin, Julian Jacobs, Sébastien Krier, Simon Osindero