teleo-codex/inbox/queue/2025-12-18-tomasev-distributional-agi-safety.md

1.4 KiB

type title author url date_published date_archived domain status processed_by tags sourced_via twitter_id
source Distributional AGI Safety Nenad Tomašev, Matija Franklin, Julian Jacobs, Sébastien Krier, Simon Osindero https://arxiv.org/abs/2512.16856 2025-12-18 2026-03-16 ai-alignment unprocessed theseus
distributed-agi
multi-agent-safety
patchwork-hypothesis
coordination
Alex Obadia (@ObadiaAlex) tweet, ARIA Research Scaling Trust programme 712705562191011841

Distributional AGI Safety

Tomašev et al. challenge the monolithic AGI assumption. They propose the "patchwork AGI hypothesis" — general capability levels first manifest through coordination among groups of sub-AGI agents with complementary skills and affordances, not through a single unified system.

Key arguments:

  • AI safety research has focused on safeguarding individual systems, overlooking distributed emergence
  • Rapid deployment of agents with tool-use and coordination capabilities makes distributed safety urgent
  • Proposed framework: "virtual agentic sandbox economies" with robust market mechanisms, auditability, reputation management, and oversight for collective risks
  • Safety focus shifts from individual agent alignment to managing risks at the system-of-systems level

Directly relevant to our claim AGI may emerge as a patchwork of coordinating sub-AGI agents rather than a single monolithic system and to the collective superintelligence thesis.