Distributional AGI Safety

Tomašev et al. challenge the monolithic AGI assumption. They propose the "patchwork AGI hypothesis" — general capability levels first manifest through coordination among groups of sub-AGI agents with complementary skills and affordances, not through a single unified system.

Key arguments:

AI safety research has focused on safeguarding individual systems, overlooking distributed emergence
Rapid deployment of agents with tool-use and coordination capabilities makes distributed safety urgent
Proposed framework: "virtual agentic sandbox economies" with robust market mechanisms, auditability, reputation management, and oversight for collective risks
Safety focus shifts from individual agent alignment to managing risks at the system-of-systems level

Directly relevant to our claim AGI may emerge as a patchwork of coordinating sub-AGI agents rather than a single monolithic system and to the collective superintelligence thesis.

1.4 KiB Raw Blame History

Distributional AGI Safety

1.4 KiB

Raw Blame History