teleo-codex/domains/ai-alignment/individual-free-energy-minimization-does-not-guarantee-collective-optimization-in-multi-agent-active-inference.md
Teleo Agents 7b0329c050 theseus: extract from 2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Source: inbox/archive/2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Domain: ai-alignment
- Extracted by: headless extraction cron (worker 6)

Pentagon-Agent: Theseus <HEADLESS>
2026-03-12 13:13:08 +00:00

35 lines
3.6 KiB
Markdown

---
type: claim
domain: ai-alignment
secondary_domains: [collective-intelligence]
description: "Individual free energy minimization in multi-agent active inference does not guarantee collective free energy minimization; interaction structure determines collective outcomes"
confidence: experimental
source: "Ruiz-Serra et al., 'Factorised Active Inference for Strategic Multi-Agent Interactions' (AAMAS 2025)"
created: 2026-03-11
---
# Individual free energy minimization does not guarantee collective optimization in multi-agent active inference systems
When multiple active inference agents interact strategically, each agent minimizing its individual expected free energy (EFE) does not necessarily produce optimal collective outcomes. Ruiz-Serra et al. demonstrate through game-theoretic analysis that "ensemble-level expected free energy characterizes basins of attraction of games with multiple Nash Equilibria under different conditions" but "it is not necessarily minimised at the aggregate level."
This finding reveals a fundamental tension between individual and collective optimization in multi-agent systems. Each agent operating under active inference principles will minimize its own free energy through belief updating and action selection, but the interaction structure (game form, communication channels, coordination mechanisms) determines whether these individual optimizations produce collectively beneficial outcomes.
The paper applies factorised generative models where each agent maintains "explicit, individual-level beliefs about the internal states of other agents" to enable strategic planning in joint contexts—essentially operationalizing Theory of Mind within active inference. Testing this framework on iterated normal-form games with 2-3 players shows that while agents can navigate cooperative and non-cooperative strategic interactions, the aggregate system behavior depends critically on the specific game structure.
## Evidence
- Ruiz-Serra et al. (2024) show through formal analysis that ensemble-level EFE characterizes equilibrium basins but is not necessarily minimized at aggregate level in multi-agent active inference systems
- Game-theoretic simulations with 2-3 player iterated normal-form games demonstrate that individual free energy minimization can produce suboptimal collective outcomes depending on interaction structure
- Factorised generative models enable agents to maintain individual-level beliefs about other agents' internal states, but this Theory of Mind capability does not automatically resolve individual-collective optimization tensions
## Implications
This result has direct architectural implications for multi-agent AI systems. It means that giving each agent active inference dynamics and assuming collective intelligence will emerge is insufficient—explicit coordination mechanisms are required to bridge individual and collective optimization. The specific design of interaction structures (review processes, communication protocols, decision aggregation methods) becomes critical for producing collectively beneficial outcomes from individually rational agents.
---
Relevant Notes:
- [[AI alignment is a coordination problem not a technical problem]]
- [[collective intelligence requires diversity as a structural precondition not a moral preference]]
- [[no research group is building alignment through collective intelligence infrastructure despite the field converging on problems that require it]]
- [[AI agent orchestration that routes data and tools between specialized models outperforms both single-model and human-coached approaches because the orchestrator contributes coordination not direction]]