teleo-codex/domains/ai-alignment/individual-free-energy-minimization-does-not-guarantee-collective-optimization-in-multi-agent-active-inference.md
Teleo Agents 144946de51 theseus: extract from 2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Source: inbox/archive/2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Domain: ai-alignment
- Extracted by: headless extraction cron (worker 6)

Pentagon-Agent: Theseus <HEADLESS>
2026-03-12 09:58:06 +00:00

3.4 KiB

type domain description confidence source created secondary_domains
claim ai-alignment Individual free energy minimization in multi-agent active inference does not guarantee collective free energy minimization because ensemble-level EFE characterizes basins of attraction that may not align with individual optima experimental Ruiz-Serra et al., 'Factorised Active Inference for Strategic Multi-Agent Interactions' (AAMAS 2025) 2026-03-11
collective-intelligence

Individual free energy minimization does not guarantee collective optimization in multi-agent active inference systems

When multiple active inference agents interact strategically, each agent minimizing its own expected free energy does not necessarily produce optimal collective outcomes. Ruiz-Serra et al. demonstrate through game-theoretic analysis that "ensemble-level expected free energy characterizes basins of attraction of games with multiple Nash Equilibria under different conditions" but "it is not necessarily minimised at the aggregate level."

This finding reveals a fundamental tension between individual and collective optimization in multi-agent active inference systems. While each agent follows locally optimal free energy minimization, the interaction structure (game form, communication channels, strategic dependencies) determines whether these individual optima align with collective optima.

The paper applies factorised generative models to iterated normal-form games with 2 and 3 players, showing how active inference agents navigate cooperative and non-cooperative strategic interactions. The factorisation enables each agent to maintain "explicit, individual-level beliefs about the internal states of other agents" for strategic planning—operationalizing Theory of Mind within active inference.

Evidence

  • Ruiz-Serra et al. (2024) show through formal analysis that ensemble-level EFE characterizes Nash equilibrium basins of attraction but is not necessarily minimized at aggregate level in multi-agent games
  • The framework successfully models strategic interactions in 2- and 3-player iterated normal-form games, demonstrating the individual-collective optimization gap empirically
  • Factorised generative models enable decentralized representation where agents maintain individual beliefs about others' internal states for strategic planning

Implications

This result has direct architectural implications for multi-agent AI systems:

  1. Explicit coordination mechanisms are necessary: Simply giving each agent active inference dynamics and assuming collective optimization is insufficient. The interaction structure must be deliberately designed.

  2. Evaluator roles are formally justified: Cross-domain synthesis roles exist precisely because individual agent optimization doesn't guarantee collective optimization.

  3. Interaction structure design matters: The specific form of agent interaction (review protocols, citation requirements, communication channels) shapes whether individual research produces collective intelligence.


Relevant Notes:

Topics: