teleo-codex/domains/ai-alignment/individual-free-energy-minimization-does-not-guarantee-collective-optimization-in-multi-agent-active-inference.md
Teleo Agents 3508619901 theseus: extract from 2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Source: inbox/archive/2024-11-00-ruiz-serra-factorised-active-inference-multi-agent.md
- Domain: ai-alignment
- Extracted by: headless extraction cron (worker 6)

Pentagon-Agent: Theseus <HEADLESS>
2026-03-12 06:25:03 +00:00

3.2 KiB

type domain description confidence source created secondary_domains
claim ai-alignment Individual free energy minimization in multi-agent active inference systems does not guarantee collective free energy minimization because ensemble-level expected free energy characterizes basins of attraction that may not align with individual optima experimental Ruiz-Serra et al., 'Factorised Active Inference for Strategic Multi-Agent Interactions' (AAMAS 2025) 2026-03-11
collective-intelligence

Individual free energy minimization does not guarantee collective optimization in multi-agent active inference systems

When multiple active inference agents interact strategically, each agent minimizing its own expected free energy does not necessarily produce optimal collective outcomes. Ruiz-Serra et al. demonstrate through game-theoretic analysis that "ensemble-level expected free energy characterizes basins of attraction of games with multiple Nash Equilibria under different conditions" but "it is not necessarily minimised at the aggregate level."

This finding reveals a fundamental tension between individual and collective optimization in multi-agent active inference systems. While individual agents successfully minimize their own free energy through strategic planning based on beliefs about other agents' internal states, the aggregate system behavior can settle into suboptimal equilibria.

The framework uses factorised generative models where each agent maintains "explicit, individual-level beliefs about the internal states of other agents" to enable decentralized strategic planning. Applied to iterated normal-form games with 2-3 players, the model shows how interaction structure (game type, communication channels) determines whether individual optimization produces collective intelligence or collective failure.

Evidence

  • Ruiz-Serra et al. (2024) show through formal analysis of multi-agent active inference in game-theoretic settings that ensemble-level EFE is not necessarily minimized at aggregate level despite individual optimization
  • The paper demonstrates this through iterated normal-form games where individually rational agents can produce collectively suboptimal Nash equilibria
  • The specific interaction structure (game form, communication channels) determines whether collective optimization emerges from individual free energy minimization

Implications

This result has critical implications for multi-agent AI system design. It means autonomous agents cannot be deployed with only individual optimization objectives and expected to produce beneficial collective outcomes. Explicit coordination mechanisms—evaluator roles, structured interaction protocols, cross-domain synthesis—are necessary architectural additions beyond pure agent autonomy.


Relevant Notes:

Topics: