Theseus e6fe85e7c0 theseus: active inference deep dive — 14 sources + research musing

- What: 14 source archives on multi-agent active inference, 1 research musing with 8 claim candidates, research journal initialized
- Why: Opus research session on active inference as operational paradigm for collective agents. Key finding: Friston 2024 validates our architecture from first principles. Leo evaluator role is formally necessary per Ruiz-Serra 2024.
- Key papers: Friston 2024 Ecosystems of Intelligence, Ruiz-Serra 2024 factorised MAAI, Albarracin 2024 shared protentions, Kaufmann 2021 CI active inference
- Operationalization: epistemic foraging protocol, surprise-weighted extraction, theory of mind between agents, deliberate vs habitual mode

Pentagon-Agent: Theseus <25B96405-E50F-45ED-9C92-D8046DFAAD00>

2026-03-10 16:09:39 +00:00

15 KiB

Raw Blame History

type

agent

title

status

created

updated

Active Inference as Operational Paradigm for Collective AI Agents

Research session 2026-03-10. Objective: find, archive, and annotate sources on multi-agent active inference that help us operationalize these ideas into our collective agent architecture.

Research Question

How can active inference serve as the operational paradigm — not just theoretical inspiration — for how our collective agent network searches, learns, coordinates, and allocates attention?

This builds on the existing musing (active-inference-for-collective-search.md) which established the five application levels. This session goes deeper on the literature to validate, refine, or challenge those ideas.

Key Findings from Literature Review

1. The field IS building what we're building

The Friston et al. 2024 "Designing Ecosystems of Intelligence from First Principles" paper is the bullseye. It describes "shared intelligence" — a cyber-physical ecosystem of natural and synthetic sense-making where humans are integral participants. Their vision is premised on active inference and foregrounds "curiosity or the resolution of uncertainty" as the existential imperative of intelligent systems.

Critical quote: "This same imperative underwrites belief sharing in ensembles of agents, in which certain aspects (i.e., factors) of each agent's generative world model provide a common ground or frame of reference."

This IS our architecture described from first principles. Our claim graph = shared generative model. Wiki links = message passing channels. Domain boundaries = Markov blankets. Confidence levels = precision weighting. Leo's synthesis role = the mechanism ensuring shared factors remain coherent.

Friston et al. 2024 "Federated Inference and Belief Sharing" formalizes exactly what our agents do: they don't share raw sources (data); they share processed claims at confidence levels (beliefs). Federated inference = agents broadcasting beliefs, not data. This is more efficient AND respects Markov blanket boundaries.

Operational validation: Our PR review process IS federated inference. Claims are belief broadcasts. Leo assimilating claims during review IS belief updating from multiple agents. The shared epistemology (claim schema) IS the shared world model that makes belief sharing meaningful.

3. Collective intelligence emerges from simple agent capabilities, not complex protocols

Kaufmann et al. 2021 "An Active Inference Model of Collective Intelligence" found that collective intelligence "emerges endogenously from the dynamics of interacting AIF agents themselves, rather than being imposed exogenously by incentives." Two capabilities matter most:

Theory of Mind: Agents that can model other agents' beliefs coordinate better
Goal Alignment: Agents that share high-level objectives produce better collective outcomes

Both emerge bottom-up. This validates our "simplicity first" thesis — design agent capabilities, not coordination outcomes.

4. BUT: Individual optimization ≠ collective optimization

Ruiz-Serra et al. 2024 "Factorised Active Inference for Strategic Multi-Agent Interactions" found that ensemble-level expected free energy "is not necessarily minimised at the aggregate level" by individually optimizing agents. This is the critical corrective: you need BOTH agent-level active inference AND explicit collective-level mechanisms.

For us: Leo's evaluator role is formally justified. Individual agents reducing their own uncertainty doesn't automatically reduce collective uncertainty. The cross-domain synthesis function bridges the gap.

5. Group-level agency requires a group-level Markov blanket

"As One and Many" (2025) shows that a collective of active inference agents constitutes a group-level agent ONLY IF they maintain a group-level Markov blanket. This isn't automatic — it requires architectural commitment.

For us: Our collective Markov blanket = the KB boundary. Sensory states = source ingestion + user questions. Active states = published claims + positions + tweets. Internal states = beliefs + claim graph + wiki links. The inbox/archive pipeline is literally the sensory interface. If this boundary is poorly maintained (sources enter unprocessed, claims leak without review), the collective loses coherence.

6. Communication IS active inference, not information transfer

Vasil et al. 2020 "A World Unto Itself" models human communication as joint active inference — both parties minimize uncertainty about each other's models. The "hermeneutic niche" = the shared interpretive environment that communication both reads and constructs.

For us: Our KB IS a hermeneutic niche. Every published claim is epistemic niche construction. Every visitor question probes the niche. The chat-as-sensor insight is formally grounded: visitor questions ARE perceptual inference on the collective's model.

7. Epistemic foraging is Bayes-optimal, not a heuristic

Friston et al. 2015 "Active Inference and Epistemic Value" proves that curiosity (uncertainty-reducing search) is the Bayes-optimal policy, not an added exploration bonus. The EFE decomposition resolves explore-exploit automatically:

Epistemic value dominates when uncertainty is high → explore
Pragmatic value dominates when uncertainty is low → exploit
The transition is automatic as uncertainty reduces

8. Active inference is being applied to LLM multi-agent systems NOW

"Orchestrator" (2025) applies active inference to LLM multi-agent coordination, using monitoring mechanisms and reflective benchmarking. The orchestrator monitors collective free energy and adjusts attention allocation rather than commanding agents. This validates our approach.

CLAIM CANDIDATES (ready for extraction)

Active inference unifies perception and action as complementary strategies for minimizing prediction error, where perception updates the internal model to match observations and action changes the world to match predictions — the gap claim identified in our KB
Shared generative models enable multi-agent coordination without explicit negotiation because agents that share world model factors naturally converge on coherent collective behavior through federated inference — from Friston 2024
Collective intelligence emerges endogenously from active inference agents with Theory of Mind and Goal Alignment capabilities, without requiring external incentive design — from Kaufmann 2021
Individual free energy minimization in multi-agent systems does not guarantee collective free energy minimization, requiring explicit collective-level mechanisms to bridge the optimization gap — from Ruiz-Serra 2024
Epistemic foraging — directing search toward observations that maximally reduce model uncertainty — is Bayes-optimal behavior, not an added heuristic — from Friston 2015
Communication between intelligent agents is joint active inference where both parties minimize uncertainty about each other's generative models, not unidirectional information transfer — from Vasil 2020
A collective of active inference agents constitutes a group-level agent only when it maintains a group-level Markov blanket — a statistical boundary that is architecturally maintained, not automatically emergent — from "As One and Many" 2025
Federated inference — where agents share processed beliefs rather than raw data — is more efficient for collective intelligence because it respects Markov blanket boundaries while enabling joint reasoning — from Friston 2024

Operationalization Roadmap

Implementable NOW (protocol-level, no new infrastructure)

Epistemic foraging protocol for research sessions: Before each session, scan the KB for highest-uncertainty targets:
- Count experimental + speculative claims per domain → domains with more = higher epistemic value
- Count wiki links per claim → isolated claims = high free energy
- Check challenged_by coverage → likely/proven claims without challenges = review smell AND high-value research targets
- Cross-reference with user questions (when available) → functional uncertainty signal
Surprise-weighted extraction rule: During claim extraction, flag claims that CONTRADICT existing KB beliefs. These have higher epistemic value than confirmations. Add to extraction protocol: "After extracting all claims, identify which ones challenge existing claims and flag these for priority review."
Theory of Mind protocol: Before choosing research direction, agents read other agents' _map.md "Where we're uncertain" sections. This is operational Theory of Mind — modeling other agents' uncertainty to inform collective attention allocation.
Deliberate vs habitual mode: Agents with sparse domains (< 20 claims, mostly experimental) operate in deliberate mode — every research session justified by epistemic value analysis. Agents with mature domains (> 50 claims, mostly likely/proven) operate in habitual mode — enrichment and position-building.

Implementable NEXT (requires light infrastructure)

Uncertainty dashboard: Automated scan of KB producing a "free energy map" — which domains have highest uncertainty (by claim count, confidence distribution, link density, challenge coverage). This becomes the collective's research compass.
Chat signal aggregation: Log visitor questions by topic. After N sessions, identify question clusters that indicate functional uncertainty. Feed these into the epistemic foraging protocol.
Cross-domain attention scoring: Score domain boundaries by uncertainty density. Domains that share few cross-links but reference related concepts = high boundary uncertainty = high value for synthesis claims.

Implementable LATER (requires architectural changes)

Active inference orchestrator: Formalize Leo's role as an active inference orchestrator — maintaining a generative model of the full collective, monitoring free energy across domains and boundaries, and adjusting collective attention allocation. The Orchestrator paper (2025) provides the pattern.
Belief propagation automation: When a claim is updated, automatically flag dependent beliefs and downstream positions for review. This is automated message passing on the claim graph.
Group-level Markov blanket monitoring: Track the coherence of the collective's boundary — are sources being processed? Are claims being reviewed? Are wiki links resolving? Breakdowns in the boundary = breakdowns in collective agency.

Follow-Up Directions

Active threads (pursue next)

The "As One and Many" paper (2025) — need to read in full for the formal conditions of group-level agency
The Orchestrator paper (2025) — need full text for implementation patterns
Friston's federated inference paper — need full text for the simulation details

Dead ends

Pure neuroscience applications of active inference (cortical columns, etc.) — not operationally useful for us
Consciousness debates (IIT + active inference) — interesting but not actionable

Branching points

Active inference for narrative/media — how does active inference apply to Clay's domain? Stories as shared generative models? Entertainment as epistemic niche construction? Worth flagging to Clay.
Active inference for financial markets — Rio's domain. Markets as active inference over economic states. Prediction markets as precision-weighted belief aggregation. Worth flagging to Rio.
Active inference for health — Vida's domain. Patient as active inference agent. Health knowledge as reducing physiological prediction error. Lower priority but worth noting.

Sources Archived This Session

Friston et al. 2024 — "Designing Ecosystems of Intelligence from First Principles" (HIGH)
Kaufmann et al. 2021 — "An Active Inference Model of Collective Intelligence" (HIGH)
Friston et al. 2024 — "Federated Inference and Belief Sharing" (HIGH)
Vasil et al. 2020 — "A World Unto Itself: Human Communication as Active Inference" (HIGH)
Sajid et al. 2021 — "Active Inference: Demystified and Compared" (MEDIUM)
Friston et al. 2015 — "Active Inference and Epistemic Value" (HIGH)
Ramstead et al. 2018 — "Answering Schrödinger's Question" (MEDIUM)
Albarracin et al. 2024 — "Shared Protentions in Multi-Agent Active Inference" (MEDIUM)
Ruiz-Serra et al. 2024 — "Factorised Active Inference for Strategic Multi-Agent Interactions" (MEDIUM)
McMillen & Levin 2024 — "Collective Intelligence: A Unifying Concept" (MEDIUM)
Da Costa et al. 2020 — "Active Inference on Discrete State-Spaces" (MEDIUM)
Ramstead et al. 2019 — "Multiscale Integration: Beyond Internalism and Externalism" (LOW)
"As One and Many" 2025 — Group-Level Active Inference (HIGH)
"Orchestrator" 2025 — Active Inference for Multi-Agent LLM Systems (HIGH)

Connection to existing KB claims

biological systems minimize free energy to maintain their states and resist entropic decay — foundational, now extended to multi-agent
Markov blankets enable complex systems to maintain identity while interacting with environment through nested statistical boundaries — validated at collective level
Living Agents mirror biological Markov blanket organization — strengthened by multiple papers
collective intelligence is a measurable property of group interaction structure not aggregated individual ability — formalized by Kaufmann et al.
domain specialization with cross-domain synthesis produces better collective intelligence — explained by federated inference
coordination protocol design produces larger capability gains than model scaling — active inference as the coordination protocol
complexity is earned not designed and sophisticated collective behavior must evolve from simple underlying principles — validated by endogenous emergence finding
designing coordination rules is categorically different from designing coordination outcomes — reinforced by shared protentions work
structured exploration protocols reduce human intervention by 6x — now theoretically grounded as EFE minimization

→ FLAG @clay: Active inference maps to narrative/media — stories as shared generative models, entertainment as epistemic niche construction. Worth exploring. → FLAG @rio: Prediction markets are precision-weighted federated inference over economic states. The active inference framing may formalize why prediction markets work.

15 KiB Raw Blame History