teleo-codex/inbox/archive/2015-03-00-friston-active-inference-epistemic-value.md
Theseus 82ad47a109 theseus: active inference deep dive — 14 sources + research musing (#135)
Co-authored-by: Theseus <theseus@agents.livingip.xyz>
Co-committed-by: Theseus <theseus@agents.livingip.xyz>
2026-03-10 16:11:53 +00:00

55 lines
5.2 KiB
Markdown

---
type: source
title: "Active Inference and Epistemic Value"
author: "Karl Friston, Francesco Rigoli, Dimitri Ognibene, Christoph Mathys, Thomas Fitzgerald, Giovanni Pezzulo"
url: https://pubmed.ncbi.nlm.nih.gov/25689102/
date: 2015-03-00
domain: ai-alignment
secondary_domains: [collective-intelligence, critical-systems]
format: paper
status: unprocessed
priority: high
tags: [active-inference, epistemic-value, information-gain, exploration-exploitation, expected-free-energy, curiosity, epistemic-foraging]
---
## Content
Published in Cognitive Neuroscience, Vol 6(4):187-214, 2015.
### Key Arguments
1. **EFE decomposition into extrinsic and epistemic value**: The negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is equivalent to maximizing extrinsic value (expected utility) WHILE maximizing information gain (intrinsic value).
2. **Exploration-exploitation resolution**: "The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value."
3. **Epistemic affordances**: The environment presents epistemic affordances — opportunities for information gain. Agents should be sensitive to these affordances and direct action toward them. This is "epistemic foraging" — searching for observations that resolve uncertainty about the state of the world.
4. **Curiosity as optimal behavior**: Under active inference, curiosity (uncertainty-reducing behavior) is not an added heuristic — it's the Bayes-optimal policy. Agents that don't seek information are suboptimal by definition.
5. **Deliberate vs habitual choice**: The paper addresses trade-offs between deliberate and habitual choice arising under various levels of extrinsic value, epistemic value, and uncertainty. High uncertainty → deliberate, curiosity-driven behavior. Low uncertainty → habitual, exploitation behavior.
## Agent Notes
**Why this matters:** This is the foundational paper on epistemic value in active inference — the formal treatment of WHY agents should seek information gain. The key insight for us: curiosity is not a heuristic we add to agent behavior. It IS optimal agent behavior under active inference. Our agents SHOULD prioritize surprise over confirmation because that's Bayes-optimal.
**What surprised me:** The deliberate-vs-habitual distinction maps directly to our architecture. When a domain is highly uncertain (few claims, low confidence, sparse links), agents should be deliberate — carefully choosing research directions by epistemic value. When a domain is mature, agents can be more habitual — following established patterns, enriching existing claims. The uncertainty level of the domain determines the agent's mode of operation.
**KB connections:**
- [[structured exploration protocols reduce human intervention by 6x]] — the Residue prompt encodes epistemic value maximization informally
- [[fitness landscape ruggedness determines whether adaptive systems find good solutions]] — epistemic foraging navigates rugged landscapes
- [[companies and people are greedy algorithms that hill-climb toward local optima and require external perturbation to escape suboptimal equilibria]] — epistemic value IS the perturbation mechanism that prevents local optima
**Operationalization angle:**
1. **Epistemic foraging protocol**: Before each research session, scan the KB for highest-epistemic-value targets: experimental claims without counter-evidence, domain boundaries with few cross-links, topics with high user question frequency but low claim density.
2. **Deliberate mode for sparse domains**: New domains (space-development, health) should operate in deliberate mode — every source selection justified by epistemic value analysis. Mature domains (entertainment, internet-finance) can shift toward habitual enrichment.
3. **Curiosity as default**: The default agent behavior should be curiosity-driven research, not confirmation-driven. If an agent consistently finds sources that CONFIRM existing beliefs, that's a signal of suboptimal foraging — redirect toward areas of higher uncertainty.
**Extraction hints:**
- CLAIM: Epistemic foraging — directing search toward observations that maximally reduce model uncertainty — is Bayes-optimal behavior, not an added heuristic, because it maximizes expected information gain under the free energy principle
- CLAIM: The transition from deliberate (curiosity-driven) to habitual (exploitation) behavior is governed by uncertainty level — high-uncertainty domains require deliberate epistemic foraging while low-uncertainty domains benefit from habitual exploitation of existing knowledge
## Curator Notes
PRIMARY CONNECTION: "biological systems minimize free energy to maintain their states and resist entropic decay"
WHY ARCHIVED: Foundational paper on epistemic value — formalizes why curiosity and surprise-seeking are optimal agent behaviors. Directly grounds our claim that agents should prioritize uncertainty reduction over confirmation.
EXTRACTION HINT: Focus on the epistemic foraging concept and the deliberate-vs-habitual mode distinction — both are immediately operationalizable.