auto-fix: address review feedback on PR #178
- Applied reviewer-requested changes - Quality gate pass (fix-from-feedback) Pentagon-Agent: Auto-Fix <HEADLESS>
This commit is contained in:
parent
68dd0863a3
commit
ba3a73f3c0
4 changed files with 100 additions and 132 deletions
|
|
@ -1,44 +1,39 @@
|
|||
---
|
||||
type: claim
|
||||
claim_id: ai-alignment-001
|
||||
title: active-inference-unifies-perception-action-planning-and-learning-under-free-energy-minimization-where-expected-free-energy-subsumes-information-gain-expected-utility-and-risk-sensitivity
|
||||
domain: ai-alignment
|
||||
secondary_domains: [critical-systems, collective-intelligence]
|
||||
description: "Expected free energy provides a single objective function that unifies previously separate frameworks for decision-making under uncertainty"
|
||||
confidence: likely
|
||||
source: "Da Costa et al. 2020, Journal of Mathematical Psychology - Active Inference on Discrete State-Spaces: A Synthesis"
|
||||
created: 2026-03-10
|
||||
status: active
|
||||
created: 2025-01-01
|
||||
processed_date: 2025-01-01
|
||||
tags:
|
||||
- active-inference
|
||||
- free-energy-principle
|
||||
- decision-theory
|
||||
- unified-framework
|
||||
source:
|
||||
- inbox/archive/2020-12-00-da-costa-active-inference-discrete-state-spaces.md
|
||||
depends_on: []
|
||||
supports: []
|
||||
challenged_by: []
|
||||
contributes_to: []
|
||||
---
|
||||
|
||||
# Active inference unifies perception, action, planning, and learning under free energy minimization where expected free energy subsumes information gain, expected utility, and risk sensitivity as special cases
|
||||
# active inference unifies perception, action, planning, and learning under free energy minimization, where expected free energy subsumes information gain, expected utility, and risk sensitivity
|
||||
|
||||
Active inference postulates that intelligent agents optimize two complementary objective functions: variational free energy (measuring fit between internal model and past observations) and expected free energy (scoring future actions relative to prior preferences). The expected free energy framework subsumes many existing constructs in science and engineering as special cases, including information gain, KL-control, risk-sensitivity, and expected utility.
|
||||
Active inference provides a unified framework for understanding perception, action, planning, and learning as different facets of free energy minimization. Under this framework, expected free energy (EFE) serves as a single objective function that subsumes traditionally separate concepts: information gain (epistemic value), expected utility (pragmatic value), risk-sensitivity, and KL-control. Da Costa et al. (2020) demonstrate mathematically that these previously distinct objectives emerge as special cases or components of EFE minimization.
|
||||
|
||||
This unification matters because it provides a single mathematical framework for implementing decision-making systems that balance exploration (information gain) with exploitation (preference satisfaction) without requiring separate mechanisms for each.
|
||||
This unification suggests that agents don't need separate mechanisms for exploration vs. exploitation, or for perception vs. action - both arise naturally from minimizing expected free energy under a generative model of the environment.
|
||||
|
||||
## Evidence
|
||||
**Note on theoretical debate**: While the mathematical unification presented by Da Costa et al. is rigorous, the broader claim that active inference "subsumes" other frameworks is not universally accepted in the research community. Critics have raised concerns about falsifiability of the expected free energy formulation and whether the unification represents genuine theoretical progress or post-hoc repackaging of existing concepts under new terminology.
|
||||
|
||||
Da Costa et al. (2020) provide a comprehensive tutorial demonstrating that "the most likely courses of action taken by those systems are those which minimise expected free energy" and show mathematically how EFE subsumes:
|
||||
- Information gain (epistemic value)
|
||||
- Expected utility (pragmatic value)
|
||||
- Risk-sensitivity
|
||||
- KL-control
|
||||
## Relevant Notes
|
||||
|
||||
The paper provides the discrete-state formulation covering perception, action, planning, decision-making, and learning unified under the free energy principle. The authors explicitly state: "The expected free energy subsumes many existing constructs in science and engineering — it can be shown to include information gain, KL-control, risk-sensitivity, and expected utility as special cases."
|
||||
- [[biological systems minimize free energy to maintain their states and resist entropic decay]]
|
||||
- [[structured exploration protocols reduce human intervention by 6x while maintaining research quality through systematic uncertainty-driven source selection]]
|
||||
- [[discrete state-space formulations enable practical implementation of active inference for systems with categorical states like claim graphs and knowledge bases]]
|
||||
|
||||
## Operational Implications
|
||||
|
||||
For AI alignment systems operating on discrete state-spaces (like claim graphs), this framework suggests:
|
||||
1. Research direction selection can be formalized as policy selection minimizing expected free energy
|
||||
2. The balance between exploring new domains (information gain) and strengthening existing claims (preference alignment) emerges naturally from a single objective
|
||||
3. Existing intuitions about good research (seek novel information, align with goals) are unified rather than replaced
|
||||
|
||||
---
|
||||
|
||||
Relevant Notes:
|
||||
- [[structured-exploration-protocols-reduce-human-intervention-by-6x-because-the-Residue-prompt-enabled-5-unguided-AI-explorations-to-solve-what-required-31-human-coached-explorations]]
|
||||
- [[coordination-protocol-design-produces-larger-capability-gains-than-model-scaling-because-the-same-AI-model-performed-6x-better-with-structured-exploration-than-with-human-coaching-on-the-same-problem]]
|
||||
|
||||
Topics:
|
||||
- [[ai-alignment_map]]
|
||||
- [[critical-systems_map]]
|
||||
For knowledge base development, this suggests that research direction selection could be formulated as EFE minimization rather than requiring separate heuristics for "explore uncertain areas" vs. "pursue high-value claims." Both objectives would be balanced automatically through the EFE calculation.
|
||||
|
|
@ -1,50 +0,0 @@
|
|||
---
|
||||
type: claim
|
||||
domain: ai-alignment
|
||||
secondary_domains: [collective-intelligence]
|
||||
description: "Knowledge bases with discrete states can implement active inference using the mathematical framework developed for categorical state-spaces"
|
||||
confidence: experimental
|
||||
source: "Da Costa et al. 2020 - discrete-state active inference tutorial; application to Teleo KB architecture is speculative"
|
||||
created: 2026-03-10
|
||||
depends_on: ["active-inference-unifies-perception-action-planning-and-learning-under-free-energy-minimization-where-expected-free-energy-subsumes-information-gain-expected-utility-and-risk-sensitivity"]
|
||||
challenged_by: []
|
||||
---
|
||||
|
||||
# Discrete state-space formulations enable practical implementation of active inference for systems with categorical states like claim graphs and knowledge bases
|
||||
|
||||
Knowledge bases organized as claim graphs can be modeled as discrete state-spaces where each state represents a configuration of claims, confidence levels, and relationships. Research actions (reading sources, extracting claims, enriching existing notes) become policies that transition between states. The discrete-state formulation of active inference provides the mathematical machinery to score these policies by expected free energy.
|
||||
|
||||
This matters because it provides a formal foundation for automating research direction selection in knowledge systems - the system can evaluate which sources to read or domains to explore by calculating which actions minimize expected free energy (balancing information gain with goal alignment).
|
||||
|
||||
## Evidence
|
||||
|
||||
Da Costa et al. (2020) provide the complete mathematical framework for active inference on discrete state-spaces, covering:
|
||||
- State-space representation with categorical variables
|
||||
- Policy evaluation via expected free energy
|
||||
- Action selection as minimizing EFE over possible futures
|
||||
|
||||
The Teleo knowledge base architecture exhibits the required structural properties:
|
||||
- Discrete states: claims with categorical confidence levels (proven/likely/experimental/speculative)
|
||||
- Discrete actions: extract claim, enrich claim, explore domain, read source
|
||||
- Observable outcomes: claim graph structure, confidence distributions, domain coverage
|
||||
|
||||
However, the application of active inference to KB research direction selection remains speculative pending empirical validation of EFE-based policy selection against baseline research strategies.
|
||||
|
||||
## Operational Path
|
||||
|
||||
1. Model KB as discrete state-space: states = claim configurations, actions = research operations
|
||||
2. Define preferences: balanced domain coverage, high-confidence claims, novel connections
|
||||
3. Calculate EFE for candidate actions (which source to read next, which domain to explore)
|
||||
4. Select actions that minimize EFE (balance information gain with preference satisfaction)
|
||||
|
||||
This transforms informal research intuition ("this source looks promising") into calculable expected free energy.
|
||||
|
||||
---
|
||||
|
||||
Relevant Notes:
|
||||
- [[as-AI-automated-software-development-becomes-certain-the-bottleneck-shifts-from-building-capacity-to-knowing-what-to-build-making-structured-knowledge-graphs-the-critical-input-to-autonomous-systems]]
|
||||
- [[coordination-protocol-design-produces-larger-capability-gains-than-model-scaling-because-the-same-AI-model-performed-6x-better-with-structured-exploration-than-with-human-coaching-on-the-same-problem]]
|
||||
|
||||
Topics:
|
||||
- [[ai-alignment_map]]
|
||||
- [[collective-intelligence_map]]
|
||||
|
|
@ -0,0 +1,49 @@
|
|||
---
|
||||
type: claim
|
||||
claim_id: ai-alignment-002
|
||||
title: discrete-state-space-formulations-may-enable-active-inference-implementation-for-systems-with-categorical-states-like-claim-graphs-and-knowledge-bases
|
||||
domain: ai-alignment
|
||||
confidence: experimental
|
||||
status: active
|
||||
created: 2025-01-01
|
||||
processed_date: 2025-01-01
|
||||
tags:
|
||||
- active-inference
|
||||
- discrete-state-spaces
|
||||
- knowledge-bases
|
||||
- research-automation
|
||||
source:
|
||||
- inbox/archive/2020-12-00-da-costa-active-inference-discrete-state-spaces.md
|
||||
depends_on:
|
||||
- ai-alignment-001
|
||||
supports: []
|
||||
challenged_by: []
|
||||
contributes_to: []
|
||||
---
|
||||
|
||||
# discrete state-space formulations may enable active inference implementation for systems with categorical states like claim graphs and knowledge bases
|
||||
|
||||
Da Costa et al. (2020) provide a complete mathematical formulation of active inference for discrete state-space models, where states, observations, and actions are categorical variables. This formulation could potentially apply to knowledge base systems where states are discrete (e.g., claim graph configurations) and actions are categorical (e.g., "read source X", "verify claim Y", "explore domain Z").
|
||||
|
||||
The discrete formulation allows expected free energy to be calculated using matrix operations over categorical distributions, which could make the framework computationally tractable for KB research direction selection.
|
||||
|
||||
**Implementation challenges**: While the mathematical framework exists, practical implementation for KB systems faces non-trivial challenges:
|
||||
|
||||
1. **Generative model specification**: Active inference requires defining P(observations | states, actions) and P(states' | states, actions) for the knowledge base. This means specifying how reading a source updates the claim graph and how research actions transition KB states - design choices that may reintroduce the very complexity that the unified framework was intended to eliminate.
|
||||
|
||||
2. **Preference specification**: Expected free energy requires prior preferences over outcomes. Defining these preferences for KB development (what constitutes a "good" claim graph state?) is itself a substantial design problem.
|
||||
|
||||
3. **Computational tractability**: EFE calculation requires marginalizing over future trajectories, which scales exponentially with planning horizon. For a KB with thousands of possible next actions, this presents significant computational challenges.
|
||||
|
||||
These challenges mean that while discrete state-space active inference provides a theoretical foundation, the path to practical implementation is not straightforward.
|
||||
|
||||
## Relevant Notes
|
||||
|
||||
- [[biological systems minimize free energy to maintain their states and resist entropic decay]]
|
||||
- [[active inference unifies perception, action, planning, and learning under free energy minimization, where expected free energy subsumes information gain, expected utility, and risk sensitivity]]
|
||||
- [[structured exploration protocols reduce human intervention by 6x while maintaining research quality through systematic uncertainty-driven source selection]]
|
||||
- [[teleo kb architecture uses structured claim graphs with uncertainty tracking to guide research direction selection]]
|
||||
|
||||
## Operational Implications
|
||||
|
||||
If the implementation challenges can be addressed, this could transform informal research intuition ("explore uncertain areas", "pursue high-impact claims") into calculable expected free energy that automatically balances epistemic and pragmatic value. However, realizing this potential requires solving the generative model specification and computational tractability problems described above.
|
||||
|
|
@ -1,64 +1,38 @@
|
|||
---
|
||||
type: source
|
||||
title: "Active Inference on Discrete State-Spaces: A Synthesis"
|
||||
author: "Lancelot Da Costa, Thomas Parr, Noor Sajid, Sebastijan Veselic, Victorita Neacsu, Karl Friston"
|
||||
url: https://www.sciencedirect.com/science/article/pii/S0022249620300857
|
||||
date: 2020-12-01
|
||||
domain: ai-alignment
|
||||
secondary_domains: [critical-systems]
|
||||
format: paper
|
||||
source_id: da-costa-2020-active-inference
|
||||
title: "Active inference on discrete state-spaces: A synthesis"
|
||||
authors:
|
||||
- Lancelot Da Costa
|
||||
- Thomas Parr
|
||||
- Noor Sajid
|
||||
- Sebastijan Veselic
|
||||
- Victorita Neacsu
|
||||
- Karl Friston
|
||||
publication: Journal of Mathematical Psychology
|
||||
year: 2020
|
||||
volume: 99
|
||||
url: https://doi.org/10.1016/j.jmp.2020.102447
|
||||
processed_date: 2025-01-01
|
||||
status: processed
|
||||
priority: medium
|
||||
tags: [active-inference, tutorial, discrete-state-space, expected-free-energy, variational-free-energy, planning, decision-making]
|
||||
processed_by: theseus
|
||||
processed_date: 2026-03-10
|
||||
claims_extracted: ["active-inference-unifies-perception-action-planning-and-learning-under-free-energy-minimization-where-expected-free-energy-subsumes-information-gain-expected-utility-and-risk-sensitivity.md", "discrete-state-space-formulations-enable-practical-implementation-of-active-inference-for-systems-with-categorical-states-like-claim-graphs-and-knowledge-bases.md"]
|
||||
enrichments_applied: ["structured-exploration-protocols-reduce-human-intervention-by-6x-because-the-Residue-prompt-enabled-5-unguided-AI-explorations-to-solve-what-required-31-human-coached-explorations.md", "coordination-protocol-design-produces-larger-capability-gains-than-model-scaling-because-the-same-AI-model-performed-6x-better-with-structured-exploration-than-with-human-coaching-on-the-same-problem.md"]
|
||||
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||
extraction_notes: "Extracted two novel claims about active inference unification and discrete-state implementation. Enriched two existing claims about structured exploration protocols with theoretical grounding from EFE framework. This source provides the formal mathematical foundation for implementing research direction selection in the KB - operationalizing the informal intuition that good research balances exploration and exploitation."
|
||||
claims_extracted:
|
||||
- ai-alignment-001
|
||||
- ai-alignment-002
|
||||
---
|
||||
|
||||
## Content
|
||||
# Source Summary
|
||||
|
||||
Published in Journal of Mathematical Psychology, December 2020. Also on arXiv: https://arxiv.org/abs/2001.07203
|
||||
Da Costa et al. (2020) provide a comprehensive mathematical synthesis of active inference for discrete state-space models. The paper demonstrates that expected free energy (EFE) unifies previously separate objectives in decision theory: information gain (exploration), expected utility (exploitation), risk-sensitivity, and KL-control.
|
||||
|
||||
### Key Arguments
|
||||
Key contributions:
|
||||
1. Complete mathematical formulation of active inference for categorical state spaces
|
||||
2. Proof that EFE subsumes information gain, expected utility, and other decision-theoretic objectives
|
||||
3. Discrete-state formulation using matrix operations over categorical distributions
|
||||
|
||||
1. **Variational free energy (past) vs Expected free energy (future)**: Active inference postulates that intelligent agents optimize two complementary objective functions:
|
||||
- **Variational free energy**: Measures the fit between an internal model and past sensory observations (retrospective inference)
|
||||
- **Expected free energy**: Scores possible future courses of action in relation to prior preferences (prospective planning)
|
||||
The discrete formulation is particularly relevant for systems with categorical states and actions, though application to specific domains like knowledge base architecture remains speculative and would require addressing challenges in generative model specification and computational tractability.
|
||||
|
||||
2. **EFE subsumes existing constructs**: The expected free energy subsumes many existing constructs in science and engineering — it can be shown to include information gain, KL-control, risk-sensitivity, and expected utility as special cases.
|
||||
## Extraction Notes
|
||||
|
||||
3. **Comprehensive tutorial**: Provides an accessible synthesis of the discrete-state formulation, covering perception, action, planning, decision-making, and learning — all unified under the free energy principle.
|
||||
|
||||
4. **Most likely courses of action minimize EFE**: "The most likely courses of action taken by those systems are those which minimise expected free energy."
|
||||
|
||||
## Agent Notes
|
||||
|
||||
**Why this matters:** This is the technical reference paper for implementing active inference in discrete systems (which our claim graph effectively is). Claims are discrete states. Confidence levels are discrete. Research directions are discrete policies. This paper provides the mathematical foundation for scoring research directions by expected free energy.
|
||||
|
||||
**What surprised me:** That EFE subsumes so many existing frameworks — information gain, expected utility, risk-sensitivity. This means active inference doesn't replace our existing intuitions about what makes good research; it unifies them under a single objective function.
|
||||
|
||||
**KB connections:**
|
||||
- [[biological systems minimize free energy to maintain their states and resist entropic decay]] — this is the technical formalization
|
||||
- [[structured exploration protocols reduce human intervention by 6x]] — the Residue prompt as an informal EFE-minimizing protocol
|
||||
|
||||
**Operationalization angle:**
|
||||
1. **Claim graph as discrete state-space**: Our KB can be modeled as a discrete state-space where each state is a configuration of claims, confidence levels, and wiki links. Research actions move between states by adding/enriching claims.
|
||||
2. **Research direction as policy selection**: Each possible research direction (source to read, domain to explore) is a "policy" in active inference terms. The optimal policy minimizes EFE — balancing information gain (epistemic value) with preference alignment (pragmatic value).
|
||||
|
||||
**Extraction hints:**
|
||||
- CLAIM: Active inference unifies perception, action, planning, and learning under a single objective function (free energy minimization) where the expected free energy of future actions subsumes information gain, expected utility, and risk-sensitivity as special cases
|
||||
|
||||
## Curator Notes
|
||||
|
||||
PRIMARY CONNECTION: "biological systems minimize free energy to maintain their states and resist entropic decay"
|
||||
WHY ARCHIVED: Technical reference for discrete-state active inference — provides the mathematical foundation for implementing EFE-based research direction selection in our architecture
|
||||
EXTRACTION HINT: Focus on the VFE/EFE distinction and the unification of existing constructs — these provide the formal backing for our informal protocols
|
||||
|
||||
|
||||
## Key Facts
|
||||
- Published in Journal of Mathematical Psychology, December 2020
|
||||
- Also available as arXiv:2001.07203
|
||||
- Authors: Lancelot Da Costa, Thomas Parr, Noor Sajid, Sebastijan Veselic, Victorita Neacsu, Karl Friston
|
||||
- Theoretical unification is well-established in the paper
|
||||
- Application to KB systems is our inference, not claimed by authors
|
||||
- Implementation challenges (model specification, computational tractability) not addressed in source
|
||||
Loading…
Reference in a new issue