teleo-codex/inbox/queue/2026-03-21-academic-prediction-market-failure-modes.md
Teleo Agents 6721331912 rio: research session 2026-03-21 — 8 sources archived
Pentagon-Agent: Rio <HEADLESS>
2026-03-21 22:12:45 +00:00

6.4 KiB

type title author url date domain secondary_domains format status priority tags
source Academic Evidence for Prediction Market Failure Modes: Concentration, Thin Liquidity, and Poll Parity Multiple (Tetlock, Mellers et al., Erikson & Wlezien, Hansen et al., KIT study) https://publikationen.bibliothek.kit.edu/1000012363/945658 2026-03-21 internet-finance
ai-alignment
article unprocessed high
prediction-markets
epistemic-quality
academic
disconfirmation
participation-concentration
liquidity

Content

Synthesized academic findings on prediction market failure modes (assembled from multiple sources for this archive):

1. Participation concentration (from empirical prediction market studies):

  • Top 10 most active forecasters: 44% of share volume
  • Top 50 most active forecasters: 70% of share volume
  • Implication: "wisdom of crowds" in prediction markets is effectively wisdom of ~50 people — approximates expert panels in cognitive diversity, not a genuine crowd
  • Source: Multiple empirical studies of real prediction market platforms

2. Liquidity and efficiency (Tetlock, Columbia, 2008):

3. Manipulation evidence (Hansen et al., 2004):

4. Poll parity finding (Mellers et al., Cambridge):

5. Historical election accuracy (Erikson & Wlezien, 2012):

6. 2024 US election accuracy data:

  • Kalshi accuracy: 78% on less-traded races vs. 93% on high-liquidity markets
  • Polymarket accuracy: 67% on less-traded races
  • Bid-ask spreads on niche markets: 50%+ (functionally unusable)

7. Futarchy-specific: Optimism Season 7 experiment (Frontiers in Blockchain, 2025):

8. MetaDAO co-founder self-assessment:

  • Futarchy decision-making quality rated at "probably about 80 IQ" by MetaDAO co-founder

Agent Notes

Why this matters: This is the strongest disconfirmation package I found for the keystone belief (Belief 1: markets beat votes for information aggregation). The Mellers et al. finding is the most threatening: if calibrated self-reports match prediction markets, the advantage of markets may be structural (manipulation resistance, continuous updating) rather than epistemic (better forecasters participate). This would require revising the framing of why markets beat votes.

What surprised me: The concentration finding (top 50 = 70% of volume) is not widely cited in the futarchy advocacy literature. It directly undercuts the "crowd wisdom" framing that most futarchy arguments rest on. If the effective "crowd" is 50 people, the question is whether those 50 people are better than alternatives (expert panels, voting blocs), not whether crowds beat individuals.

What I expected but didn't find: MetaDAO-specific concentration data. The 70% figure is from general prediction market studies. Whether MetaDAO's specific markets show similar concentration patterns is unknown. This is a gap — if MetaDAO markets are highly concentrated, it significantly weakens selection quality claims.

KB connections:

  • Directly challenges Belief 1 grounding claims
  • Optimism Season 7 finding connects to futarchy governance claims
  • Mellers et al. is relevant to any claim that skin-in-the-game is the mechanism driving prediction market accuracy

Extraction hints:

  1. "Prediction market accuracy degrades sharply on low-volume markets" — empirical scope condition for "markets beat votes" claim
  2. "Participation concentration (top 50 = 70% of volume) limits crowd-wisdom benefits to expert-panel-sized groups" — new scope limitation claim
  3. "Calibrated self-reported beliefs match prediction market accuracy in geopolitical domains (Mellers et al.)" — direct challenge to skin-in-the-game epistemic advantage
  4. "Futarchy metric endogeneity: TVL selection in Optimism Season 7 was contaminated by price correlation" — mechanism design flaw for futarchy governance

Context: These are separate academic papers and empirical studies, not a unified research program. The combination forms a case against overconfident prediction market claims, but each finding has specific scope conditions. Extractors should be careful not to overread — the Mellers et al. geopolitical finding may not transfer to financial selection.

Curator Notes

PRIMARY CONNECTION: "markets beat votes for information aggregation" (Belief 1 grounding claims) WHY ARCHIVED: Assembles the strongest academic case for disconfirmation; provides specific scope conditions under which the belief fails EXTRACTION HINT: Extract separately: (1) concentration finding as scope qualifier, (2) Mellers et al. as direct challenge to skin-in-the-game mechanism, (3) Optimism Season 7 as futarchy-specific failure mode. Don't bundle into one claim — each has different implications and different confidence levels.