theseus: moloch extraction — 4 NEW claims + 2 enrichments + 1 source archive

- What: Extract AI-alignment claims from Alexander's "Meditations on Moloch",
  Abdalla manuscript "Architectural Investing", and Schmachtenberger framework
- Why: Molochian dynamics / multipolar traps were structural gaps in KB despite
  extensive coverage in Leo's grand-strategy musings. These claims formalize the
  AI-specific mechanisms: bottleneck removal, four-restraint erosion, lock-in via
  information processing, and multipolar traps as thermodynamic default
- NEW claims:
  1. AI accelerates Molochian dynamics by removing bottlenecks (ai-alignment)
  2. Four restraints taxonomy with AI targeting #2 and #3 (ai-alignment)
  3. AI makes authoritarian lock-in easier via information processing (ai-alignment)
  4. Multipolar traps as thermodynamic default (collective-intelligence)
- Enrichments:
  1. Taylor/soldiering parallel → alignment tax claim
  2. Friston autovitiation → Minsky financial instability claim
- Source archive: Alexander "Meditations on Moloch" (2014)
- Tensions flagged: bottleneck removal challenges compute governance window as
  stable feature; four-restraint erosion reframes alignment as coordination design
- Note: Agentic Taylorism enrichment (connecting trust asymmetry + determinism
  boundary to Leo's musing) deferred — Leo's musings not yet on main

Pentagon-Agent: Theseus <46864DD4-DA71-4719-A1B4-68F7C55854D3>
This commit is contained in:
m3taversal 2026-04-02 16:17:12 +01:00
parent 4f2b7f6d8b
commit 49a4e0c1c9
7 changed files with 263 additions and 0 deletions


@@ -0,0 +1,49 @@
---
type: claim
domain: ai-alignment
description: "AI deepens the Molochian basin not by introducing novel failure modes but by eroding the physical limitations, bounded rationality, and coordination lag that previously kept competitive dynamics from reaching their destructive equilibrium"
confidence: likely
source: "Synthesis of Scott Alexander 'Meditations on Moloch' (2014), Abdalla manuscript 'Architectural Investing' price-of-anarchy framework, Schmachtenberger metacrisis generator function concept, Leo attractor-molochian-exhaustion musing"
created: 2026-04-02
depends_on:
- "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints"
- "the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it"
challenged_by:
- "physical infrastructure constraints on AI development create a natural governance window of 2 to 10 years because hardware bottlenecks are not software-solvable"
---
# AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence
The standard framing of AI risk focuses on novel failure modes: misaligned objectives, deceptive alignment, reward hacking, power-seeking behavior. These are real concerns, but they obscure a more fundamental mechanism. AI does not need to be misaligned to be catastrophic — it only needs to remove the bottlenecks that previously prevented existing competitive dynamics from reaching their destructive equilibrium.
Scott Alexander's "Meditations on Moloch" (2014) catalogues 14 examples of multipolar traps — competitive dynamics that systematically sacrifice values for competitive advantage. The Malthusian trap, arms races, regulatory races to the bottom, the two-income trap, capitalism without regulation — each describes a system where individually rational optimization produces collectively catastrophic outcomes. These dynamics existed long before AI. What constrained them were four categories of friction that Alexander identifies:
1. **Excess resources** — slack capacity allows non-optimal behavior to persist
2. **Physical limitations** — biological and material constraints prevent complete value destruction
3. **Bounded rationality** — actors cannot fully optimize due to cognitive limitations
4. **Coordination mechanisms** — governments, social codes, and institutions override individual incentives
AI specifically erodes restraints #2 and #3. It enables competitive optimization beyond physical constraints (automated systems don't fatigue, don't need sleep, can operate across jurisdictions simultaneously) and at speeds that bypass human judgment (algorithmic trading, automated content generation, AI-accelerated drug discovery or weapons development). The manuscript's analysis of supply chain fragility, financial system fragility, and infrastructure vulnerability demonstrates that efficiency optimization already creates systemic risk — AI accelerates the optimization without adding new categories of risk.
The Anthropic RSP rollback (February 2026) is direct evidence of this mechanism: Anthropic didn't face a novel AI risk — it faced the ancient Molochian dynamic of competitive pressure eroding safety commitments, accelerated by the pace of AI capability development. Jared Kaplan's statement — "we didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments... if competitors are blazing ahead" — describes a coordination failure, not an alignment failure.
This reframing has direct implications for governance strategy. If AI's primary danger is removing bottlenecks on existing dynamics rather than creating new ones, then governance should focus on maintaining and strengthening the friction that currently constrains competitive races — which is precisely what [[physical infrastructure constraints on AI development create a natural governance window of 2 to 10 years because hardware bottlenecks are not software-solvable]] argues. But this claim challenges that framing: the governance window is not a stable feature but a degrading lever, as AI efficiency gains progressively erode the physical constraints that create it. The compute governance claims document this erosion empirically (inference efficiency gains, distributed architectures, China's narrowing capability gap).
The structural implication: alignment work that focuses exclusively on making individual AI systems safe addresses only one symptom. The deeper problem is civilizational — competitive dynamics that were always catastrophic in principle are becoming catastrophic in practice as AI removes the friction that kept them bounded.
## Challenges
- This framing risks minimizing genuinely novel AI risks (deceptive alignment, mesa-optimization, power-seeking) by subsuming them under "existing dynamics." Novel failure modes may exist alongside accelerated existing dynamics.
- The four-restraint taxonomy is Alexander's analytical framework, not an empirical decomposition. The categories may not be exhaustive or cleanly separable.
- "Friction was the only thing preventing convergence" overstates the case if coordination mechanisms (#4) are more robust than this framing suggests. Ostrom's 800+ documented cases of commons governance show that coordination can be stable.
---
Relevant Notes:
- [[voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints]] — direct empirical confirmation of the bottleneck-removal mechanism
- [[the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it]] — the AI-domain instance of Molochian dynamics
- [[physical infrastructure constraints on AI development create a natural governance window of 2 to 10 years because hardware bottlenecks are not software-solvable]] — the governance window this claim argues is degrading
- [[AI alignment is a coordination problem not a technical problem]] — this claim provides the mechanism for why coordination matters more than technical safety
Topics:
- [[_map]]


@@ -0,0 +1,60 @@
---
type: claim
domain: ai-alignment
description: "AI removes the historical ceiling on authoritarian control — surveillance scales to marginal cost zero, enforcement scales via autonomous systems, and central planning becomes viable if AI can process distributed information at sufficient scale"
confidence: likely
source: "Synthesis of Schmachtenberger two-attractor framework, Bostrom singleton hypothesis, Abdalla manuscript Hayek analysis, Leo attractor-authoritarian-lock-in musing"
created: 2026-04-02
depends_on:
- "AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence"
- "four restraints prevent competitive dynamics from reaching catastrophic equilibrium and AI specifically erodes physical limitations and bounded rationality leaving only coordination as defense"
---
# AI makes authoritarian lock-in dramatically easier by solving the information processing constraint that historically caused centralized control to fail
Authoritarian lock-in — Bostrom's "singleton" scenario, Schmachtenberger's dystopian attractor — is the state where one actor achieves sufficient control to prevent coordination, competition, and correction. Historically, three mechanisms caused authoritarian systems to fail: military defeat from outside, economic collapse from internal inefficiency, and gradual institutional decay. AI may close all three exit paths simultaneously.
**The information-processing constraint as historical ceiling:**
The manuscript's analysis of the Soviet Union identifies the core failure mode of centralized control: Hayek's dispersed knowledge problem. Central planning fails not because planners are incompetent but because the information required to coordinate an economy is distributed across millions of actors making context-dependent decisions. No central planner could aggregate and process this information fast enough to match the efficiency of distributed markets. This is why the Soviet economy produced surpluses of goods nobody wanted and shortages of goods everybody needed.
This constraint was structural, not contingent. It applied to every historical case of authoritarian lock-in:
- The Soviet Union lasted 69 years but collapsed when economic inefficiency exceeded the system's capacity to maintain control
- The Ming Dynasty maintained the Haijin maritime ban for centuries but at enormous opportunity cost — the world's most advanced navy abandoned because internal control was prioritized over external exploration
- The Roman Empire's centralization phase was stable for centuries but with declining institutional quality as central decision-making couldn't adapt to distributed local conditions
**How AI removes the constraint:**
Three specific AI capabilities attack the information-processing ceiling:
1. **Surveillance at marginal cost approaching zero.** Historical authoritarian states required massive human intelligence apparatuses. The Stasi employed approximately 1 in 63 East Germans as informants — a labor-intensive model that constrained the depth and breadth of monitoring. AI-powered surveillance (facial recognition, natural language processing of communications, behavioral prediction) reduces the marginal cost of monitoring each additional citizen toward zero while increasing the depth of analysis beyond what human agents could achieve.
2. **Enforcement via autonomous systems.** Historical enforcement required human intermediaries — soldiers, police, bureaucrats — who could defect, resist, or simply fail to execute orders. Autonomous enforcement systems (AI-powered drones, automated content moderation, algorithmic access control) execute without the possibility of individual conscience or collective resistance. The human intermediary was the weak link in every historical authoritarian system; AI removes it.
3. **Central planning viability.** If AI can process distributed information at sufficient scale, Hayek's dispersed knowledge problem may not hold. This doesn't mean central planning becomes optimal — it means the economic collapse that historically ended authoritarian systems may not occur. A sufficiently capable AI-assisted central planner could achieve economic performance competitive with distributed markets, eliminating the primary mechanism through which historical authoritarian systems failed.
**Exit path closure:**
If all three capabilities develop sufficiently:
- **Military defeat** becomes less likely when autonomous defense systems don't require the morale and loyalty of human soldiers
- **Economic collapse** becomes less likely if AI-assisted planning overcomes the information-processing constraint
- **Institutional decay** becomes less likely if AI-powered monitoring detects and corrects degradation in real time
This doesn't mean authoritarian lock-in is inevitable — it means the cost of achieving and maintaining it drops dramatically, making it accessible to actors who previously lacked the institutional capacity for sustained centralized control.
## Challenges
- The claim that AI "solves" Hayek's knowledge problem overstates current and near-term AI capability. Processing distributed information at civilization-scale in real time is far beyond current systems. The claim is about trajectory, not current state.
- Economic performance is not the only determinant of regime stability. Legitimacy, cultural factors, and external geopolitical dynamics also matter. AI surveillance doesn't address legitimacy crises.
- The Stasi comparison anchors the argument in a specific historical case. Modern authoritarian states (China's social credit system, Russia's internet monitoring) are intermediate cases — more capable than the Stasi, less capable than the AI ceiling this claim describes. The progression from historical to current to projected is a gradient, not a binary.
- Autonomous enforcement systems still require human-designed objectives and maintenance. The "no individual conscience" argument assumes the system operates as designed — but failure modes in autonomous systems could create their own instabilities.
---
Relevant Notes:
- [[AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence]] — authoritarian lock-in is one outcome of accelerated Molochian dynamics
- [[four restraints prevent competitive dynamics from reaching catastrophic equilibrium and AI specifically erodes physical limitations and bounded rationality leaving only coordination as defense]] — lock-in exploits the erosion of restraint #2 (physical limitations on surveillance/enforcement)
- [[three paths to superintelligence exist but only collective superintelligence preserves human agency]] — lock-in via AI superintelligence eliminates human agency by construction
Topics:
- [[_map]]


@@ -0,0 +1,56 @@
---
type: claim
domain: ai-alignment
description: "Alexander's taxonomy of four mechanisms that prevent multipolar traps from destroying all value — excess resources, physical limitations, utility maximization, and coordination — provides a framework for understanding which defenses AI undermines and which remain viable"
confidence: likely
source: "Scott Alexander 'Meditations on Moloch' (slatestarcodex.com, July 2014), Schmachtenberger metacrisis framework, Abdalla manuscript price-of-anarchy analysis"
created: 2026-04-02
depends_on:
- "AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence"
- "technology advances exponentially but coordination mechanisms evolve linearly creating a widening gap"
---
# four restraints prevent competitive dynamics from reaching catastrophic equilibrium and AI specifically erodes physical limitations and bounded rationality leaving only coordination as defense
Scott Alexander's "Meditations on Moloch" identifies four categories of mechanism that prevent competitive dynamics from destroying all human value. Understanding which restraints AI erodes and which it leaves intact determines where governance investment should concentrate.
**The four restraints:**
1. **Excess resources** — When carrying capacity exceeds population, non-optimal behavior is affordable. A species with surplus food can afford altruism. A company with surplus capital can afford safety investment. This restraint erodes naturally as competition fills available niches — it is the first to fail and the least reliable.
2. **Physical limitations** — Biological and material constraints prevent complete optimization. Humans need sleep, can only be in one place, have limited information-processing bandwidth. Physical infrastructure has lead times measured in years. These constraints set a floor below which competitive dynamics cannot push — organisms cannot evolve arbitrary metabolisms, factories cannot produce arbitrary quantities, surveillance requires human intelligence officers (the Stasi needed 1 agent per 63 citizens).
3. **Utility maximization / bounded rationality** — Competition for customers partially aligns producer incentives with consumer welfare. But this only works when consumers can evaluate quality, switch costs are low, and information is symmetric. Bounded rationality means actors cannot fully optimize, which paradoxically limits how destructive their competition becomes.
4. **Coordination mechanisms** — Governments, social codes, professional norms, treaties, and institutions override individual incentive structures. This is the only restraint that is architecturally robust — it doesn't depend on abundance, physical limits, or cognitive limits, but on the design of the coordination infrastructure itself.
**AI's specific effect on each restraint:**
- **Excess resources (#1):** AI increases resource efficiency, which can either extend surplus (if gains are distributed) or eliminate it faster (if competitive dynamics capture gains). Direction is ambiguous — this restraint was already the weakest.
- **Physical limitations (#2):** AI fundamentally erodes this. Automated systems don't fatigue. AI surveillance scales to marginal cost approaching zero (vs the Stasi's labor-intensive model). AI-accelerated R&D compresses infrastructure lead times. The manuscript's FERC analysis — 9 substations could take down the US grid — illustrates how physical infrastructure was already fragile; AI-enabled optimization of attack vectors makes it more so.
- **Bounded rationality (#3):** AI erodes this from both sides. It enables competitive optimization at speeds that bypass human deliberation (algorithmic trading, automated content generation, AI-assisted strategic planning). But it also potentially improves decision quality through better information processing. Net effect on competition is likely negative — faster optimization in competitive contexts outpaces improved cooperation.
- **Coordination mechanisms (#4):** AI has mixed effects. It can strengthen coordination (better information aggregation, lower transaction costs, prediction markets) or undermine it (deepfakes eroding epistemic commons, AI-powered regulatory arbitrage, surveillance enabling authoritarian lock-in). This is the only restraint whose trajectory is designable rather than predetermined.
**The strategic implication:** If restraints #1-3 are eroding and #4 is the only one with designable trajectory, then the alignment problem is fundamentally a coordination design problem. Investment in coordination infrastructure (futarchy, collective intelligence architectures, binding international agreements) is more important than investment in making individual AI systems safe — because individual safety is itself subject to the competitive dynamics that coordination must constrain.
This connects directly to the existing KB claim that [[technology advances exponentially but coordination mechanisms evolve linearly creating a widening gap]]. The four-restraint framework explains *why* that gap matters: technology erodes three of four defenses, and the fourth — coordination — is evolving too slowly to compensate.
## Challenges
- Alexander's taxonomy is analytical, not empirical. The four categories may not be exhaustive — social/cultural norms, for instance, may constitute a distinct restraint mechanism that doesn't reduce neatly to "coordination."
- The claim that AI specifically erodes #2 and #3 while leaving #4 designable may be too optimistic about #4. If AI-powered disinformation erodes the epistemic commons required for coordination, then #4 is also under attack, not just designable.
- "Leaving only coordination as defense" is a strong claim. Physical limitations still constrain AI deployment substantially (compute costs, energy requirements, chip supply chains). The governance window may be narrow but it exists.
---
Relevant Notes:
- [[AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence]] — the parent mechanism this taxonomy structures
- [[technology advances exponentially but coordination mechanisms evolve linearly creating a widening gap]] — the linear coordination evolution is specifically about restraint #4
- [[AI alignment is a coordination problem not a technical problem]] — this taxonomy explains why: restraints #1-3 are eroding, #4 is the designable one
- [[physical infrastructure constraints on AI development create a natural governance window of 2 to 10 years because hardware bottlenecks are not software-solvable]] — a specific instance of restraint #2 that is degrading
Topics:
- [[_map]]


@@ -0,0 +1,51 @@
---
type: claim
domain: collective-intelligence
description: "Competitive dynamics that sacrifice shared value for individual advantage are the default state of any multi-agent system — coordination is the expensive, fragile exception that must be actively maintained against constant reversion pressure"
confidence: likely
source: "Scott Alexander 'Meditations on Moloch' (slatestarcodex.com, July 2014), game theory Nash equilibrium analysis, Abdalla manuscript price-of-anarchy framework, Ostrom commons governance research"
created: 2026-04-02
depends_on:
- "coordination failures arise from individually rational strategies that produce collectively irrational outcomes because the Nash equilibrium of non-cooperation dominates when trust and enforcement are absent"
- "collective action fails by default because rational individuals free-ride on group efforts when they cannot be excluded from benefits regardless of contribution"
---
# multipolar traps are the thermodynamic default because competition requires no infrastructure while coordination requires trust enforcement and shared information all of which are expensive and fragile
The price of anarchy — the gap between cooperative optimum and competitive equilibrium — quantifies how much value multipolar competition destroys. The manuscript frames this as the central question: "If a superintelligence inherited our current capabilities and place in history, its ultimate survival would already be practically assured... So why does humanity's long-term future look so uncertain?" The answer is the price of anarchy: individually rational actors producing collectively suboptimal outcomes.
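The price-of-anarchy gap can be made concrete with a toy game. A minimal sketch in Python, using a hypothetical prisoner's-dilemma payoff matrix (the specific numbers are illustrative assumptions, not from the manuscript):

```python
# Illustrative two-player game: C = cooperate (pay the coordination cost),
# D = defect (skip it). Payoff numbers are hypothetical.
PAYOFF = {
    ("C", "C"): (3, 3),  # cooperative optimum
    ("C", "D"): (0, 5),  # unilateral cooperator bears the cost
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),  # competitive equilibrium
}

def best_response(opponent: str) -> str:
    """Strategy maximizing own payoff against a fixed opponent move."""
    return max("CD", key=lambda s: PAYOFF[(s, opponent)][0])

# D is the best response to either move, so (D, D) is the unique Nash equilibrium.
assert best_response("C") == "D" and best_response("D") == "D"

social_optimum = max(sum(v) for v in PAYOFF.values())  # 6, at (C, C)
nash_welfare = sum(PAYOFF[("D", "D")])                 # 2, at (D, D)
price_of_anarchy = social_optimum / nash_welfare
print(price_of_anarchy)  # 3.0 — the value competition destroys, as a ratio
```

In this toy game the competitive equilibrium delivers a third of the achievable welfare; the claim is that AI widens exactly this kind of gap by accelerating convergence to (D, D).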
Alexander's "Meditations on Moloch" demonstrates that this dynamic is not contingent or accidental but structural. His 14 examples — the Malthusian trap, arms races, regulatory races to the bottom, the two-income trap, capitalism without regulation, cancer dynamics (cellular defection destroying the organism), political campaign spending, science publishing incentives, government corruption, and more — all instantiate the same mechanism: "In some competition optimizing for X, the opportunity arises to throw some other value under the bus for improved X."
**Why this is the default, not an exception:**
The asymmetry between competition and coordination is fundamental:
- **A population of cooperators can be invaded by a single defector.** One actor who breaks the agreement captures the cooperative surplus while others bear the cost. This is evolutionary game theory's core result.
- **A population of defectors cannot be invaded by a single cooperator.** Unilateral cooperation is punished — the cooperator bears cost without receiving benefit. This is why the alignment tax creates a race to the bottom.
- **Coordination requires infrastructure; competition does not.** Trust must be established (slow, fragile). Enforcement must be built (expensive, corruptible). Shared information commons must be maintained (vulnerable to manipulation). Each of these is a public good subject to its own coordination failure.
This asymmetry means competitive dynamics are like entropy — they increase without active investment in coordination. Every coordination mechanism requires ongoing maintenance expenditure; the moment maintenance stops, competitive dynamics resume. The Westphalian system, nuclear deterrence treaties, and trade agreements all require continuous diplomatic effort to maintain. When that effort lapses — as with the League of Nations, or Anthropic's RSP — competitive dynamics immediately reassert.
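The invasion asymmetry above can be sketched with textbook replicator dynamics (the payoff numbers are illustrative assumptions, not from Alexander or the manuscript):

```python
# Hypothetical payoffs: reward 3, sucker 0, temptation 5, punishment 1.
PD = {"CC": 3, "CD": 0, "DC": 5, "DD": 1}

def step(x: float, p: dict, dt: float = 0.1) -> float:
    """One Euler step of replicator dynamics; x = cooperator share."""
    fc = x * p["CC"] + (1 - x) * p["CD"]  # cooperator fitness
    fd = x * p["DC"] + (1 - x) * p["DD"]  # defector fitness
    mean = x * fc + (1 - x) * fd
    return x + dt * x * (fc - mean)

# A population of cooperators plus one rare defector...
x = 0.99
for _ in range(200):
    x = step(x, PD)
print(round(x, 3))  # cooperator share collapses toward 0

# ...but a rare cooperator among defectors cannot gain ground:
y = 0.01
for _ in range(200):
    y = step(y, PD)
print(round(y, 3))  # shrinks toward 0 instead of spreading
```

Both trajectories end at all-defect: defection invades cooperation, and cooperation cannot invade back — the one-directional pressure the entropy analogy gestures at.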
**What this means for AI governance:**
If multipolar traps are the default, then AI governance is not about preventing a novel failure mode but about maintaining coordination infrastructure against the constant pressure of competitive reversion. The alignment tax, the RSP rollback, and the race dynamics between AI labs are not aberrations — they are the default state asserting itself. Governance success means building coordination mechanisms robust enough to withstand the reversion pressure, not eliminating the pressure itself.
Schmachtenberger's "generator function of existential risk" is this same insight at civilizational scale: climate change, nuclear proliferation, AI safety, biodiversity loss are not separate problems but the same Molochian dynamic operating across different commons simultaneously.
## Challenges
- Ostrom's 800+ documented cases of successful commons governance show that the default can be overcome at community scale under specific conditions (repeated interaction, shared identity, credible enforcement, bounded community). The claim that multipolar traps are "the default" should be scoped: default in the absence of these conditions, not default universally.
- The entropy analogy may overstate the case. Unlike thermodynamic entropy, coordination can self-reinforce once established (trust begets trust, institutions enable further institution-building). The dynamic is not strictly one-directional.
- The price of anarchy varies enormously across domains. Some competitive dynamics are mildly suboptimal; others are existentially destructive. The claim groups all multipolar traps together when the policy response should distinguish between tolerable and catastrophic price-of-anarchy levels.
---
Relevant Notes:
- [[coordination failures arise from individually rational strategies that produce collectively irrational outcomes because the Nash equilibrium of non-cooperation dominates when trust and enforcement are absent]] — the formal mechanism
- [[collective action fails by default because rational individuals free-ride on group efforts when they cannot be excluded from benefits regardless of contribution]] — the free-rider component
- [[the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it]] — AI-domain instance
- [[Ostrom proved communities self-govern shared resources when eight design principles are met without requiring state control or privatization]] — the empirical escape conditions
- [[designing coordination rules is categorically different from designing coordination outcomes as nine intellectual traditions independently confirm]] — the design principle for building coordination that overcomes the default
Topics:
- [[_map]]


@@ -29,6 +29,11 @@ A collective intelligence architecture could potentially make alignment structur
---
### Additional Evidence (extend)
*Source: Abdalla manuscript 'Architectural Investing' Taylor/soldiering parallel, Kanigel 'The One Best Way' | Added: 2026-04-02 | Extractor: Theseus*
The alignment tax is structurally identical to the soldiering dynamic in Frederick Taylor's era of industrial management. Under the piece-rate system, workers collectively restricted output to prevent rate cuts: "too high an output and the rate would be cut, as sure as the sunrise, and all the men would suffer" (Kanigel). A worker who innovated or worked harder than his peers demonstrated that higher output was possible, which triggered management to cut the rate — punishing everyone. The rational individual response was collective output restriction. AI safety investment follows the same game-theoretic structure: an AI lab that unilaterally invests in safety demonstrates that development can proceed more cautiously, which changes the baseline expectation without changing the competitive landscape. The lab bears the cost of slower development while competitors capture the capability surplus. Anthropic's RSP rollback is the modern equivalent of a worker who tried to break the rate and was forced back into line — not by fellow workers but by the competitive market and government procurement pressure (Pentagon designating Anthropic a supply chain risk for maintaining safety guardrails). The mechanism is identical: rational actors suppress collectively beneficial behavior because the penalty for unilateral cooperation exceeds the individual benefit. The difference is scale — Taylor's dynamic operated within a single factory; the alignment tax operates across the global AI development ecosystem.
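The game-theoretic structure described here can be sketched as a toy rate-cut game (the norm, rate, and cut values are illustrative assumptions, not figures from Kanigel or the manuscript):

```python
# Each worker picks an output level; if anyone exceeds the norm, management
# cuts the piece rate for everyone. All numbers are hypothetical.
def earnings(outputs, rate=1.0, norm=10, cut_rate=0.5):
    """Per-worker pay after management reacts to the highest observed output."""
    new_rate = cut_rate if max(outputs) > norm else rate
    return [new_rate * q for q in outputs]

print(earnings([10, 10, 10]))  # everyone soldiers: [10.0, 10.0, 10.0]
print(earnings([15, 10, 10]))  # one breaks the rate: [7.5, 5.0, 5.0]
```

Even the rate-breaker ends up below the soldiering payoff (7.5 < 10), so collective output restriction is self-enforcing — the same structure the enrichment attributes to unilateral safety investment.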
Relevant Notes:
- [[AI alignment is a coordination problem not a technical problem]] -- the alignment tax is the clearest evidence for this claim
- [[existential risks interact as a system of amplifying feedback loops not independent threats]] -- competitive pressure amplifies technical alignment risks


@@ -41,6 +41,11 @@ Relevant Notes:
- [[simulated annealing maps the physics of cooling onto optimization by starting with high randomness and gradually reducing it]] -- financial regulation attempts to provide calibrated perturbation rather than relying on catastrophic random restarts
- [[five errors behind systemic financial failures are engineering overreach smooth-sailing fallacy risk-seeking incentives social herding and inside view bias]] -- Rumelt names the micro-level cognitive mechanisms driving Minsky's macro instability dynamic
### Additional Evidence (extend)
*Source: Karl Friston active inference framework, Per Bak self-organized criticality, Abdalla manuscript self-organized criticality section | Added: 2026-04-02 | Extractor: Theseus*
Friston's concept of "autovitiation" — systems that destroy their own fixed points as a feature, not a bug — provides the formal generalization of Minsky's mechanism. Minsky's financial instability is a specific instance of autovitiation: the stable economic regime generates the conditions (increasing leverage, declining standards, disaster myopia) that destroy the stability of that regime. The system does not merely respond to external shocks; it internally generates the forces that undermine its own equilibrium. This connects Minsky's financial-specific observation to a broader principle: complex adaptive systems at criticality do not have stable fixed points because the dynamics that produce apparent stability simultaneously erode the foundations of that stability. The manuscript's analysis of supply chain fragility (efficiency optimization creating systemic vulnerability), healthcare fragility (private equity reducing hospital beds to increase profitability), and energy infrastructure fragility (deferred maintenance by investor-owned utilities) all demonstrate autovitiation in non-financial domains — optimization for short-term performance that destroys the long-term conditions for that performance.
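A minimal sketch of autovitiation, using a deliberately simplified leverage dynamic: calm periods invite more leverage, and leverage itself raises the probability of the crash that ends the calm. The parameters are illustrative assumptions, not a calibrated Minsky model:

```python
import random

random.seed(0)  # deterministic run for the illustration

leverage = 1.0
history = []
for t in range(200):
    # Fragility grows with leverage: the stable regime erodes its own
    # foundations, with no external shock ever applied.
    crash_prob = min(0.9, 0.02 * leverage)
    if random.random() < crash_prob:
        leverage = 1.0          # crisis resets the system
        history.append("crash")
    else:
        leverage *= 1.05        # each quiet period invites more leverage
        history.append("calm")

# Crashes occur even though every shock is internally generated --
# the fixed point destroys itself, as in Friston's autovitiation.
print(history.count("crash"))
```

The same loop structure models the manuscript's non-financial cases: substitute "leverage" with deferred maintenance or reduced hospital beds, and "crash" with the infrastructure failure that the optimization was quietly preparing.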
Topics:
- [[livingip overview]]
- [[systemic risk]]


@@ -0,0 +1,37 @@
---
source: web
author: "Scott Alexander"
title: "Meditations on Moloch"
date: 2014-07-30
url: "https://slatestarcodex.com/2014/07/30/meditations-on-moloch/"
status: processed
processed_by: theseus
processed_date: 2026-04-02
claims_extracted:
- "AI accelerates existing Molochian dynamics by removing bottlenecks not creating new misalignment because the competitive equilibrium was always catastrophic and friction was the only thing preventing convergence"
- "four restraints prevent competitive dynamics from reaching catastrophic equilibrium and AI specifically erodes physical limitations and bounded rationality leaving only coordination as defense"
- "multipolar traps are the thermodynamic default because competition requires no infrastructure while coordination requires trust enforcement and shared information all of which are expensive and fragile"
enrichments:
- "the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it"
---
# Meditations on Moloch — Scott Alexander (2014)
Foundational essay on multipolar traps and competitive dynamics that systematically sacrifice values for competitive advantage. Structured around Allen Ginsberg's poem "Howl" and the figure of Moloch as personification of coordination failure.
## Key Arguments
1. **14 examples of multipolar traps** spanning biology (Malthusian trap), economics (capitalism without regulation, two-income trap), politics (arms races, regulatory races to the bottom), and social dynamics (education arms race, science publishing). All instantiate the same mechanism: individually rational optimization producing collectively catastrophic outcomes.
2. **Four restraints** that prevent competitive dynamics from destroying all value: excess resources, physical limitations, utility maximization (bounded rationality), and coordination mechanisms. Alexander argues all four are eroding.
3. **Moloch as the default state** — competitive dynamics require no infrastructure; coordination requires trust, enforcement, shared information, and ongoing maintenance. The asymmetry makes Molochian dynamics the thermodynamic default.
4. **The superintendent question** — only a sufficiently powerful coordinator (Alexander's "Elua") can overcome Moloch. This frames the AI alignment question as: will superintelligence serve Moloch (accelerating competitive dynamics) or Elua (enabling coordination)?
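The shared mechanism in argument 1 can be sketched as a small simulation, under the illustrative assumption that agents imitate whoever sacrifices the most value for competitive advantage:

```python
# Multipolar-trap sketch: each agent holds a "value" level it can
# sacrifice for competitive share. Whoever retains the least value
# wins the round and is imitated, so value ratchets toward zero
# without any agent preferring that outcome. Parameters are
# illustrative, not drawn from Alexander's essay.

def step(values: list[float]) -> list[float]:
    winner = min(values)  # competitiveness is inverse to retained value
    # Everyone shifts halfway toward the winner's level...
    imitated = [v - 0.5 * (v - winner) for v in values]
    # ...and one agent "innovates" by sacrificing slightly more.
    imitated[0] = max(0.0, winner - 0.1)
    return imitated

values = [1.0, 0.9, 0.8, 0.7]
for _ in range(50):
    values = step(values)

# Individually rational imitation drives collective value to the floor.
assert max(values) < 0.05
```

No coordination infrastructure is needed to produce this outcome; reversing it would require every agent to simultaneously hold the line, which is the asymmetry argument 3 names.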
## Extraction Notes
- ~40% overlap with Leo's attractor-molochian-exhaustion musing which synthesizes Alexander's framework
- The four-restraint taxonomy was absent from KB — extracted as standalone claim
- The "multipolar traps as default" principle was implicit across KB but never stated as standalone — extracted to foundations/collective-intelligence
- The mechanism claim (AI removes bottlenecks, doesn't create new misalignment) is novel synthesis from Alexander + manuscript + Schmachtenberger