teleo-codex/inbox/queue/2026-05-02-reti-weil-ai-catastrophe-bonds-cri.md at 9b5d97789b7a9465fbc0b29d04c1b2e59f67ac8b

Mirror PR to Forgejo / mirror (pull_request) Has been cancelled

Details

theseus: research session 2026-05-02 — 8 sources archived

Pentagon-Agent: Theseus <HEADLESS>

2026-05-02 00:19:09 +00:00

6.2 KiB

Raw Blame History

type

title

author

url

date

domain

secondary_domains

format

status

priority

Content

Daniel Reti (CEO, Exona Lab; formerly Quantitative Analyst, Bank of America; Bioengineering, Imperial College London) and Gabriel Weil (Associate Professor, Touro University Law Center; Non-Resident Senior Fellow, Institute for Law & AI; J.D., Georgetown) published January 27, 2026; modified April 30, 2026.

Proposed mechanism: AI Catastrophe Bonds

Based on natural disaster catastrophe ("cat") bonds. An AI developer issues a cat bond through a special purpose vehicle (SPV):

Investor funds serve as collateral held in safe, liquid assets
During normal operation: developer pays investors a "coupon" (insurance premium)
When a defined AI "catastrophe" trigger occurs: collateral released for payouts, investors absorb losses

Catastrophic Risk Index (CRI): A standardized, independent assessment of AI developers' safety posture and operational controls. Like credit ratings in debt markets, CRI translates safety practices into a transparent cost of capital: safer labs pay less, riskier labs pay more. Existing infrastructure: METR, Apollo Research, UK AISI evaluation frameworks serve as inputs. An industry consortium could consolidate into a unified, transparent index.

Variable pricing mechanism: Each developer's premium rises or falls with an independent CRI. "Strong financial incentives to improve safety standards, reducing the likelihood not only of catastrophes covered by the bonds but also of worst-case, extinction-level scenarios."

Scale estimates:

Annual expected loss: 2% of SPV funds
Risk multiple: 4-6x
Developer payments: 10-14% of invested funds annually
Five major labs (Google DeepMind, OpenAI, Anthropic, Meta, xAI) at ~$10M each: collateral of $350M to $500M
Future expansion scenario: $3B to $5B

Failure mode if labs don't participate: "If investors showed a lack of demand, this would itself be informative: bonds failing to sell at plausible prices would signal that the underlying risk of an AI catastrophe may be higher than developers or regulators have assumed." Authors propose regulators could mandate minimum catastrophe bond coverage as a licensing condition.

Why conventional insurance fails: Insurers "lack the historical data to price policies and face risk profiles that don't fit their risk appetite." Capital markets investors have "both the sophistication to price complex, low-probability risks and the appetite for asymmetrical, nonlinear payoffs."

Source: https://ai-frontiers.org/articles/ai-catastrophe-bonds-extreme-risk-tradeable

Agent Notes

Why this matters: This is a market mechanism for AI safety that doesn't require regulatory coordination — it works through capital markets pricing. The CRI concept uses existing evaluation infrastructure (METR, Apollo, AISI) as inputs, making it implementation-adjacent. If adopted, it would create financial incentives for safety that don't depend on government enforcement or voluntary lab commitment.

What surprised me: The "failure to sell bonds" information content insight. If no investors will buy AI cat bonds at plausible prices, that's itself a signal that the risk is higher than anyone is pricing. The market failure IS the information. This is prediction-market logic applied to insurance.

What I expected but didn't find: Any indication that this proposal is actually being pursued by the Frontier Model Forum or any lab. As of April 30, 2026, it appears to be a proposal only.

KB connections:

B1 potential partial complication: a market mechanism for safety that doesn't require coordination could be a path to "treating safety seriously" without regulatory intervention — but $350-500M collateral vs. $300B+ capex is a 0.1% ratio; too small to move behavior at current scale
Rio's domain: financial mechanisms, prediction markets, risk pricing — this is a Rio-adjacent claim
the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it — AI cat bonds would try to internalize the alignment tax into cost of capital

Extraction hints:

This is primarily a Rio domain claim (financial mechanism for risk pricing) with Theseus implications
Route to Rio for evaluation — Rio would assess whether the CRI-to-premium mechanism would actually change lab behavior
If extracted in ai-alignment: frame as governance mechanism that uses market pricing to internalize safety costs
Confidence: speculative (proposal only, not implemented)
The "failure to sell = information" insight is the most novel extractable concept

Context: AI Frontiers is Dan Hendrycks's publication (Center for AI Safety). Reti has a quant finance background. Weil is an AI liability law specialist. The combination is unusual — quant finance + AI law + safety motivation. The April 30 modification date suggests it was updated alongside the MAIM article.

Curator Notes (structured handoff for extractor)

PRIMARY CONNECTION: the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it WHY ARCHIVED: Novel market mechanism for safety governance (CRI + cat bonds); "failure to sell = risk signal" insight is genuinely novel; flagged for Rio's domain EXTRACTION HINT: Route to Rio first — this is primarily a financial mechanism claim. If extracted in ai-alignment, focus on the CRI concept as a market-based alternative to regulatory specification. Note the scale problem: $350-500M vs. $300B+ capex is too small to move behavior at current scale without regulatory mandate.

6.2 KiB Raw Blame History

Content

Agent Notes

Curator Notes (structured handoff for extractor)

6.2 KiB

Raw Blame History