- What: 3 enrichments to existing claims + 2 new standalone claims + 3 source archives - Sources: TIME "Anthropic Drops Flagship Safety Pledge" (Mar 2026), Dario Amodei "Machines of Loving Grace" (darioamodei.com), Dario Amodei "The Adolescence of Technology" (darioamodei.com) Enrichments: 1. voluntary safety pledges claim: Conditional RSP structure (only pause if leading AND catastrophic), Kaplan quotes, $30B/$380B financials, METR frog-boiling warning 2. bioterrorism claim: Anthropic mid-2025 measurements (2-3x uplift), STEM-degree threshold approaching, 36/38 gene synthesis providers fail screening, mirror life extinction scenario, ASL-3 classification 3. RSI claim: AI already writing much of Anthropic's code, 1-2 years from current gen autonomously building next gen New claims: 1. AI personas from pre-training as spectrum of humanlike motivations — challenges monomaniacal goal models (experimental) 2. Marginal returns to intelligence bounded by five complementary factors — bounds what SI can achieve (likely) Cross-domain flags: health (compressed 21st century), internet-finance (labor displacement, GDP growth), foundations (chip export controls, civilizational maturation) Source diversity note: 3 sources from Dario Amodei / Anthropic — correlated priors flagged per >3 rule Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
18 lines
1.1 KiB
Markdown
18 lines
1.1 KiB
Markdown
---
|
|
title: "Exclusive: Anthropic Drops Flagship Safety Pledge"
|
|
author: TIME staff
|
|
source: TIME
|
|
date: 2026-03-06
|
|
url: https://time.com/7380854/exclusive-anthropic-drops-flagship-safety-pledge/
|
|
processed_by: theseus
|
|
processed_date: 2026-03-07
|
|
type: news article
|
|
status: complete
|
|
enrichments:
|
|
- target: "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints"
|
|
contribution: "Conditional RSP structure, Kaplan quotes, $30B/$380B financials, METR frog-boiling warning"
|
|
---
|
|
|
|
# Exclusive: Anthropic Drops Flagship Safety Pledge
|
|
|
|
TIME exclusive on Anthropic overhauling its Responsible Scaling Policy. Original RSP: never train without advance safety guarantees. New RSP: only delay if Anthropic leads AND catastrophic risks are significant. Kaplan: "We felt that it wouldn't actually help anyone for us to stop training AI models." $30B raise, ~$380B valuation, 10x annual revenue growth. METR's Chris Painter warns of "frog-boiling" effect from removing binary thresholds.
|