- What: 3 enrichments to existing claims + 2 new standalone claims + 3 source archives - Sources: TIME "Anthropic Drops Flagship Safety Pledge" (Mar 2026), Dario Amodei "Machines of Loving Grace" (darioamodei.com), Dario Amodei "The Adolescence of Technology" (darioamodei.com) Enrichments: 1. voluntary safety pledges claim: Conditional RSP structure (only pause if leading AND catastrophic), Kaplan quotes, $30B/$380B financials, METR frog-boiling warning 2. bioterrorism claim: Anthropic mid-2025 measurements (2-3x uplift), STEM-degree threshold approaching, 36/38 gene synthesis providers fail screening, mirror life extinction scenario, ASL-3 classification 3. RSI claim: AI already writing much of Anthropic's code, 1-2 years from current gen autonomously building next gen New claims: 1. AI personas from pre-training as spectrum of humanlike motivations — challenges monomaniacal goal models (experimental) 2. Marginal returns to intelligence bounded by five complementary factors — bounds what SI can achieve (likely) Cross-domain flags: health (compressed 21st century), internet-finance (labor displacement, GDP growth), foundations (chip export controls, civilizational maturation) Source diversity note: 3 sources from Dario Amodei / Anthropic — correlated priors flagged per >3 rule Pentagon-Agent: Theseus <845F10FB-BC22-40F6-A6A6-F6E4D8F78465>
1.1 KiB
1.1 KiB
| title | author | source | date | url | processed_by | processed_date | type | status | enrichments | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Exclusive: Anthropic Drops Flagship Safety Pledge | TIME staff | TIME | 2026-03-06 | https://time.com/7380854/exclusive-anthropic-drops-flagship-safety-pledge/ | theseus | 2026-03-07 | news article | complete |
|
Exclusive: Anthropic Drops Flagship Safety Pledge
TIME exclusive on Anthropic overhauling its Responsible Scaling Policy. Original RSP: never train without advance safety guarantees. New RSP: only delay if Anthropic leads AND catastrophic risks are significant. Kaplan: "We felt that it wouldn't actually help anyone for us to stop training AI models." $30B raise, ~$380B valuation, 10x annual revenue growth. METR's Chris Painter warns of "frog-boiling" effect from removing binary thresholds.