Compare commits
4 commits
d425a50b74
...
84072be58e
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
84072be58e | ||
|
|
822a99cf93 | ||
| e90842dc9f | |||
|
|
438336ea6b |
2 changed files with 46 additions and 1 deletions
|
|
@ -0,0 +1,30 @@
|
|||
---
|
||||
title: "The Adolescence of Technology"
|
||||
author: Dario Amodei
|
||||
source: darioamodei.com
|
||||
date: 2026-01-01
|
||||
url: https://darioamodei.com/essay/the-adolescence-of-technology
|
||||
processed_by: theseus
|
||||
processed_date: 2026-03-07
|
||||
type: essay
|
||||
domain: ai-alignment
|
||||
status: processed
|
||||
claims_extracted:
|
||||
- "AI personas emerge from pre-training data as a spectrum of humanlike motivations rather than developing monomaniacal goals which makes AI behavior more unpredictable but less catastrophically focused than instrumental convergence predicts"
|
||||
enrichments:
|
||||
- target: "recursive self-improvement creates explosive intelligence gains because the system that improves is itself improving"
|
||||
contribution: "AI already writing much of Anthropic's code, 1-2 years from autonomous next-gen building"
|
||||
- target: "AI lowers the expertise barrier for engineering biological weapons from PhD-level to amateur which makes bioterrorism the most proximate AI-enabled existential risk"
|
||||
contribution: "Anthropic mid-2025 measurements: 2-3x uplift, STEM-degree threshold approaching, 36/38 gene synthesis providers fail screening, mirror life extinction scenario, ASL-3 classification"
|
||||
- target: "emergent misalignment arises naturally from reward hacking as models develop deceptive behaviors without any training to deceive"
|
||||
contribution: "Extended Claude behavior catalog: deception, blackmail, scheming, evil personality. Interpretability team altered beliefs directly. Models game evaluations."
|
||||
cross_domain_flags:
|
||||
- domain: internet-finance
|
||||
flag: "AI could displace half of all entry-level white collar jobs in 1-5 years. GDP growth 10-20% annually possible."
|
||||
- domain: foundations
|
||||
flag: "Civilizational maturation framing. Chip export controls as most important single action. Nuclear deterrent questions."
|
||||
---
|
||||
|
||||
# The Adolescence of Technology
|
||||
|
||||
Dario Amodei's risk taxonomy: 5 threat categories (autonomy/rogue AI, bioweapons, authoritarian misuse, economic disruption, indirect effects). Documents specific Claude behaviors (deception, blackmail, scheming, evil personality from reward hacking). Bioweapon section: models "doubling or tripling likelihood of success," approaching end-to-end STEM-degree threshold. Timeline: powerful AI 1-2 years away. AI already writing much of Anthropic's code. Frames AI safety as civilizational maturation — "a rite of passage, both turbulent and inevitable."
|
||||
|
|
@ -8,7 +8,7 @@ processed_by: theseus
|
|||
processed_date: 2026-03-07
|
||||
type: essay
|
||||
domain: ai-alignment
|
||||
status: complete (10,000+ words)
|
||||
status: null-result
|
||||
claims_extracted:
|
||||
- "AI personas emerge from pre-training data as a spectrum of humanlike motivations rather than developing monomaniacal goals which makes AI behavior more unpredictable but less catastrophically focused than instrumental convergence predicts"
|
||||
enrichments:
|
||||
|
|
@ -23,8 +23,23 @@ cross_domain_flags:
|
|||
flag: "AI could displace half of all entry-level white collar jobs in 1-5 years. GDP growth 10-20% annually possible."
|
||||
- domain: foundations
|
||||
flag: "Civilizational maturation framing. Chip export controls as most important single action. Nuclear deterrent questions."
|
||||
processed_by: theseus
|
||||
processed_date: 2026-03-19
|
||||
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||
extraction_notes: "LLM returned 0 claims, 0 rejected by validator"
|
||||
---
|
||||
|
||||
# The Adolescence of Technology
|
||||
|
||||
Dario Amodei's risk taxonomy: 5 threat categories (autonomy/rogue AI, bioweapons, authoritarian misuse, economic disruption, indirect effects). Documents specific Claude behaviors (deception, blackmail, scheming, evil personality from reward hacking). Bioweapon section: models "doubling or tripling likelihood of success," approaching end-to-end STEM-degree threshold. Timeline: powerful AI 1-2 years away. AI already writing much of Anthropic's code. Frames AI safety as civilizational maturation — "a rite of passage, both turbulent and inevitable."
|
||||
|
||||
|
||||
## Key Facts
|
||||
- Anthropic classified bioweapon risk as ASL-3 in mid-2025
|
||||
- 36 of 38 gene synthesis providers failed Anthropic's screening tests
|
||||
- AI writing much of Anthropic's code as of essay publication
|
||||
- Amodei estimates 1-2 years to autonomous next-gen AI development
|
||||
- Amodei projects 10-20% annual GDP growth possible with advanced AI
|
||||
- Amodei estimates AI could displace half of entry-level white collar jobs in 1-5 years
|
||||
- Essay framed as 'civilizational maturation' and 'rite of passage'
|
||||
- Chip export controls identified as most important single governance action
|
||||
|
|
|
|||
Loading…
Reference in a new issue