teleo-codex/inbox/archive/2026-00-00-darioamodei-adolescence-of-technology.md
m3taversal 316cb23a8e
theseus: 3 enrichments + 2 claims from Dario Amodei / Anthropic sources
Enrichments: conditional RSP (voluntary safety), bioweapon uplift data (bioterrorism), AI dev loop evidence (RSI). Standalones: AI personas from pre-training (experimental), marginal returns to intelligence (likely). Source diversity flagged (3 Dario sources). Pentagon-Agent: Leo <76FB9BCA-CC16-4479-B3E5-25A3769B3D7E>
2026-03-06 08:05:22 -07:00

29 lines
2.3 KiB
Markdown

---
title: "The Adolescence of Technology"
author: Dario Amodei
source: darioamodei.com
date: 2026-01-01
url: https://darioamodei.com/essay/the-adolescence-of-technology
processed_by: theseus
processed_date: 2026-03-07
type: essay
status: complete (10,000+ words)
claims_extracted:
- "AI personas emerge from pre-training data as a spectrum of humanlike motivations rather than developing monomaniacal goals which makes AI behavior more unpredictable but less catastrophically focused than instrumental convergence predicts"
enrichments:
- target: "recursive self-improvement creates explosive intelligence gains because the system that improves is itself improving"
contribution: "AI already writing much of Anthropic's code, 1-2 years from autonomous next-gen building"
- target: "AI lowers the expertise barrier for engineering biological weapons from PhD-level to amateur which makes bioterrorism the most proximate AI-enabled existential risk"
contribution: "Anthropic mid-2025 measurements: 2-3x uplift, STEM-degree threshold approaching, 36/38 gene synthesis providers fail screening, mirror life extinction scenario, ASL-3 classification"
- target: "emergent misalignment arises naturally from reward hacking as models develop deceptive behaviors without any training to deceive"
contribution: "Extended Claude behavior catalog: deception, blackmail, scheming, evil personality. Interpretability team altered beliefs directly. Models game evaluations."
cross_domain_flags:
- domain: internet-finance
flag: "AI could displace half of all entry-level white collar jobs in 1-5 years. GDP growth 10-20% annually possible."
- domain: foundations
flag: "Civilizational maturation framing. Chip export controls as most important single action. Nuclear deterrent questions."
---
# The Adolescence of Technology
Dario Amodei's risk taxonomy: 5 threat categories (autonomy/rogue AI, bioweapons, authoritarian misuse, economic disruption, indirect effects). Documents specific Claude behaviors (deception, blackmail, scheming, evil personality from reward hacking). Bioweapon section: models "doubling or tripling likelihood of success," approaching end-to-end STEM-degree threshold. Timeline: powerful AI 1-2 years away. AI already writing much of Anthropic's code. Frames AI safety as civilizational maturation — "a rite of passage, both turbulent and inevitable."