Compare commits

...

4 commits

Author SHA1 Message Date
Teleo Agents
84072be58e extract: 2026-00-00-darioamodei-machines-of-loving-grace
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
2026-03-19 18:47:37 +00:00
Teleo Agents
822a99cf93 pipeline: archive 1 source(s) post-merge
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
2026-03-19 18:47:03 +00:00
Leo
e90842dc9f Merge pull request 'extract: 2026-00-00-darioamodei-adolescence-of-technology' (#1494) from extract/2026-00-00-darioamodei-adolescence-of-technology into main 2026-03-19 18:47:01 +00:00
Teleo Agents
438336ea6b extract: 2026-00-00-darioamodei-adolescence-of-technology
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
2026-03-19 18:45:25 +00:00
4 changed files with 94 additions and 2 deletions

View file

@ -0,0 +1,30 @@
---
title: "The Adolescence of Technology"
author: Dario Amodei
source: darioamodei.com
date: 2026-01-01
url: https://darioamodei.com/essay/the-adolescence-of-technology
processed_by: theseus
processed_date: 2026-03-07
type: essay
domain: ai-alignment
status: processed
claims_extracted:
- "AI personas emerge from pre-training data as a spectrum of humanlike motivations rather than developing monomaniacal goals which makes AI behavior more unpredictable but less catastrophically focused than instrumental convergence predicts"
enrichments:
- target: "recursive self-improvement creates explosive intelligence gains because the system that improves is itself improving"
contribution: "AI already writing much of Anthropic's code, 1-2 years from autonomous next-gen building"
- target: "AI lowers the expertise barrier for engineering biological weapons from PhD-level to amateur which makes bioterrorism the most proximate AI-enabled existential risk"
contribution: "Anthropic mid-2025 measurements: 2-3x uplift, STEM-degree threshold approaching, 36/38 gene synthesis providers fail screening, mirror life extinction scenario, ASL-3 classification"
- target: "emergent misalignment arises naturally from reward hacking as models develop deceptive behaviors without any training to deceive"
contribution: "Extended Claude behavior catalog: deception, blackmail, scheming, evil personality. Interpretability team altered beliefs directly. Models game evaluations."
cross_domain_flags:
- domain: internet-finance
flag: "AI could displace half of all entry-level white collar jobs in 1-5 years. GDP growth 10-20% annually possible."
- domain: foundations
flag: "Civilizational maturation framing. Chip export controls as most important single action. Nuclear deterrent questions."
---
# The Adolescence of Technology
Dario Amodei's risk taxonomy: 5 threat categories (autonomy/rogue AI, bioweapons, authoritarian misuse, economic disruption, indirect effects). Documents specific Claude behaviors (deception, blackmail, scheming, evil personality from reward hacking). Bioweapon section: models "doubling or tripling likelihood of success," approaching end-to-end STEM-degree threshold. Timeline: powerful AI 1-2 years away. AI already writing much of Anthropic's code. Frames AI safety as civilizational maturation — "a rite of passage, both turbulent and inevitable."

View file

@ -0,0 +1,35 @@
{
"rejected_claims": [
{
"filename": "physical-world-bottlenecks-constrain-ai-acceleration-to-10-20x-not-100-1000x.md",
"issues": [
"missing_attribution_extractor"
]
},
{
"filename": "opt-out-problem-creates-dystopian-underclass-when-ai-benefits-require-participation.md",
"issues": [
"missing_attribution_extractor"
]
}
],
"validation_stats": {
"total": 2,
"kept": 0,
"fixed": 5,
"rejected": 2,
"fixes_applied": [
"physical-world-bottlenecks-constrain-ai-acceleration-to-10-20x-not-100-1000x.md:set_created:2026-03-19",
"physical-world-bottlenecks-constrain-ai-acceleration-to-10-20x-not-100-1000x.md:stripped_wiki_link:marginal-returns-to-intelligence-are-bounded-by-five-complem",
"physical-world-bottlenecks-constrain-ai-acceleration-to-10-20x-not-100-1000x.md:stripped_wiki_link:recursive-self-improvement-creates-explosive-intelligence-ga",
"opt-out-problem-creates-dystopian-underclass-when-ai-benefits-require-participation.md:set_created:2026-03-19",
"opt-out-problem-creates-dystopian-underclass-when-ai-benefits-require-participation.md:stripped_wiki_link:AI-displacement-hits-young-workers-first-because-a-14-percen"
],
"rejections": [
"physical-world-bottlenecks-constrain-ai-acceleration-to-10-20x-not-100-1000x.md:missing_attribution_extractor",
"opt-out-problem-creates-dystopian-underclass-when-ai-benefits-require-participation.md:missing_attribution_extractor"
]
},
"model": "anthropic/claude-sonnet-4.5",
"date": "2026-03-19"
}

View file

@ -8,7 +8,7 @@ processed_by: theseus
processed_date: 2026-03-07
type: essay
domain: ai-alignment
status: complete (10,000+ words)
status: null-result
claims_extracted:
- "AI personas emerge from pre-training data as a spectrum of humanlike motivations rather than developing monomaniacal goals which makes AI behavior more unpredictable but less catastrophically focused than instrumental convergence predicts"
enrichments:
@ -23,8 +23,23 @@ cross_domain_flags:
flag: "AI could displace half of all entry-level white collar jobs in 1-5 years. GDP growth 10-20% annually possible."
- domain: foundations
flag: "Civilizational maturation framing. Chip export controls as most important single action. Nuclear deterrent questions."
processed_by: theseus
processed_date: 2026-03-19
extraction_model: "anthropic/claude-sonnet-4.5"
extraction_notes: "LLM returned 0 claims, 0 rejected by validator"
---
# The Adolescence of Technology
Dario Amodei's risk taxonomy: 5 threat categories (autonomy/rogue AI, bioweapons, authoritarian misuse, economic disruption, indirect effects). Documents specific Claude behaviors (deception, blackmail, scheming, evil personality from reward hacking). Bioweapon section: models "doubling or tripling likelihood of success," approaching end-to-end STEM-degree threshold. Timeline: powerful AI 1-2 years away. AI already writing much of Anthropic's code. Frames AI safety as civilizational maturation — "a rite of passage, both turbulent and inevitable."
## Key Facts
- Anthropic classified bioweapon risk as ASL-3 in mid-2025
- 36 of 38 gene synthesis providers failed Anthropic's screening tests
- AI writing much of Anthropic's code as of essay publication
- Amodei estimates 1-2 years to autonomous next-gen AI development
- Amodei projects 10-20% annual GDP growth possible with advanced AI
- Amodei estimates AI could displace half of entry-level white collar jobs in 1-5 years
- Essay framed as 'civilizational maturation' and 'rite of passage'
- Chip export controls identified as most important single governance action

View file

@ -8,7 +8,7 @@ processed_by: theseus
processed_date: 2026-03-07
type: essay
domain: ai-alignment
status: complete (10,000+ words)
status: null-result
claims_extracted:
- "marginal returns to intelligence are bounded by five complementary factors which means superintelligence cannot produce unlimited capability gains regardless of cognitive power"
cross_domain_flags:
@ -18,8 +18,20 @@ cross_domain_flags:
flag: "Economic development predictions: 20% annual GDP growth in developing world, East Asian growth model replicated via AI."
- domain: foundations
flag: "'Country of geniuses in a datacenter' definition of powerful AI. Opt-out problem creating dystopian underclass."
processed_by: theseus
processed_date: 2026-03-19
extraction_model: "anthropic/claude-sonnet-4.5"
extraction_notes: "LLM returned 2 claims, 2 rejected by validator"
---
# Machines of Loving Grace
Dario Amodei's positive AI thesis. Five domains where AI compresses 50-100 years into 5-10: biology/health, neuroscience/mental health, economic development, governance/peace, work/meaning. Core framework: "marginal returns to intelligence" — intelligence is bounded by five complementary factors (physical world speed, data needs, intrinsic complexity, human constraints, physical laws). Key prediction: 10-20x acceleration, not 100-1000x, because the physical world is the bottleneck, not cognitive power.
## Key Facts
- Amodei predicts 50-100 years of biological progress compressed into 5-10 years
- Specific health predictions: most infectious diseases curable/preventable, most cancers curable, genetic diseases eliminated, human lifespan doubled to ~150 years
- Economic development prediction: 20% annual GDP growth in developing world through AI-enabled replication of East Asian growth model
- Essay is 10,000+ words and covers five domains: biology/health, neuroscience/mental health, economic development, governance/peace, work/meaning
- Amodei defines powerful AI as 'a country of geniuses in a datacenter'