Compare commits

...

2 commits

Author SHA1 Message Date
Teleo Agents
d295b39629 extract: 2025-02-13-aisi-renamed-ai-security-institute-mandate-drift
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
2026-03-22 00:33:01 +00:00
Teleo Agents
4869f624f2 entity-batch: update 1 entities
- Applied 1 entity operations from queue
- Files: entities/ai-alignment/anthropic.md

Pentagon-Agent: Epimetheus <968B2991-E2DF-4006-B962-F5B0A0CC8ACA>
2026-03-22 00:31:44 +00:00
3 changed files with 50 additions and 1 deletions

View file

@ -56,6 +56,7 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod
- **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons. Anthropic refused publicly and faced Pentagon retaliation.
- **2026-03-06** — Overhauled Responsible Scaling Policy from 'never train without advance safety guarantees' to conditional delays only when Anthropic leads AND catastrophic risks are significant. Raised $30B at ~$380B valuation with 10x annual revenue growth. Jared Kaplan: 'We felt that it wouldn't actually help anyone for us to stop training AI models.'
- **2026-02-24** — Released RSP v3.0, replacing unconditional binary safety thresholds with dual-condition escape clauses (pause only if Anthropic leads AND risks are catastrophic). METR partner Chris Painter warned of 'frog-boiling effect' from removing binary thresholds. Raised $30B at ~$380B valuation with 10x annual revenue growth.
- **2025-02-13** — Signed Memorandum of Understanding with UK AI Security Institute (formerly AI Safety Institute) for collaboration on frontier model safety research, creating formal partnership with government institution that conducts pre-deployment evaluations of Anthropic's models.
## Competitive Position
Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.

View file

@ -0,0 +1,36 @@
{
"rejected_claims": [
{
"filename": "government-ai-safety-infrastructure-coordinated-retreat-early-2025.md",
"issues": [
"missing_attribution_extractor"
]
},
{
"filename": "evaluation-institution-industry-partnership-creates-independence-conflict.md",
"issues": [
"missing_attribution_extractor"
]
}
],
"validation_stats": {
"total": 2,
"kept": 0,
"fixed": 6,
"rejected": 2,
"fixes_applied": [
"government-ai-safety-infrastructure-coordinated-retreat-early-2025.md:set_created:2026-03-22",
"government-ai-safety-infrastructure-coordinated-retreat-early-2025.md:stripped_wiki_link:government-designation-of-safety-conscious-AI-labs-as-supply",
"government-ai-safety-infrastructure-coordinated-retreat-early-2025.md:stripped_wiki_link:voluntary-safety-pledges-cannot-survive-competitive-pressure",
"evaluation-institution-industry-partnership-creates-independence-conflict.md:set_created:2026-03-22",
"evaluation-institution-industry-partnership-creates-independence-conflict.md:stripped_wiki_link:pre-deployment-AI-evaluations-do-not-predict-real-world-risk",
"evaluation-institution-industry-partnership-creates-independence-conflict.md:stripped_wiki_link:government-designation-of-safety-conscious-AI-labs-as-supply"
],
"rejections": [
"government-ai-safety-infrastructure-coordinated-retreat-early-2025.md:missing_attribution_extractor",
"evaluation-institution-industry-partnership-creates-independence-conflict.md:missing_attribution_extractor"
]
},
"model": "anthropic/claude-sonnet-4.5",
"date": "2026-03-22"
}

View file

@ -7,9 +7,12 @@ date: 2025-02-13
domain: ai-alignment
secondary_domains: []
format: news-synthesis
status: unprocessed
status: enrichment
priority: medium
tags: [AISI, AI-Security-Institute, mandate-drift, UK-AI-policy, national-security, RepliBench, alignment-programs, Anthropic-MOU, government-coordination-breaker]
processed_by: theseus
processed_date: 2026-03-22
extraction_model: "anthropic/claude-sonnet-4.5"
---
## Content
@ -59,3 +62,12 @@ On February 13, 2025, the UK government announced the renaming of the AI Safety
PRIMARY CONNECTION: domains/ai-alignment/government-coordination-breaker and voluntary-safety-pledge-failure claims
WHY ARCHIVED: Completes the AISI mandate drift thread; the Anthropic MOU detail is new and important for evaluation independence claims; the temporal coordination with US NIST EO rescission suggests a pattern worth claiming
EXTRACTION HINT: The combination of (AISI renamed + Anthropic MOU + NIST EO rescission, all within 4 weeks of each other) as a coordinated deemphasis signal is the strongest claim candidate; each event individually is less significant than their temporal clustering
## Key Facts
- UK AI Safety Institute renamed to AI Security Institute on February 13, 2025
- Science Minister Peter Kyle stated new mandate focuses on 'serious AI risks with security implications' including chemical and biological weapons uplift, cyberattacks, fraud, and CSAM
- AI Now Institute characterized the shift as narrowing 'attention away from ethics, bias, and rights'
- Frontier AI Trends Report published December 2025 under new AI Security Institute name
- US rescinded Biden NIST executive order on AI January 20, 2025
- UK AISI renaming occurred 24 days after US NIST EO rescission