extract: 2026-03-30-anthropic-auditbench-alignment-auditing-hidden-behaviors

Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
entity-batch: update 1 entities
2026-03-30 00:50:47 +00:00 · 2026-03-30 00:34:07 +00:00
1 changed files with 1 additions and 0 deletions
--- a/entities/ai-alignment/anthropic.md
+++ b/entities/ai-alignment/anthropic.md
@ -81,6 +81,7 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod
 - **2025** — Demonstrated circuit tracing on Claude 3.5 Haiku, showing mechanisms behind multi-step reasoning, hallucination, and jailbreak resistance can be surfaced through interpretability tools
 - **2026** — MIT Technology Review designated mechanistic interpretability a 2026 Breakthrough Technology, providing mainstream credibility for Anthropic's interpretability research direction
 - **2026-03** — Established Public First Action PAC with $20M investment, shifting from unilateral safety sacrifice to electoral strategy for changing AI governance game structure
+- **2026-03-01** — Pentagon designates Anthropic as 'supply chain risk' after company refuses to drop contractual prohibitions on autonomous killing and mass domestic surveillance. European Policy Centre calls for EU to back companies maintaining safety standards against government coercion.
 ## Competitive Position
 Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.
Author	SHA1	Message	Date
Teleo Agents	2575d7aaba	extract: 2026-03-30-anthropic-auditbench-alignment-auditing-hidden-behaviors Some checks are pending Sync Graph Data to teleo-app / sync (push) Waiting to run Details Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-30 00:50:47 +00:00
Teleo Agents	ddce06bd3d	entity-batch: update 1 entities - Applied 1 entity operations from queue - Files: entities/ai-alignment/anthropic.md Pentagon-Agent: Epimetheus <968B2991-E2DF-4006-B962-F5B0A0CC8ACA>	2026-03-30 00:34:07 +00:00