diff --git a/entities/ai-alignment/anthropic.md b/entities/ai-alignment/anthropic.md
index 49306f92..91ffcf84 100644
--- a/entities/ai-alignment/anthropic.md
+++ b/entities/ai-alignment/anthropic.md
@@ -67,6 +67,7 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod
 - **2025-05-01** — Activated ASL-3 protections for Claude Opus 4 as precautionary measure without confirmed threshold crossing, citing evaluation uncertainty and upward capability trends
 - **2025-05-01** — Activated ASL-3 protections for Claude Opus 4 as precautionary measure without confirmed threshold crossing, first model that could not be positively ruled below ASL-3 thresholds
 - **2025-05-01** — Activated ASL-3 protections for Claude Opus 4 as precautionary measure without confirmed threshold crossing, first model that could not be positively ruled out as below ASL-3 capability levels
+- **2025-08-01** — Published persona vectors research demonstrating activation-based monitoring of behavioral traits (sycophancy, hallucination) in small open-source models, with 'preventative steering during training' reducing trait acquisition without capability loss
 
 ## Competitive Position
 Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.