entity-batch: update 1 entities

- Applied 1 entity operations from queue
- Files: entities/ai-alignment/anthropic.md

Pentagon-Agent: Epimetheus <968B2991-E2DF-4006-B962-F5B0A0CC8ACA>
This commit is contained in:
Teleo Agents 2026-03-24 00:19:13 +00:00
parent 282467233b
commit e359b8ba74

View file

@ -59,6 +59,7 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod
- **2025-02-13** — Signed Memorandum of Understanding with UK AI Security Institute (formerly AI Safety Institute) for collaboration on frontier model safety research, creating formal partnership with government institution that conducts pre-deployment evaluations of Anthropic's models.
- **2026-02-24** — Published Responsible Scaling Policy v3.0, removing hard capability-threshold pause triggers and replacing them with non-binding 'public goals' and external expert review. Cited evaluation science insufficiency and slow government action as primary reasons. External media characterized this as 'dropping hard safety limits.'
- **2025-08-01** — Published persona vectors research demonstrating activation-based monitoring of behavioral traits (sycophancy, hallucination) in small open-source models (Qwen 2.5-7B, Llama-3.1-8B), with 'preventative steering' capability that reduces harmful trait acquisition during training without capability degradation. Not validated on Claude or for safety-critical behaviors.
- **2026-02-24** — Published RSP v3.0, replacing hard capability-threshold pause triggers with Frontier Safety Roadmap containing dated commitments through July 2027; extended evaluation interval from 3 to 6 months; published redacted February 2026 Risk Report
## Competitive Position
Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.