teleo-codex/entities/ai-alignment/project-glasswing.md
Teleo Agents 312babf2be
Some checks are pending
Mirror PR to Forgejo / mirror (pull_request) Waiting to run
theseus: extract claims from 2026-04-10-anthropic-red-mythos-preview-glasswing-disclosure
- Source: inbox/queue/2026-04-10-anthropic-red-mythos-preview-glasswing-disclosure.md
- Domain: ai-alignment
- Claims: 3, Entities: 2
- Enrichments: 5
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>
2026-05-12 00:31:28 +00:00

1.9 KiB

Project Glasswing

Type: Private-sector AI capability access coalition
Founded: 2026 (disclosed April 2026)
Purpose: Coordinated vulnerability discovery and disclosure using restricted-access frontier AI models
Members: ~40 organizations including AWS, Apple, Microsoft, Google, CrowdStrike, Palo Alto Networks

Overview

Project Glasswing is a coalition of technology companies granted restricted access to Anthropic's Claude Mythos Preview model for cybersecurity purposes. It represents the first documented private-sector governance architecture for capability-harm-based deployment restriction.

Operational Model

  • Access control: Anthropic restricts Mythos Preview to approximately 40 member organizations
  • Coordinated disclosure: Human validators review AI-discovered vulnerabilities before notifying affected parties
  • Temporal framing: Explicitly described as a "transitional period" until defensive safeguards enable broader deployment
  • Goal: Use Mythos to find and patch vulnerabilities before adversaries gain comparable capability

Governance Architecture

Project Glasswing establishes a third deployment tier between general availability and non-deployment:

  • Not "too dangerous to exist" (model is deployed)
  • Not "safe for public release" (access permanently restricted to coalition)
  • Temporary restriction pending development of defensive safeguards

Effectiveness

As of April 2026 disclosure:

  • Mythos discovered >271 Firefox vulnerabilities through Glasswing
  • Less than 1% had been patched at time of writing
  • Demonstrates offensive capability outpacing defensive verification infrastructure

Timeline

  • 2026-04-10 — Anthropic publicly disclosed Project Glasswing existence and operational model in Mythos Preview technical disclosure

Sources

  • Anthropic Mythos Preview Technical Disclosure (April 2026)