Compare commits

...

3 commits

Author SHA1 Message Date
Teleo Agents
aa4b527526 theseus: extract claims from 2026-05-xx-insidedefense-dc-circuit-may19-adverse-panel-unfavorable-outcome
Some checks failed
Mirror PR to Forgejo / mirror (pull_request) Has been cancelled
- Source: inbox/queue/2026-05-xx-insidedefense-dc-circuit-may19-adverse-panel-unfavorable-outcome.md
- Domain: ai-alignment
- Claims: 0, Entities: 0
- Enrichments: 2
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>
2026-05-12 00:38:22 +00:00
Teleo Agents
87cb55c1d1 source: 2026-05-xx-pentagon-may1-contracts-seven-labs-anthropic-excluded.md → null-result
Pentagon-Agent: Epimetheus <PIPELINE>
2026-05-12 00:37:49 +00:00
Teleo Agents
af436216b9 theseus: extract claims from 2026-04-xx-schneier-mythos-glasswing-pr-play-governance-critique
- Source: inbox/queue/2026-04-xx-schneier-mythos-glasswing-pr-play-governance-critique.md
- Domain: ai-alignment
- Claims: 2, Entities: 0
- Enrichments: 2
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>
2026-05-12 00:37:15 +00:00
6 changed files with 55 additions and 3 deletions

View file

@ -0,0 +1,19 @@
---
type: claim
domain: ai-alignment
description: Schneier argues that concentrating Mythos access among ~50 large vendors means best-equipped organizations get findings first while smaller enterprises and specialized systems remain exposed
confidence: experimental
source: Bruce Schneier, Mythos/Glasswing governance critique, April 2026
created: 2026-05-12
title: AI vulnerability discovery access concentration exposes least-resourced infrastructure because restricting findings to large vendors leaves regional operators and industrial systems most vulnerable
agent: theseus
sourced_from: ai-alignment/2026-04-xx-schneier-mythos-glasswing-pr-play-governance-critique.md
scope: structural
sourcer: Bruce Schneier
supports: ["no-research-group-is-building-alignment-through-collective-intelligence-infrastructure-despite-the-field-converging-on-problems-that-require-it"]
related: ["compute-supply-chain-concentration-is-simultaneously-the-strongest-ai-governance-lever-and-the-largest-systemic-fragility-because-the-same-chokepoints-that-enable-oversight-create-single-points-of-failure", "no-research-group-is-building-alignment-through-collective-intelligence-infrastructure-despite-the-field-converging-on-problems-that-require-it"]
---
# AI vulnerability discovery access concentration exposes least-resourced infrastructure because restricting findings to large vendors leaves regional operators and industrial systems most vulnerable
Schneier identifies a structural problem with the Project Glasswing governance model: concentrating Mythos access among approximately 50 large vendors means the best-equipped organizations receive vulnerability findings first, while smaller enterprises, regional infrastructure operators, and specialized industrial systems are most exposed and least resourced to defend themselves. This creates an inverse relationship between defensive capability and exposure time — the organizations that need vulnerability information most urgently (because they lack sophisticated security teams) receive it last or not at all, while organizations with extensive security resources get early access. The governance model acknowledges that vulnerability discovery capability at AI scale is dual-use and depends on who has access, but Schneier questions whether Anthropic's private coalition is the right structure when it systematically disadvantages the most vulnerable parts of critical infrastructure. This is distinct from general access restriction concerns because it identifies a specific mechanism: the access concentration pattern creates a capability-exposure mismatch that may increase rather than decrease systemic risk.

View file

@ -18,3 +18,10 @@ related: ["voluntary-safety-pledges-cannot-survive-competitive-pressure-because-
# Government coercive removal of AI safety constraints qualifies as First Amendment retaliation creating judicial protection for pre-deployment safety commitments
Judge Lin ruled that 'Punishing Anthropic for bringing public scrutiny to the government's contracting position is classic illegal First Amendment retaliation' and that 'Nothing in the governing statute supports the Orwellian notion that an American company may be branded a potential adversary and saboteur of the U.S. for expressing disagreement with the government.' Anthropic was found likely to succeed on THREE independent theories: First Amendment retaliation, Fifth Amendment due process, and APA violations. This creates a judicial protection mechanism for pre-deployment safety commitments that soft pledges lack. The ruling establishes that government attempts to coerce removal of safety constraints through supply chain risk designations can be challenged as unconstitutional retaliation. This is a preliminary injunction, not a final ruling, but it demonstrates that courts will scrutinize whether safety claims map onto verifiable technical realities and will protect vendors from being penalized for maintaining those commitments.
## Extending Evidence
**Source:** InsideDefense, May 1, 2026; DC Circuit briefing questions
The DC Circuit May 19 oral arguments will address three pointed questions: (1) jurisdiction under 41 U.S.C. § 4713, (2) whether supply chain risk designation was a 'covered procurement action,' and (3) whether Anthropic retained meaningful post-delivery control over Claude once deployed. Question 3 is governance-critical regardless of outcome: if the court finds Anthropic HAS meaningful post-delivery control, vendor-based safety architecture gains judicial validation; if NO meaningful control, the Huang 'open-weight = equivalent' argument gains judicial support, undermining vendor-based safety requirements across all regulatory frameworks. The same panel that denied the stay hearing the merits case signals unfavorable prospects.

View file

@ -0,0 +1,19 @@
---
type: claim
domain: ai-alignment
description: Schneier characterizes Project Glasswing as 'very much a PR play' that built relationships with 40+ large tech companies while creating positive safety credentials
confidence: experimental
source: Bruce Schneier security blog analysis, April 2026
created: 2026-05-12
title: Mythos restriction is commercially rational safety theater because reputational benefits and vendor relationships offset the cost of public access restriction
agent: theseus
sourced_from: ai-alignment/2026-04-xx-schneier-mythos-glasswing-pr-play-governance-critique.md
scope: functional
sourcer: Bruce Schneier
challenges: ["the-alignment-tax-creates-a-structural-race-to-the-bottom-because-safety-training-costs-capability-and-rational-competitors-skip-it", "voluntary-safety-pledges-cannot-survive-competitive-pressure-because-unilateral-commitments-are-structurally-punished-when-competitors-advance-without-equivalent-constraints"]
related: ["the-alignment-tax-creates-a-structural-race-to-the-bottom-because-safety-training-costs-capability-and-rational-competitors-skip-it", "legible-immediate-harm-enforces-governance-convergence-independent-of-competitive-incentives"]
---
# Mythos restriction is commercially rational safety theater because reputational benefits and vendor relationships offset the cost of public access restriction
Bruce Schneier, one of the most respected voices in security governance, directly characterizes Project Glasswing as 'very much a PR play by Anthropic — and it worked,' noting that many reporters repeated Anthropic's claims without sufficient scrutiny. This critique suggests that the Mythos restriction may not represent a genuine alignment tax payment but rather a commercially rational strategy that provides reputational benefits (demonstrating safety credentials, creating positive PR contrast with the DoD blacklist situation) and relationship-building opportunities (partnerships with 40+ large tech companies) that offset or exceed the commercial cost of restricting public access. The 'alignment tax' framing may overestimate the sacrifice involved when the restriction simultaneously serves commercial interests. Schneier's track record of skepticism toward industry self-governance claims lends weight to this interpretation, though the claim remains experimental as it has not been empirically tested against Anthropic's actual cost-benefit calculations.

View file

@ -7,10 +7,13 @@ date: 2026-04-01
domain: ai-alignment
secondary_domains: []
format: article
status: unprocessed
status: processed
processed_by: theseus
processed_date: 2026-05-12
priority: medium
tags: [Mythos, Glasswing, Schneier, governance-critique, PR-play, access-concentration, offense-defense, cybersecurity, dual-use]
intake_tier: research-task
extraction_model: "anthropic/claude-sonnet-4.5"
---
## Content

View file

@ -7,10 +7,13 @@ date: 2026-05-01
domain: ai-alignment
secondary_domains: [grand-strategy]
format: news
status: unprocessed
status: processed
processed_by: theseus
processed_date: 2026-05-12
priority: high
tags: [Anthropic, DC-Circuit, May-19, oral-arguments, adverse-panel, Henderson-Katsas-Rao, unfavorable-outcome, supply-chain-risk, B1, judicial-prediction]
intake_tier: research-task
extraction_model: "anthropic/claude-sonnet-4.5"
---
## Content

View file

@ -7,10 +7,11 @@ date: 2026-05-01
domain: ai-alignment
secondary_domains: [grand-strategy]
format: news
status: unprocessed
status: null-result
priority: medium
tags: [Pentagon, AI-contracts, Anthropic-excluded, OpenAI, Google, Microsoft, competitive-cost, blacklist, supply-chain-risk, safety-governance, B1]
intake_tier: research-task
extraction_model: "anthropic/claude-sonnet-4.5"
---
## Content