Compare commits
20 commits
extract/20
...
main
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
a2eb074e52 | ||
|
|
fa65d8ca3c | ||
| a664eeb0ca | |||
|
|
488ca3698a | ||
|
|
8a3b97454f | ||
| 53975fb1e3 | |||
| d431236909 | |||
|
|
29eb6e8607 | ||
|
|
e700ceb6c6 | ||
|
|
2442ab4b44 | ||
|
|
86d20401fb | ||
|
|
53f8b17263 | ||
|
|
27738263dd | ||
| 456372c3dc | |||
|
|
7593b07d74 | ||
|
|
aa496c0deb | ||
| 8a378b02d5 | |||
|
|
8598d95858 | ||
|
|
2ad829b5cc | ||
|
|
43382ad7c4 |
27 changed files with 308 additions and 29 deletions
|
|
@ -30,6 +30,12 @@ This concentration has direct alignment implications:
|
||||||
|
|
||||||
The counterfactual worth tracking: Chinese open-source models (Qwen, DeepSeek) now capture 50-60% of new open-model adoption globally. If open-source models close the capability gap (currently 6-18 months, shrinking), capital concentration at the frontier may become less alignment-relevant as capability diffuses. But as of March 2026, frontier capability remains concentrated.
|
The counterfactual worth tracking: Chinese open-source models (Qwen, DeepSeek) now capture 50-60% of new open-model adoption globally. If open-source models close the capability gap (currently 6-18 months, shrinking), capital concentration at the frontier may become less alignment-relevant as capability diffuses. But as of March 2026, frontier capability remains concentrated.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (extend)
|
||||||
|
*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
450+ organizations lobbied on AI in 2025, up from 6 in 2016. $92M in lobbying fees Q1-Q3 2025. Industry successfully blocked California SB 1047 through coordinated lobbying. Concentration creates not just market power but political power—oligopoly structure enables collective action to prevent binding regulation.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -41,6 +41,12 @@ Expert consensus identifies 'external scrutiny, proactive evaluation and transpa
|
||||||
|
|
||||||
STREAM proposal identifies that current model reports lack 'sufficient detail to enable meaningful independent assessment' of dangerous capability evaluations. The need for a standardized reporting framework confirms that transparency problems extend beyond general disclosure (FMTI scores) to the specific domain of dangerous capability evaluation where external verification is currently impossible.
|
STREAM proposal identifies that current model reports lack 'sufficient detail to enable meaningful independent assessment' of dangerous capability evaluations. The need for a standardized reporting framework confirms that transparency problems extend beyond general disclosure (FMTI scores) to the specific domain of dangerous capability evaluation where external verification is currently impossible.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (confirm)
|
||||||
|
*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Stanford FMTI 2024→2025 data: mean transparency score declined 17 points. Meta -29 points, Mistral -37 points, OpenAI -14 points. OpenAI removed 'safely' from mission statement (Nov 2025), dissolved Superalignment team (May 2024) and Mission Alignment team (Feb 2026). Google accused by 60 UK lawmakers of violating Seoul commitments with Gemini 2.5 Pro (Apr 2025).
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -21,6 +21,12 @@ This is not a story about Anthropic's leadership failing. It is a story about [[
|
||||||
|
|
||||||
The alignment implication is structural: if the most safety-motivated lab with the most commercially successful safety brand cannot maintain binding safety commitments, then voluntary self-regulation is not a viable alignment strategy. This strengthens the case for coordination-based approaches — [[AI alignment is a coordination problem not a technical problem]] — because the failure mode is not that safety is technically impossible but that unilateral safety is economically unsustainable.
|
The alignment implication is structural: if the most safety-motivated lab with the most commercially successful safety brand cannot maintain binding safety commitments, then voluntary self-regulation is not a viable alignment strategy. This strengthens the case for coordination-based approaches — [[AI alignment is a coordination problem not a technical problem]] — because the failure mode is not that safety is technically impossible but that unilateral safety is economically unsustainable.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (confirm)
|
||||||
|
*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Anthropic's own language in RSP documentation: commitments are 'very hard to meet without industry-wide coordination.' OpenAI made safety explicitly conditional on competitor behavior in Preparedness Framework v2 (April 2025). Pattern holds across all voluntary commitments—no frontier lab maintained unilateral safety constraints when competitors advanced without them.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -30,6 +30,12 @@ For alignment, this means the governance infrastructure that exists (export cont
|
||||||
|
|
||||||
The CFR article confirms diverging governance philosophies between democracies and authoritarian systems, with China's amended Cybersecurity Law emphasizing state oversight while the US pursues standard-setting body engagement. Horowitz notes the US 'must engage in standard-setting bodies to counter China's AI governance influence,' indicating that the most active governance is competitive positioning rather than safety coordination.
|
The CFR article confirms diverging governance philosophies between democracies and authoritarian systems, with China's amended Cybersecurity Law emphasizing state oversight while the US pursues standard-setting body engagement. Horowitz notes the US 'must engage in standard-setting bodies to counter China's AI governance influence,' indicating that the most active governance is competitive positioning rather than safety coordination.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (extend)
|
||||||
|
*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
US export controls use tiered country system with deployment caps. Nvidia designed compliance chips (H800, A800) specifically to meet regulatory thresholds. Mechanism proves compute governance CAN work when backed by state enforcement, but current implementation optimizes for strategic advantage over China rather than catastrophic risk reduction. KYC for compute proposed but not implemented, showing technical feasibility without political will.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -40,10 +40,16 @@ The voluntary-collaborative model adds a selection bias dimension to evaluation
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (confirm)
|
### Additional Evidence (confirm)
|
||||||
*Source: [[2026-02-23-shapira-agents-of-chaos]] | Added: 2026-03-19*
|
*Source: 2026-02-23-shapira-agents-of-chaos | Added: 2026-03-19*
|
||||||
|
|
||||||
Agents of Chaos study provides concrete empirical evidence: 11 documented case studies of security vulnerabilities (unauthorized compliance, identity spoofing, cross-agent propagation, destructive actions) that emerged only in realistic multi-agent deployment with persistent memory and system access—none of which would be detected by static single-agent benchmarks. The study explicitly argues that current evaluation paradigms are insufficient for realistic deployment conditions.
|
Agents of Chaos study provides concrete empirical evidence: 11 documented case studies of security vulnerabilities (unauthorized compliance, identity spoofing, cross-agent propagation, destructive actions) that emerged only in realistic multi-agent deployment with persistent memory and system access—none of which would be detected by static single-agent benchmarks. The study explicitly argues that current evaluation paradigms are insufficient for realistic deployment conditions.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (extend)
|
||||||
|
*Source: [[2026-03-00-metr-aisi-pre-deployment-evaluation-practice]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
METR and UK AISI evaluations as of March 2026 focus primarily on sabotage risk and cyber capabilities (METR's Claude Opus 4.6 sabotage assessment, AISI's cyber range testing of 7 LLMs). This narrow scope may miss alignment-relevant risks that don't manifest as sabotage or cyber threats. The evaluation infrastructure is optimizing for measurable near-term risks rather than harder-to-operationalize catastrophic scenarios.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
@ -52,5 +58,5 @@ Relevant Notes:
|
||||||
- [[the gap between theoretical AI capability and observed deployment is massive across all occupations because adoption lag not capability limits determines real-world impact]]
|
- [[the gap between theoretical AI capability and observed deployment is massive across all occupations because adoption lag not capability limits determines real-world impact]]
|
||||||
|
|
||||||
Topics:
|
Topics:
|
||||||
- [[domains/ai-alignment/_map]]
|
- domains/ai-alignment/_map
|
||||||
- [[core/grand-strategy/_map]]
|
- core/grand-strategy/_map
|
||||||
|
|
|
||||||
|
|
@ -39,6 +39,12 @@ The International AI Safety Report 2026 (multi-government committee, February 20
|
||||||
|
|
||||||
The gap between expert consensus (76 specialists identify third-party audits as top-3 priority) and actual implementation (no mandatory audit requirements at major labs) demonstrates that knowing what's needed is insufficient. Even when the field's experts across multiple domains agree on priorities, competitive dynamics prevent voluntary adoption.
|
The gap between expert consensus (76 specialists identify third-party audits as top-3 priority) and actual implementation (no mandatory audit requirements at major labs) demonstrates that knowing what's needed is insufficient. Even when the field's experts across multiple domains agree on priorities, competitive dynamics prevent voluntary adoption.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (confirm)
|
||||||
|
*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Comprehensive evidence across governance mechanisms: ALL international declarations (Bletchley, Seoul, Paris, Hiroshima, OECD, UN) produced zero verified behavioral change. Frontier Model Forum produced no binding commitments. White House voluntary commitments eroded. 450+ organizations lobbied on AI in 2025 ($92M in fees), California SB 1047 vetoed after industry pressure. Only binding regulation (EU AI Act, China enforcement, US export controls) changed behavior.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -52,16 +52,22 @@ Dropout crossed 1M paid subscribers in October 2025 with 31% YoY growth, represe
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (confirm)
|
### Additional Evidence (confirm)
|
||||||
*Source: [[2024-00-00-markrmason-dropout-streaming-model-community-economics]] | Added: 2026-03-18*
|
*Source: 2024-00-00-markrmason-dropout-streaming-model-community-economics | Added: 2026-03-18*
|
||||||
|
|
||||||
Dropout contributes $30M+ ARR to the indie streaming category as of 2023, with 1M+ subscribers by October 2025. Platform is profitable and distributed profit sharing to all contributors earning $1+ in 2023. This adds another data point to the commercial scale thesis for creator-owned streaming.
|
Dropout contributes $30M+ ARR to the indie streaming category as of 2023, with 1M+ subscribers by October 2025. Platform is profitable and distributed profit sharing to all contributors earning $1+ in 2023. This adds another data point to the commercial scale thesis for creator-owned streaming.
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (confirm)
|
### Additional Evidence (confirm)
|
||||||
*Source: [[2024-00-00-markrmason-dropout-streaming-model-community-economics]] | Added: 2026-03-19*
|
*Source: 2024-00-00-markrmason-dropout-streaming-model-community-economics | Added: 2026-03-19*
|
||||||
|
|
||||||
Dropout specifically contributes $30M+ ARR to the indie streaming category total. The platform's profitability and profit-sharing model (distributed to anyone earning $1+ in 2023) demonstrates creator-owned infrastructure can sustain both platform operations and contributor compensation at scale.
|
Dropout specifically contributes $30M+ ARR to the indie streaming category total. The platform's profitability and profit-sharing model (distributed to anyone earning $1+ in 2023) demonstrates creator-owned infrastructure can sustain both platform operations and contributor compensation at scale.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (confirm)
|
||||||
|
*Source: [[2026-03-01-variety-dropout-superfan-tier-1million-subscribers]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Dropout crossed 1 million subscribers in October 2025 with 31% year-over-year growth, representing a major indie streaming platform reaching seven-figure subscriber scale. This adds to the evidence that creator-owned streaming is commercially viable at scale.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -38,16 +38,22 @@ Critical Role's Beacon launched May 2024 at $5.99/month and experienced ~20% Twi
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (confirm)
|
### Additional Evidence (confirm)
|
||||||
*Source: [[2024-00-00-markrmason-dropout-streaming-model-community-economics]] | Added: 2026-03-18*
|
*Source: 2024-00-00-markrmason-dropout-streaming-model-community-economics | Added: 2026-03-18*
|
||||||
|
|
||||||
Dropout reached $30M+ ARR and profitability in 2023 as a niche TTRPG/game show platform. Dimension 20 sold out Madison Square Garden in January 2025. This adds TTRPG actual play to the indie streaming category alongside other verticals, with similar patterns: niche focus, subscription-first, organic social distribution.
|
Dropout reached $30M+ ARR and profitability in 2023 as a niche TTRPG/game show platform. Dimension 20 sold out Madison Square Garden in January 2025. This adds TTRPG actual play to the indie streaming category alongside other verticals, with similar patterns: niche focus, subscription-first, organic social distribution.
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (confirm)
|
### Additional Evidence (confirm)
|
||||||
*Source: [[2024-00-00-markrmason-dropout-streaming-model-community-economics]] | Added: 2026-03-19*
|
*Source: 2024-00-00-markrmason-dropout-streaming-model-community-economics | Added: 2026-03-19*
|
||||||
|
|
||||||
Dropout reached $30M+ ARR and 1M+ subscribers by October 2025, achieving profitability in 2023. The platform grew 100% in 2023 with no paid marketing until late 2022, relying entirely on organic social media clips. This confirms indie streaming platforms can reach commercial scale with niche content (TTRPG actual play, improv game shows) when community alignment is strong.
|
Dropout reached $30M+ ARR and 1M+ subscribers by October 2025, achieving profitability in 2023. The platform grew 100% in 2023 with no paid marketing until late 2022, relying entirely on organic social media clips. This confirms indie streaming platforms can reach commercial scale with niche content (TTRPG actual play, improv game shows) when community alignment is strong.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (confirm)
|
||||||
|
*Source: [[2026-03-01-variety-dropout-superfan-tier-1million-subscribers]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Dropout's growth trajectory (1M subscribers, 31% YoY growth, fan-requested premium tier) demonstrates the indie streaming category pattern: subscription-first revenue, no advertising, organic social distribution, and community-responsive product decisions. The superfan tier specifically shows how indie platforms can experiment with pricing structures that major streamers cannot.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -103,16 +103,22 @@ Value in Health modeling study shows Medicare saves $715M over 10 years with com
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (challenge)
|
### Additional Evidence (challenge)
|
||||||
*Source: [[2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction]] | Added: 2026-03-18*
|
*Source: 2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction | Added: 2026-03-18*
|
||||||
|
|
||||||
Aon's temporal cost analysis shows medical costs rise 23% in year 1 but grow only 2% after 12 months (vs 6% for non-users), with diabetes patients showing 6-9 percentage point lower cost growth at 30 months. This suggests the 'inflationary through 2035' claim may only apply to short-term payers, while long-term risk-bearers see net savings.
|
Aon's temporal cost analysis shows medical costs rise 23% in year 1 but grow only 2% after 12 months (vs 6% for non-users), with diabetes patients showing 6-9 percentage point lower cost growth at 30 months. This suggests the 'inflationary through 2035' claim may only apply to short-term payers, while long-term risk-bearers see net savings.
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (challenge)
|
### Additional Evidence (challenge)
|
||||||
*Source: [[2026-03-19-glp1-price-compression-international-generics-claim-challenge]] | Added: 2026-03-19*
|
*Source: 2026-03-19-glp1-price-compression-international-generics-claim-challenge | Added: 2026-03-19*
|
||||||
|
|
||||||
International generic competition beginning January 2026 (Canada patent expiry, immediate Sandoz/Apotex/Teva filings) creates price compression trajectory faster than 'inflationary through 2035' assumes. Oral Wegovy launched at $149-299/month (5-8x reduction vs $1,300/month injectable). China/India generics projected at $40-50/month by 2030. Aon 192K patient study shows break-even timing is highly price-sensitive: at $1,300/month, multi-year retention required; at $50-150/month, Aon data suggests cost savings within 12-18 months under capitation. The 'inflationary through 2035' conclusion holds at current US pricing but becomes invalid if international generic arbitrage and oral formulation competition compress effective prices to $50-150/month range by 2030. Scope qualification needed: claim is valid conditional on pricing trajectory assumptions that are now challenged by G7 patent cliff precedent.
|
International generic competition beginning January 2026 (Canada patent expiry, immediate Sandoz/Apotex/Teva filings) creates price compression trajectory faster than 'inflationary through 2035' assumes. Oral Wegovy launched at $149-299/month (5-8x reduction vs $1,300/month injectable). China/India generics projected at $40-50/month by 2030. Aon 192K patient study shows break-even timing is highly price-sensitive: at $1,300/month, multi-year retention required; at $50-150/month, Aon data suggests cost savings within 12-18 months under capitation. The 'inflationary through 2035' conclusion holds at current US pricing but becomes invalid if international generic arbitrage and oral formulation competition compress effective prices to $50-150/month range by 2030. Scope qualification needed: claim is valid conditional on pricing trajectory assumptions that are now challenged by G7 patent cliff precedent.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (challenge)
|
||||||
|
*Source: [[2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
If GLP-1 + exercise combination produces durable weight maintenance (3.5 kg regain vs 8.7 kg for medication alone), and if behavioral change persists after medication discontinuation, then the chronic use model may not be necessary for long-term value capture. This challenges the inflationary cost projection if the optimal intervention is time-limited medication + permanent behavioral change rather than lifetime pharmacotherapy.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -85,16 +85,22 @@ Weight regain data shows that even among patients who complete treatment, GLP-1
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (extend)
|
### Additional Evidence (extend)
|
||||||
*Source: [[2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction]] | Added: 2026-03-18*
|
*Source: 2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction | Added: 2026-03-18*
|
||||||
|
|
||||||
Aon data shows the 80%+ adherent cohort captures dramatically stronger cost reductions (9 percentage points lower for diabetes, 7 points for weight loss), confirming that adherence is the binding variable for economic viability. The adherence-dependent savings pattern means low persistence rates eliminate cost-effectiveness even when clinical benefits exist.
|
Aon data shows the 80%+ adherent cohort captures dramatically stronger cost reductions (9 percentage points lower for diabetes, 7 points for weight loss), confirming that adherence is the binding variable for economic viability. The adherence-dependent savings pattern means low persistence rates eliminate cost-effectiveness even when clinical benefits exist.
|
||||||
|
|
||||||
|
|
||||||
### Additional Evidence (extend)
|
### Additional Evidence (extend)
|
||||||
*Source: [[2026-03-19-vida-ai-biology-acceleration-healthspan-constraint]] | Added: 2026-03-19*
|
*Source: 2026-03-19-vida-ai-biology-acceleration-healthspan-constraint | Added: 2026-03-19*
|
||||||
|
|
||||||
GLP-1 behavioral adherence failures demonstrate that even breakthrough pharmacology cannot overcome behavioral determinants: patients on GLP-1 alone show same weight regain as placebo without behavior change. This is direct evidence that the 'human constraints' factor (Amodei framework) limits pharmaceutical efficacy independent of drug quality.
|
GLP-1 behavioral adherence failures demonstrate that even breakthrough pharmacology cannot overcome behavioral determinants: patients on GLP-1 alone show same weight regain as placebo without behavior change. This is direct evidence that the 'human constraints' factor (Amodei framework) limits pharmaceutical efficacy independent of drug quality.
|
||||||
|
|
||||||
|
|
||||||
|
### Additional Evidence (extend)
|
||||||
|
*Source: [[2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach]] | Added: 2026-03-19*
|
||||||
|
|
||||||
|
Weight regain data shows GLP-1 alone (8.7 kg regain) performs no better than placebo (7.6 kg) after discontinuation, while combination with exercise reduces regain to 3.5 kg. This suggests the low persistence rates may be economically rational from a patient perspective if medication alone provides no durable benefit—patients who discontinue without establishing exercise habits return to baseline regardless of medication duration.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Relevant Notes:
|
Relevant Notes:
|
||||||
|
|
|
||||||
|
|
@ -49,6 +49,11 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod
|
||||||
|
|
||||||
- **2026-03-18** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (reported by HKS Carr-Ryan Center)
|
- **2026-03-18** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (reported by HKS Carr-Ryan Center)
|
||||||
- **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (HKS Carr-Ryan Center report)
|
- **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (HKS Carr-Ryan Center report)
|
||||||
|
- **2026-02** — Abandoned binding RSP (Responsible Scaling Policy)
|
||||||
|
- **2026-03** — Reached $380B valuation, ~$19B annualized revenue (10x YoY sustained 3 years)
|
||||||
|
- **2026-03** — Claude Code achieved 54% enterprise coding market share, $2.5B+ run-rate
|
||||||
|
- **2026-03** — Surpassed OpenAI at 40% enterprise LLM spend
|
||||||
|
- **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons. Anthropic refused publicly and faced Pentagon retaliation.
|
||||||
## Competitive Position
|
## Competitive Position
|
||||||
Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.
|
Strongest position in enterprise AI and coding. Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it.
|
||||||
|
|
||||||
|
|
|
||||||
|
|
@ -45,6 +45,12 @@ The largest and most-valued AI laboratory. OpenAI pioneered the transformer-base
|
||||||
- **2026-02** — Raised $110B at $840B valuation, restructured to PBC
|
- **2026-02** — Raised $110B at $840B valuation, restructured to PBC
|
||||||
- **2026** — IPO preparation underway
|
- **2026** — IPO preparation underway
|
||||||
|
|
||||||
|
- **2025-2026** — John Schulman departed for Thinking Machines Lab
|
||||||
|
- **2026-03** — Reached $840B valuation, ~$25B annualized revenue
|
||||||
|
- **2026-03** — 68% consumer market share, 27% enterprise LLM spend
|
||||||
|
- **2026-03** — Released GPT-5/5.2/5.3
|
||||||
|
- **2026-03** — Restructured to Public Benefit Corporation
|
||||||
|
- **2026-03** — IPO expected H2 2026-2027
|
||||||
## Competitive Position
|
## Competitive Position
|
||||||
Highest valuation and strongest consumer brand, but losing enterprise share to Anthropic. The Microsoft partnership (exclusive API hosting) provides distribution but also dependency. Key vulnerability: the enterprise coding market — where Anthropic's Claude Code dominates — may prove more valuable than consumer chat.
|
Highest valuation and strongest consumer brand, but losing enterprise share to Anthropic. The Microsoft partnership (exclusive API hosting) provides distribution but also dependency. Key vulnerability: the enterprise coding market — where Anthropic's Claude Code dominates — may prove more valuable than consumer chat.
|
||||||
|
|
||||||
|
|
|
||||||
|
|
@ -30,6 +30,7 @@ Community-driven animated IP founded by former VFX artists from Sony Pictures, A
|
||||||
- **2025-11-01** — Presented informal co-creation governance model at MIPJunior 2025 in Cannes, detailing seven specific community engagement mechanisms including weekly IP bible updates and social media as test kitchen for creative decisions
|
- **2025-11-01** — Presented informal co-creation governance model at MIPJunior 2025 in Cannes, detailing seven specific community engagement mechanisms including weekly IP bible updates and social media as test kitchen for creative decisions
|
||||||
- **2025-10-01** — Announced 39 x 7-minute animated series launching YouTube-first with Method Animation (Mediawan) co-production. Gameloft mobile game in co-development. Nearly 1B social views across community.
|
- **2025-10-01** — Announced 39 x 7-minute animated series launching YouTube-first with Method Animation (Mediawan) co-production. Gameloft mobile game in co-development. Nearly 1B social views across community.
|
||||||
- **2025-10-01** — Announced 39-episode animated series launching YouTube-first, co-produced with Method Animation (Mediawan), followed by traditional TV/streaming sales. Community has generated nearly 1B social views. Gameloft mobile game in co-development.
|
- **2025-10-01** — Announced 39-episode animated series launching YouTube-first, co-produced with Method Animation (Mediawan), followed by traditional TV/streaming sales. Community has generated nearly 1B social views. Gameloft mobile game in co-development.
|
||||||
|
- **2025-10-01** — Announced 39-episode animated series launching YouTube-first, co-produced with Method Animation (Mediawan), with Gameloft mobile game in co-development. Community has generated nearly 1B social views.
|
||||||
## Relationship to KB
|
## Relationship to KB
|
||||||
|
|
||||||
- Implements [[fanchise management is a stack of increasing fan engagement from content extensions through co-creation and co-ownership]] through specific co-creation mechanisms
|
- Implements [[fanchise management is a stack of increasing fan engagement from content extensions through co-creation and co-ownership]] through specific co-creation mechanisms
|
||||||
|
|
|
||||||
|
|
@ -26,6 +26,7 @@ Creator-owned streaming platform focused on comedy content. Reached 1M+ subscrib
|
||||||
- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). Launched $129.99/year superfan tier in response to fan requests to support platform at higher price point.
|
- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). Launched $129.99/year superfan tier in response to fan requests to support platform at higher price point.
|
||||||
- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). Launched $129.99/year superfan tier in response to fan requests for higher-priced support option. Dimension 20 MSG live show sold out (January 2025). Brennan Lee Mulligan signed 3-year deal while simultaneously participating in Critical Role Campaign 4.
|
- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). Launched $129.99/year superfan tier in response to fan requests for higher-priced support option. Dimension 20 MSG live show sold out (January 2025). Brennan Lee Mulligan signed 3-year deal while simultaneously participating in Critical Role Campaign 4.
|
||||||
- **2025-10-01** — Crossed 1 million subscribers with 31% YoY growth; launched $129.99/year superfan tier in response to fan requests to support platform
|
- **2025-10-01** — Crossed 1 million subscribers with 31% YoY growth; launched $129.99/year superfan tier in response to fan requests to support platform
|
||||||
|
- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth); launched $129.99/year superfan tier originated by fan request
|
||||||
## Relationship to KB
|
## Relationship to KB
|
||||||
|
|
||||||
- [[creator-owned-streaming-infrastructure-has-reached-commercial-scale-with-430M-annual-creator-revenue-across-13M-subscribers]]
|
- [[creator-owned-streaming-infrastructure-has-reached-commercial-scale-with-430M-annual-creator-revenue-across-13M-subscribers]]
|
||||||
|
|
|
||||||
|
|
@ -53,3 +53,5 @@ Treasury controlled by token holders through futarchy-based governance. Team can
|
||||||
- **February 2026** — Peak monthly volume of $3.95M
|
- **February 2026** — Peak monthly volume of $3.95M
|
||||||
- **March 15, 2026** — Pine Analytics publishes pre-ICO analysis identifying 182x gross profit multiple concern
|
- **March 15, 2026** — Pine Analytics publishes pre-ICO analysis identifying 182x gross profit multiple concern
|
||||||
- **March 26, 2026** — ICO scheduled on MetaDAO
|
- **March 26, 2026** — ICO scheduled on MetaDAO
|
||||||
|
|
||||||
|
- **2026-03-26** — [[p2p-me-metadao-ico]] Active: ICO scheduled, targeting $6M raise at $15.5M FDV with Pine Analytics identifying 182x gross profit multiple concerns
|
||||||
|
|
@ -1,24 +1,34 @@
|
||||||
{
|
{
|
||||||
"rejected_claims": [
|
"rejected_claims": [
|
||||||
{
|
{
|
||||||
"filename": "pre-deployment-ai-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md",
|
"filename": "pre-deployment-AI-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"filename": "UK-AISI-renaming-to-Security-Institute-signals-government-priority-shift-from-existential-safety-to-cybersecurity-threats.md",
|
||||||
"issues": [
|
"issues": [
|
||||||
"missing_attribution_extractor"
|
"missing_attribution_extractor"
|
||||||
]
|
]
|
||||||
}
|
}
|
||||||
],
|
],
|
||||||
"validation_stats": {
|
"validation_stats": {
|
||||||
"total": 1,
|
"total": 2,
|
||||||
"kept": 0,
|
"kept": 0,
|
||||||
"fixed": 3,
|
"fixed": 6,
|
||||||
"rejected": 1,
|
"rejected": 2,
|
||||||
"fixes_applied": [
|
"fixes_applied": [
|
||||||
"pre-deployment-ai-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:set_created:2026-03-19",
|
"pre-deployment-AI-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:set_created:2026-03-19",
|
||||||
"pre-deployment-ai-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:stripped_wiki_link:voluntary-safety-pledges-cannot-survive-competitive-pressure",
|
"pre-deployment-AI-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:stripped_wiki_link:voluntary-safety-pledges-cannot-survive-competitive-pressure",
|
||||||
"pre-deployment-ai-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:stripped_wiki_link:only-binding-regulation-with-enforcement-teeth-changes-front"
|
"pre-deployment-AI-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:stripped_wiki_link:only-binding-regulation-with-enforcement-teeth-changes-front",
|
||||||
|
"UK-AISI-renaming-to-Security-Institute-signals-government-priority-shift-from-existential-safety-to-cybersecurity-threats.md:set_created:2026-03-19",
|
||||||
|
"UK-AISI-renaming-to-Security-Institute-signals-government-priority-shift-from-existential-safety-to-cybersecurity-threats.md:stripped_wiki_link:government-designation-of-safety-conscious-AI-labs-as-supply",
|
||||||
|
"UK-AISI-renaming-to-Security-Institute-signals-government-priority-shift-from-existential-safety-to-cybersecurity-threats.md:stripped_wiki_link:compute-export-controls-are-the-most-impactful-AI-governance"
|
||||||
],
|
],
|
||||||
"rejections": [
|
"rejections": [
|
||||||
"pre-deployment-ai-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:missing_attribution_extractor"
|
"pre-deployment-AI-evaluation-operates-on-voluntary-collaborative-model-where-labs-can-decline-without-consequence.md:missing_attribution_extractor",
|
||||||
|
"UK-AISI-renaming-to-Security-Institute-signals-government-priority-shift-from-existential-safety-to-cybersecurity-threats.md:missing_attribution_extractor"
|
||||||
]
|
]
|
||||||
},
|
},
|
||||||
"model": "anthropic/claude-sonnet-4.5",
|
"model": "anthropic/claude-sonnet-4.5",
|
||||||
|
|
|
||||||
|
|
@ -10,16 +10,17 @@
|
||||||
"validation_stats": {
|
"validation_stats": {
|
||||||
"total": 1,
|
"total": 1,
|
||||||
"kept": 0,
|
"kept": 0,
|
||||||
"fixed": 2,
|
"fixed": 3,
|
||||||
"rejected": 1,
|
"rejected": 1,
|
||||||
"fixes_applied": [
|
"fixes_applied": [
|
||||||
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:set_created:2026-03-18",
|
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:set_created:2026-03-19",
|
||||||
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:glp-1-persistence-drops-to-15-percent-at-two-years-for-non-d"
|
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:glp-1-persistence-drops-to-15-percent-at-two-years-for-non-d",
|
||||||
|
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:GLP-1 receptor agonists are the largest therapeutic category"
|
||||||
],
|
],
|
||||||
"rejections": [
|
"rejections": [
|
||||||
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:missing_attribution_extractor"
|
"glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:missing_attribution_extractor"
|
||||||
]
|
]
|
||||||
},
|
},
|
||||||
"model": "anthropic/claude-sonnet-4.5",
|
"model": "anthropic/claude-sonnet-4.5",
|
||||||
"date": "2026-03-18"
|
"date": "2026-03-19"
|
||||||
}
|
}
|
||||||
|
|
@ -0,0 +1,26 @@
|
||||||
|
{
|
||||||
|
"rejected_claims": [
|
||||||
|
{
|
||||||
|
"filename": "fan-requested-premium-tiers-signal-community-governance-of-pricing-without-formal-ownership.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"validation_stats": {
|
||||||
|
"total": 1,
|
||||||
|
"kept": 0,
|
||||||
|
"fixed": 3,
|
||||||
|
"rejected": 1,
|
||||||
|
"fixes_applied": [
|
||||||
|
"fan-requested-premium-tiers-signal-community-governance-of-pricing-without-formal-ownership.md:set_created:2026-03-19",
|
||||||
|
"fan-requested-premium-tiers-signal-community-governance-of-pricing-without-formal-ownership.md:stripped_wiki_link:community-ownership-accelerates-growth-through-aligned-evang",
|
||||||
|
"fan-requested-premium-tiers-signal-community-governance-of-pricing-without-formal-ownership.md:stripped_wiki_link:fanchise-management-is-a-stack-of-increasing-fan-engagement-"
|
||||||
|
],
|
||||||
|
"rejections": [
|
||||||
|
"fan-requested-premium-tiers-signal-community-governance-of-pricing-without-formal-ownership.md:missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
"model": "anthropic/claude-sonnet-4.5",
|
||||||
|
"date": "2026-03-19"
|
||||||
|
}
|
||||||
|
|
@ -0,0 +1,46 @@
|
||||||
|
{
|
||||||
|
"rejected_claims": [
|
||||||
|
{
|
||||||
|
"filename": "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"filename": "compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"filename": "third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"validation_stats": {
|
||||||
|
"total": 3,
|
||||||
|
"kept": 0,
|
||||||
|
"fixed": 9,
|
||||||
|
"rejected": 3,
|
||||||
|
"fixes_applied": [
|
||||||
|
"binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:set_created:2026-03-19",
|
||||||
|
"binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front",
|
||||||
|
"binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:voluntary safety commitments collapse under competitive pres",
|
||||||
|
"binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:Anthropics RSP rollback under commercial pressure is the fir",
|
||||||
|
"compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:set_created:2026-03-19",
|
||||||
|
"compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:stripped_wiki_link:compute export controls are the most impactful AI governance",
|
||||||
|
"compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:stripped_wiki_link:nation-states will inevitably assert control over frontier A",
|
||||||
|
"third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:set_created:2026-03-19",
|
||||||
|
"third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:stripped_wiki_link:pre-deployment-AI-evaluations-do-not-predict-real-world-risk"
|
||||||
|
],
|
||||||
|
"rejections": [
|
||||||
|
"binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:missing_attribution_extractor",
|
||||||
|
"compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:missing_attribution_extractor",
|
||||||
|
"third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:missing_attribution_extractor"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
"model": "anthropic/claude-sonnet-4.5",
|
||||||
|
"date": "2026-03-19"
|
||||||
|
}
|
||||||
|
|
@ -0,0 +1,40 @@
|
||||||
|
{
|
||||||
|
"rejected_claims": [
|
||||||
|
{
|
||||||
|
"filename": "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor",
|
||||||
|
"opsec_internal_deal_terms"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
{
|
||||||
|
"filename": "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md",
|
||||||
|
"issues": [
|
||||||
|
"missing_attribution_extractor",
|
||||||
|
"opsec_internal_deal_terms"
|
||||||
|
]
|
||||||
|
}
|
||||||
|
],
|
||||||
|
"validation_stats": {
|
||||||
|
"total": 2,
|
||||||
|
"kept": 0,
|
||||||
|
"fixed": 6,
|
||||||
|
"rejected": 2,
|
||||||
|
"fixes_applied": [
|
||||||
|
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:set_created:2026-03-19",
|
||||||
|
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:stripped_wiki_link:coding-agents-crossed-usability-threshold-december-2025-when",
|
||||||
|
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:stripped_wiki_link:the-gap-between-theoretical-AI-capability-and-observed-deplo",
|
||||||
|
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:set_created:2026-03-19",
|
||||||
|
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:stripped_wiki_link:AI-talent-circulation-between-frontier-labs-transfers-alignm",
|
||||||
|
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:stripped_wiki_link:Anthropics-RSP-rollback-under-commercial-pressure-is-the-fir"
|
||||||
|
],
|
||||||
|
"rejections": [
|
||||||
|
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:missing_attribution_extractor",
|
||||||
|
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:opsec_internal_deal_terms",
|
||||||
|
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:missing_attribution_extractor",
|
||||||
|
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:opsec_internal_deal_terms"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
"model": "anthropic/claude-sonnet-4.5",
|
||||||
|
"date": "2026-03-19"
|
||||||
|
}
|
||||||
|
|
@ -7,13 +7,17 @@ date: 2026-03-01
|
||||||
domain: ai-alignment
|
domain: ai-alignment
|
||||||
secondary_domains: []
|
secondary_domains: []
|
||||||
format: article
|
format: article
|
||||||
status: unprocessed
|
status: enrichment
|
||||||
priority: medium
|
priority: medium
|
||||||
tags: [evaluation-infrastructure, pre-deployment, METR, AISI, voluntary-collaborative, Inspect, Claude-Opus-4-6, cyber-evaluation]
|
tags: [evaluation-infrastructure, pre-deployment, METR, AISI, voluntary-collaborative, Inspect, Claude-Opus-4-6, cyber-evaluation]
|
||||||
processed_by: theseus
|
processed_by: theseus
|
||||||
processed_date: 2026-03-19
|
processed_date: 2026-03-19
|
||||||
enrichments_applied: ["pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations.md"]
|
enrichments_applied: ["pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations.md"]
|
||||||
extraction_model: "anthropic/claude-sonnet-4.5"
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
|
processed_by: theseus
|
||||||
|
processed_date: 2026-03-19
|
||||||
|
enrichments_applied: ["pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations.md"]
|
||||||
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
---
|
---
|
||||||
|
|
||||||
## Content
|
## Content
|
||||||
|
|
@ -49,7 +53,7 @@ Synthesized overview of the two main organizations conducting pre-deployment AI
|
||||||
**KB connections:**
|
**KB connections:**
|
||||||
- [[voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints]] — voluntary evaluation has the same structural problem; a lab can simply not invite METR
|
- [[voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints]] — voluntary evaluation has the same structural problem; a lab can simply not invite METR
|
||||||
- [[technology advances exponentially but coordination mechanisms evolve linearly creating a widening gap]] — METR and AISI are growing their evaluation capacity, but AI capabilities are growing faster; the gap widens in every period
|
- [[technology advances exponentially but coordination mechanisms evolve linearly creating a widening gap]] — METR and AISI are growing their evaluation capacity, but AI capabilities are growing faster; the gap widens in every period
|
||||||
- [[government designation of safety-conscious AI labs as supply chain risks inverts the regulatory dynamic]] — AISI renaming to "Security Institute" is a softer version of the same dynamic — government safety infrastructure shifting to serve government security interests rather than existential risk reduction
|
- government designation of safety-conscious AI labs as supply chain risks inverts the regulatory dynamic — AISI renaming to "Security Institute" is a softer version of the same dynamic — government safety infrastructure shifting to serve government security interests rather than existential risk reduction
|
||||||
|
|
||||||
**Extraction hints:**
|
**Extraction hints:**
|
||||||
- Key claim: "Pre-deployment AI evaluation operates on a voluntary-collaborative model where evaluators (METR, AISI) require lab cooperation, meaning labs that decline evaluation face no consequence"
|
- Key claim: "Pre-deployment AI evaluation operates on a voluntary-collaborative model where evaluators (METR, AISI) require lab cooperation, meaning labs that decline evaluation face no consequence"
|
||||||
|
|
@ -72,3 +76,14 @@ EXTRACTION HINT: Focus on the voluntary-collaborative limitation: no evaluation
|
||||||
- UK AISI was renamed from 'AI Safety Institute' to 'AI Security Institute' in 2026
|
- UK AISI was renamed from 'AI Safety Institute' to 'AI Security Institute' in 2026
|
||||||
- UK AISI tested 7 LLMs on custom cyber ranges as of March 16, 2026
|
- UK AISI tested 7 LLMs on custom cyber ranges as of March 16, 2026
|
||||||
- METR maintains a Frontier AI Safety Policies repository covering Amazon, Anthropic, Google DeepMind, Meta, Microsoft, and OpenAI
|
- METR maintains a Frontier AI Safety Policies repository covering Amazon, Anthropic, Google DeepMind, Meta, Microsoft, and OpenAI
|
||||||
|
|
||||||
|
|
||||||
|
## Key Facts
|
||||||
|
- METR reviewed Anthropic's Claude Opus 4.6 sabotage risk report on March 12, 2026
|
||||||
|
- UK AISI tested 7 LLMs on custom cyber ranges as of March 16, 2026
|
||||||
|
- UK AISI was renamed from 'AI Safety Institute' to 'AI Security Institute' in 2026
|
||||||
|
- METR maintains a Frontier AI Safety Policies repository covering Amazon, Anthropic, Google DeepMind, Meta, Microsoft, and OpenAI
|
||||||
|
- UK AISI released the Inspect evaluation framework in April 2024
|
||||||
|
- UK AISI released Inspect Scout transcript analysis tool on February 25, 2026
|
||||||
|
- UK AISI released ControlArena library for AI control experiments on October 22, 2025
|
||||||
|
- UK AISI conducted international joint testing exercise on agentic systems in July 2025
|
||||||
|
|
|
||||||
|
|
@ -7,13 +7,17 @@ date: 2026-03-01
|
||||||
domain: health
|
domain: health
|
||||||
secondary_domains: []
|
secondary_domains: []
|
||||||
format: review
|
format: review
|
||||||
status: unprocessed
|
status: enrichment
|
||||||
priority: high
|
priority: high
|
||||||
tags: [glp-1, lifestyle-modification, exercise, sarcopenia, muscle-preservation, adherence, weight-regain, obesity]
|
tags: [glp-1, lifestyle-modification, exercise, sarcopenia, muscle-preservation, adherence, weight-regain, obesity]
|
||||||
processed_by: vida
|
processed_by: vida
|
||||||
processed_date: 2026-03-18
|
processed_date: 2026-03-18
|
||||||
enrichments_applied: ["glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md", "GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md"]
|
enrichments_applied: ["glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md", "GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md"]
|
||||||
extraction_model: "anthropic/claude-sonnet-4.5"
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
|
processed_by: vida
|
||||||
|
processed_date: 2026-03-19
|
||||||
|
enrichments_applied: ["glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md", "GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md"]
|
||||||
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
---
|
---
|
||||||
|
|
||||||
## Content
|
## Content
|
||||||
|
|
@ -88,3 +92,13 @@ EXTRACTION HINT: Focus on the GLP-1 alone vs. GLP-1+exercise regain comparison
|
||||||
- Up to 50% of adults over 80 experience sarcopenia; aging reduces muscle mass 12-16% independent of weight loss interventions
|
- Up to 50% of adults over 80 experience sarcopenia; aging reduces muscle mass 12-16% independent of weight loss interventions
|
||||||
- Tirzepatide may have better muscle preservation profile than semaglutide (preliminary data, not FDA-approved for this indication)
|
- Tirzepatide may have better muscle preservation profile than semaglutide (preliminary data, not FDA-approved for this indication)
|
||||||
- BALANCE model includes lifestyle support component but specific exercise programming details not specified in source
|
- BALANCE model includes lifestyle support component but specific exercise programming details not specified in source
|
||||||
|
|
||||||
|
|
||||||
|
## Key Facts
|
||||||
|
- WHO December 2025 guidelines specifically recommend GLP-1 therapies 'combined with intensive behavioral therapy to maximize and sustain benefits'
|
||||||
|
- Meta-analysis of 22 RCTs with 2,258 participants found approximately 25% of GLP-1 weight loss is lean mass
|
||||||
|
- Without exercise, 15-40% of GLP-1 weight loss is lean mass; with resistance training, lean mass loss is substantially reduced
|
||||||
|
- Up to 50% of adults over 80 experience sarcopenia; aging reduces muscle mass 12-16% independent of weight loss interventions
|
||||||
|
- At week 52 all intervention groups regained weight after stopping; by week 104: placebo +7.6 kg, liraglutide only +8.7 kg, exercise only +5.4 kg, combination +3.5 kg
|
||||||
|
- Tirzepatide may have better muscle preservation profile than semaglutide (preliminary data, not FDA-approved for this indication)
|
||||||
|
- ADA notes new therapies claiming 'enhanced quality of weight loss by improving muscle preservation' but no FDA-approved compounds with proven muscle preservation yet
|
||||||
|
|
|
||||||
|
|
@ -7,9 +7,13 @@ date: 2025-10-01
|
||||||
domain: entertainment
|
domain: entertainment
|
||||||
secondary_domains: []
|
secondary_domains: []
|
||||||
format: article
|
format: article
|
||||||
status: unprocessed
|
status: enrichment
|
||||||
priority: medium
|
priority: medium
|
||||||
tags: [dropout, superfan, subscription-economics, community-economics, sam-reich, indie-streaming, 1-million-subscribers]
|
tags: [dropout, superfan, subscription-economics, community-economics, sam-reich, indie-streaming, 1-million-subscribers]
|
||||||
|
processed_by: clay
|
||||||
|
processed_date: 2026-03-19
|
||||||
|
enrichments_applied: ["creator-owned-streaming-infrastructure-has-reached-commercial-scale-with-430M-annual-creator-revenue-across-13M-subscribers.md", "indie-streaming-platforms-emerged-as-category-by-2024-with-convergent-structural-patterns-across-content-verticals.md"]
|
||||||
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
---
|
---
|
||||||
|
|
||||||
## Content
|
## Content
|
||||||
|
|
@ -57,3 +61,12 @@ Variety exclusive interview with Sam Reich (Dropout CEO) about the platform cros
|
||||||
PRIMARY CONNECTION: [[community ownership accelerates growth through aligned evangelism not passive holding]]
|
PRIMARY CONNECTION: [[community ownership accelerates growth through aligned evangelism not passive holding]]
|
||||||
WHY ARCHIVED: Primary source for the "voluntary premium subscription = functionally equivalent to token ownership" claim. The fan-requested superfan tier is the clearest evidence that community alignment doesn't require Web3.
|
WHY ARCHIVED: Primary source for the "voluntary premium subscription = functionally equivalent to token ownership" claim. The fan-requested superfan tier is the clearest evidence that community alignment doesn't require Web3.
|
||||||
EXTRACTION HINT: Focus on the fan-originated tier (they ASKED for it) as the novel finding — this is community governance of pricing, not just community consumption. Contrast with Doodles DOOD token mechanics.
|
EXTRACTION HINT: Focus on the fan-originated tier (they ASKED for it) as the novel finding — this is community governance of pricing, not just community consumption. Contrast with Doodles DOOD token mechanics.
|
||||||
|
|
||||||
|
|
||||||
|
## Key Facts
|
||||||
|
- Dropout crossed 1 million subscribers in October 2025
|
||||||
|
- Dropout subscriber growth 2024→2025: 31%
|
||||||
|
- Dropout superfan tier pricing: $129.99/year (approximately 2x standard tier)
|
||||||
|
- Dimension 20 MSG live taping sold out in January 2025 (20,000 seat capacity)
|
||||||
|
- Brennan Lee Mulligan signed 3-year Dropout deal while simultaneously participating in Critical Role Campaign 4
|
||||||
|
- Dropout did not use paid marketing until 2022; distribution relies on short clips shared organically by fans
|
||||||
|
|
|
||||||
|
|
@ -6,10 +6,14 @@ url: null
|
||||||
date_published: 2026-03-16
|
date_published: 2026-03-16
|
||||||
date_archived: 2026-03-16
|
date_archived: 2026-03-16
|
||||||
domain: ai-alignment
|
domain: ai-alignment
|
||||||
status: unprocessed
|
status: enrichment
|
||||||
processed_by: theseus
|
processed_by: theseus
|
||||||
tags: [ai-governance, coordination, safety-commitments, regulation, enforcement, voluntary-pledges]
|
tags: [ai-governance, coordination, safety-commitments, regulation, enforcement, voluntary-pledges]
|
||||||
sourced_via: "Theseus research agent — 45 web searches synthesized from Brookings, Stanford FMTI, EU legislation, OECD, government publications, TechCrunch, TIME, CNN, Fortune, academic papers"
|
sourced_via: "Theseus research agent — 45 web searches synthesized from Brookings, Stanford FMTI, EU legislation, OECD, government publications, TechCrunch, TIME, CNN, Fortune, academic papers"
|
||||||
|
processed_by: theseus
|
||||||
|
processed_date: 2026-03-19
|
||||||
|
enrichments_applied: ["AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md", "Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md", "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md", "compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md", "AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md"]
|
||||||
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
---
|
---
|
||||||
|
|
||||||
# Empirical Evidence: AI Coordination and Governance Mechanisms That Changed Behavior
|
# Empirical Evidence: AI Coordination and Governance Mechanisms That Changed Behavior
|
||||||
|
|
@ -51,3 +55,19 @@ Core finding: almost no international AI governance mechanism has produced verif
|
||||||
- Insurance/liability: market projected $29.7B by 2033. Creates market incentives aligned with safety.
|
- Insurance/liability: market projected $29.7B by 2033. Creates market incentives aligned with safety.
|
||||||
- Third-party auditing: METR, Apollo Research. Apollo warns ecosystem unsustainable without regulatory mandate.
|
- Third-party auditing: METR, Apollo Research. Apollo warns ecosystem unsustainable without regulatory mandate.
|
||||||
- Futarchy: implemented for DAO governance (MetaDAO, Optimism experiment) but not yet for AI governance.
|
- Futarchy: implemented for DAO governance (MetaDAO, Optimism experiment) but not yet for AI governance.
|
||||||
|
|
||||||
|
|
||||||
|
## Key Facts
|
||||||
|
- EU AI Act: Apple paused Apple Intelligence in EU, Meta changed ads, EUR 500M+ fines under DMA
|
||||||
|
- China implemented mandatory algorithm filing with criminal enforcement (August 2023)
|
||||||
|
- US export controls: tiered country system, deployment caps, Nvidia compliance chips (H800, A800)
|
||||||
|
- Stanford FMTI transparency scores: -17 points mean (2024→2025), Meta -29, Mistral -37, OpenAI -14
|
||||||
|
- OpenAI removed 'safely' from mission statement (November 2025)
|
||||||
|
- OpenAI dissolved Superalignment team (May 2024) and Mission Alignment team (February 2026)
|
||||||
|
- Google accused by 60 UK lawmakers of violating Seoul commitments (Gemini 2.5 Pro, April 2025)
|
||||||
|
- 450+ organizations lobbied on AI in 2025 (up from 6 in 2016), $92M in lobbying fees Q1-Q3 2025
|
||||||
|
- California SB 1047 vetoed after industry lobbying
|
||||||
|
- Watermarking: 38% implementation rate across frontier labs
|
||||||
|
- US AISI defunded/rebranded after initial establishment
|
||||||
|
- UK-US joint evaluation of OpenAI o1 model conducted
|
||||||
|
- Insurance/liability market projected $29.7B by 2033
|
||||||
|
|
|
||||||
|
|
@ -7,10 +7,13 @@ date_published: 2026-03-16
|
||||||
date_archived: 2026-03-16
|
date_archived: 2026-03-16
|
||||||
domain: ai-alignment
|
domain: ai-alignment
|
||||||
secondary_domains: [internet-finance]
|
secondary_domains: [internet-finance]
|
||||||
status: unprocessed
|
status: enrichment
|
||||||
processed_by: theseus
|
processed_by: theseus
|
||||||
tags: [industry-landscape, ai-labs, funding, competitive-dynamics, startups, investors]
|
tags: [industry-landscape, ai-labs, funding, competitive-dynamics, startups, investors]
|
||||||
sourced_via: "Theseus research agent — 33 web searches synthesized from MIT Tech Review, TechCrunch, Crunchbase, OECD, company announcements, CNBC, Fortune, etc."
|
sourced_via: "Theseus research agent — 33 web searches synthesized from MIT Tech Review, TechCrunch, Crunchbase, OECD, company announcements, CNBC, Fortune, etc."
|
||||||
|
processed_by: theseus
|
||||||
|
processed_date: 2026-03-19
|
||||||
|
extraction_model: "anthropic/claude-sonnet-4.5"
|
||||||
---
|
---
|
||||||
|
|
||||||
# AI Industry Landscape Briefing — March 2026
|
# AI Industry Landscape Briefing — March 2026
|
||||||
|
|
@ -54,3 +57,19 @@ Multi-source synthesis of the current AI industry state. Key data points:
|
||||||
- Daniel Gross → left SSI for Meta superintelligence team
|
- Daniel Gross → left SSI for Meta superintelligence team
|
||||||
- John Schulman → left OpenAI for Thinking Machines Lab
|
- John Schulman → left OpenAI for Thinking Machines Lab
|
||||||
- 11+ Google executives → Microsoft in 2025
|
- 11+ Google executives → Microsoft in 2025
|
||||||
|
|
||||||
|
|
||||||
|
## Key Facts
|
||||||
|
- xAI reached ~$230B valuation with Grok 4/4.1 leading LMArena, 1M+ H100 GPUs, $20B Series E Jan 2026
|
||||||
|
- Mistral reached $13.8B valuation, EUR 300M ARR targeting EUR 1B, building European sovereign compute
|
||||||
|
- Google DeepMind released Gemini 3/3.1 family, 21% enterprise LLM spend, $175-185B capex 2026, Deep Think achieved gold-medal Olympiad results
|
||||||
|
- Sierra (Bret Taylor) reached $10B+ valuation in agentic customer service
|
||||||
|
- Databricks reached $134B valuation, $5B Series L, filed for IPO Q2 2026
|
||||||
|
- 2025 total AI VC: $259-270B (52-61% of all global VC)
|
||||||
|
- Feb 2026 AI funding: $189B (largest single month ever)
|
||||||
|
- 75-79% of AI funding to US companies
|
||||||
|
- Inference cost deflation ~10x/year
|
||||||
|
- Chinese open-source (Qwen, DeepSeek) capturing 50-60% of new open-model adoption
|
||||||
|
- 95% of enterprise AI pilots fail to deliver ROI (MIT Project NANDA)
|
||||||
|
- Big 5 AI capex: $660-690B planned 2026
|
||||||
|
- US deregulating AI, EU softening regulations
|
||||||
|
|
|
||||||
Loading…
Reference in a new issue