diff --git a/domains/ai-alignment/AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md b/domains/ai-alignment/AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md index 66269a85..461ae640 100644 --- a/domains/ai-alignment/AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md +++ b/domains/ai-alignment/AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md @@ -30,6 +30,12 @@ This concentration has direct alignment implications: The counterfactual worth tracking: Chinese open-source models (Qwen, DeepSeek) now capture 50-60% of new open-model adoption globally. If open-source models close the capability gap (currently 6-18 months, shrinking), capital concentration at the frontier may become less alignment-relevant as capability diffuses. But as of March 2026, frontier capability remains concentrated. + +### Additional Evidence (extend) +*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19* + +450+ organizations lobbied on AI in 2025, up from 6 in 2016. $92M in lobbying fees Q1-Q3 2025. Industry successfully blocked California SB 1047 through coordinated lobbying. Concentration creates not just market power but political power—oligopoly structure enables collective action to prevent binding regulation. 
+ --- Relevant Notes: diff --git a/domains/ai-alignment/AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md b/domains/ai-alignment/AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md index 4f70867e..80f49a69 100644 --- a/domains/ai-alignment/AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md +++ b/domains/ai-alignment/AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md @@ -41,6 +41,12 @@ Expert consensus identifies 'external scrutiny, proactive evaluation and transpa STREAM proposal identifies that current model reports lack 'sufficient detail to enable meaningful independent assessment' of dangerous capability evaluations. The need for a standardized reporting framework confirms that transparency problems extend beyond general disclosure (FMTI scores) to the specific domain of dangerous capability evaluation where external verification is currently impossible. + +### Additional Evidence (confirm) +*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19* + +Stanford FMTI 2024→2025 data: mean transparency score declined 17 points. Meta -29 points, Mistral -37 points, OpenAI -14 points. OpenAI removed 'safely' from mission statement (Nov 2025), dissolved Superalignment team (May 2024) and Mission Alignment team (Feb 2026). Google accused by 60 UK lawmakers of violating Seoul commitments with Gemini 2.5 Pro (Apr 2025). 
+ --- Relevant Notes: diff --git a/domains/ai-alignment/Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md b/domains/ai-alignment/Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md index 59bb4483..3507d90c 100644 --- a/domains/ai-alignment/Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md +++ b/domains/ai-alignment/Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md @@ -21,6 +21,12 @@ This is not a story about Anthropic's leadership failing. It is a story about [[ The alignment implication is structural: if the most safety-motivated lab with the most commercially successful safety brand cannot maintain binding safety commitments, then voluntary self-regulation is not a viable alignment strategy. This strengthens the case for coordination-based approaches — [[AI alignment is a coordination problem not a technical problem]] — because the failure mode is not that safety is technically impossible but that unilateral safety is economically unsustainable. + +### Additional Evidence (confirm) +*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19* + +Anthropic's own language in RSP documentation: commitments are 'very hard to meet without industry-wide coordination.' OpenAI made safety explicitly conditional on competitor behavior in Preparedness Framework v2 (April 2025). 
Pattern holds across all voluntary commitments—no frontier lab maintained unilateral safety constraints when competitors advanced without them. + --- Relevant Notes: diff --git a/domains/ai-alignment/compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md b/domains/ai-alignment/compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md index b407badb..b699cd13 100644 --- a/domains/ai-alignment/compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md +++ b/domains/ai-alignment/compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md @@ -30,6 +30,12 @@ For alignment, this means the governance infrastructure that exists (export cont The CFR article confirms diverging governance philosophies between democracies and authoritarian systems, with China's amended Cybersecurity Law emphasizing state oversight while the US pursues standard-setting body engagement. Horowitz notes the US 'must engage in standard-setting bodies to counter China's AI governance influence,' indicating that the most active governance is competitive positioning rather than safety coordination. + +### Additional Evidence (extend) +*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19* + +US export controls use tiered country system with deployment caps. Nvidia designed compliance chips (H800, A800) specifically to meet regulatory thresholds. Mechanism proves compute governance CAN work when backed by state enforcement, but current implementation optimizes for strategic advantage over China rather than catastrophic risk reduction. 
KYC for compute proposed but not implemented, showing technical feasibility without political will. + --- Relevant Notes: diff --git a/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md b/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md index 015955cd..3a70c264 100644 --- a/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md +++ b/domains/ai-alignment/voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md @@ -39,6 +39,12 @@ The International AI Safety Report 2026 (multi-government committee, February 20 The gap between expert consensus (76 specialists identify third-party audits as top-3 priority) and actual implementation (no mandatory audit requirements at major labs) demonstrates that knowing what's needed is insufficient. Even when the field's experts across multiple domains agree on priorities, competitive dynamics prevent voluntary adoption. + +### Additional Evidence (confirm) +*Source: [[2026-03-16-theseus-ai-coordination-governance-evidence]] | Added: 2026-03-19* + +Comprehensive evidence across governance mechanisms: ALL international declarations (Bletchley, Seoul, Paris, Hiroshima, OECD, UN) produced zero verified behavioral change. Frontier Model Forum produced no binding commitments. White House voluntary commitments eroded. 450+ organizations lobbied on AI in 2025 ($92M in fees), California SB 1047 vetoed after industry pressure. 
Only binding regulation (EU AI Act, China enforcement, US export controls) changed behavior. + --- Relevant Notes: diff --git a/domains/health/GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md b/domains/health/GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md index 8a55608b..0b4f4b8c 100644 --- a/domains/health/GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md +++ b/domains/health/GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md @@ -103,16 +103,22 @@ Value in Health modeling study shows Medicare saves $715M over 10 years with com ### Additional Evidence (challenge) -*Source: [[2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction]] | Added: 2026-03-18* +*Source: 2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction | Added: 2026-03-18* Aon's temporal cost analysis shows medical costs rise 23% in year 1 but grow only 2% after 12 months (vs 6% for non-users), with diabetes patients showing 6-9 percentage point lower cost growth at 30 months. This suggests the 'inflationary through 2035' claim may only apply to short-term payers, while long-term risk-bearers see net savings. 
### Additional Evidence (challenge) -*Source: [[2026-03-19-glp1-price-compression-international-generics-claim-challenge]] | Added: 2026-03-19* +*Source: 2026-03-19-glp1-price-compression-international-generics-claim-challenge | Added: 2026-03-19* International generic competition beginning January 2026 (Canada patent expiry, immediate Sandoz/Apotex/Teva filings) creates price compression trajectory faster than 'inflationary through 2035' assumes. Oral Wegovy launched at $149-299/month (5-8x reduction vs $1,300/month injectable). China/India generics projected at $40-50/month by 2030. Aon 192K patient study shows break-even timing is highly price-sensitive: at $1,300/month, multi-year retention required; at $50-150/month, Aon data suggests cost savings within 12-18 months under capitation. The 'inflationary through 2035' conclusion holds at current US pricing but becomes invalid if international generic arbitrage and oral formulation competition compress effective prices to $50-150/month range by 2030. Scope qualification needed: claim is valid conditional on pricing trajectory assumptions that are now challenged by G7 patent cliff precedent. + +### Additional Evidence (challenge) +*Source: [[2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach]] | Added: 2026-03-19* + +If GLP-1 + exercise combination produces durable weight maintenance (3.5 kg regain vs 8.7 kg for medication alone), and if behavioral change persists after medication discontinuation, then the chronic use model may not be necessary for long-term value capture. This challenges the inflationary cost projection if the optimal intervention is time-limited medication + permanent behavioral change rather than lifetime pharmacotherapy. 
+ --- Relevant Notes: diff --git a/domains/health/glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md b/domains/health/glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md index f89adfb2..7c183635 100644 --- a/domains/health/glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md +++ b/domains/health/glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md @@ -85,16 +85,22 @@ Weight regain data shows that even among patients who complete treatment, GLP-1 ### Additional Evidence (extend) -*Source: [[2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction]] | Added: 2026-03-18* +*Source: 2026-01-13-aon-glp1-employer-cost-savings-cancer-reduction | Added: 2026-03-18* Aon data shows the 80%+ adherent cohort captures dramatically stronger cost reductions (9 percentage points lower for diabetes, 7 points for weight loss), confirming that adherence is the binding variable for economic viability. The adherence-dependent savings pattern means low persistence rates eliminate cost-effectiveness even when clinical benefits exist. ### Additional Evidence (extend) -*Source: [[2026-03-19-vida-ai-biology-acceleration-healthspan-constraint]] | Added: 2026-03-19* +*Source: 2026-03-19-vida-ai-biology-acceleration-healthspan-constraint | Added: 2026-03-19* GLP-1 behavioral adherence failures demonstrate that even breakthrough pharmacology cannot overcome behavioral determinants: patients on GLP-1 alone show same weight regain as placebo without behavior change. This is direct evidence that the 'human constraints' factor (Amodei framework) limits pharmaceutical efficacy independent of drug quality. 
+ +### Additional Evidence (extend) +*Source: [[2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach]] | Added: 2026-03-19* + +Weight regain data shows GLP-1 alone (8.7 kg regain) performs no better than placebo (7.6 kg) after discontinuation, while combination with exercise reduces regain to 3.5 kg. This suggests the low persistence rates may be economically rational from a patient perspective if medication alone provides no durable benefit—patients who discontinue without establishing exercise habits return to baseline regardless of medication duration. + --- Relevant Notes: diff --git a/entities/ai-alignment/anthropic.md b/entities/ai-alignment/anthropic.md index 47169b2f..f8de31b6 100644 --- a/entities/ai-alignment/anthropic.md +++ b/entities/ai-alignment/anthropic.md @@ -49,6 +49,11 @@ Frontier AI safety laboratory founded by former OpenAI VP of Research Dario Amod - **2026-03-18** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (reported by HKS Carr-Ryan Center) - **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons; Anthropic refused publicly and Pentagon retaliated (HKS Carr-Ryan Center report) +- **2026-02** — Abandoned binding RSP (Responsible Scaling Policy) +- **2026-03** — Reached $380B valuation, ~$19B annualized revenue (10x YoY sustained 3 years) +- **2026-03** — Claude Code achieved 54% enterprise coding market share, $2.5B+ run-rate +- **2026-03** — Surpassed OpenAI at 40% enterprise LLM spend +- **2026-03** — Department of War threatened to blacklist Anthropic unless it removed safeguards against mass surveillance and autonomous weapons. Anthropic refused publicly and faced Pentagon retaliation. ## Competitive Position Strongest position in enterprise AI and coding. 
Revenue growth (10x YoY) outpaces all competitors. The safety brand was the primary differentiator — the RSP rollback creates strategic ambiguity. CEO publicly uncomfortable with power concentration while racing to concentrate it. diff --git a/entities/ai-alignment/openai.md b/entities/ai-alignment/openai.md index 72063ffa..4bff74f8 100644 --- a/entities/ai-alignment/openai.md +++ b/entities/ai-alignment/openai.md @@ -45,6 +45,12 @@ The largest and most-valued AI laboratory. OpenAI pioneered the transformer-base - **2026-02** — Raised $110B at $840B valuation, restructured to PBC - **2026** — IPO preparation underway +- **2025-2026** — John Schulman departed for Thinking Machines Lab +- **2026-03** — Reached $840B valuation, ~$25B annualized revenue +- **2026-03** — 68% consumer market share, 27% enterprise LLM spend +- **2026-03** — Released GPT-5/5.2/5.3 +- **2026-03** — Restructured to Public Benefit Corporation +- **2026-03** — IPO expected H2 2026-2027 ## Competitive Position Highest valuation and strongest consumer brand, but losing enterprise share to Anthropic. The Microsoft partnership (exclusive API hosting) provides distribution but also dependency. Key vulnerability: the enterprise coding market — where Anthropic's Claude Code dominates — may prove more valuable than consumer chat. 
diff --git a/entities/ai-alignment/uk-aisi.md b/entities/ai-alignment/uk-aisi.md index c56907b2..79341bfb 100644 --- a/entities/ai-alignment/uk-aisi.md +++ b/entities/ai-alignment/uk-aisi.md @@ -33,6 +33,14 @@ The first government-established AI safety evaluation body, created after the Bl - **2024-12** — Joint pre-deployment evaluation of OpenAI o1 with US AISI - **2025-02** — Rebranded to "AI Security Institute" +- **2026-03-16** — Published cyber capability testing results on 7 LLMs using custom-built cyber ranges +- **2026-00-00** — Renamed from 'AI Safety Institute' to 'AI Security Institute' +- **2026-02-25** — Released Inspect Scout transcript analysis tool +- **2026-02-17** — Published universal jailbreak assessment against best-defended systems +- **2025-10-22** — Released ControlArena library for AI control experiments +- **2025-07-00** — Conducted international joint testing exercise on agentic systems +- **2025-05-00** — Released HiBayES statistical modeling framework +- **2024-04-00** — Released open-source Inspect evaluation framework ## Alignment Significance The UK AISI is the strongest evidence that institutional infrastructure CAN be created from international coordination — but also the strongest evidence that institutional infrastructure without enforcement authority has limited impact. Labs grant access voluntarily. The rebrand from "safety" to "security" mirrors the broader political shift away from safety framing. diff --git a/entities/entertainment/dropout.md b/entities/entertainment/dropout.md index b255a362..6d9f0b88 100644 --- a/entities/entertainment/dropout.md +++ b/entities/entertainment/dropout.md @@ -26,6 +26,7 @@ Creator-owned streaming platform focused on comedy content. Reached 1M+ subscrib - **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). Launched $129.99/year superfan tier in response to fan requests to support platform at higher price point. - **2025-10-01** — Crossed 1 million subscribers (31% YoY growth). 
Launched $129.99/year superfan tier in response to fan requests for higher-priced support option. Dimension 20 MSG live show sold out (January 2025). Brennan Lee Mulligan signed 3-year deal while simultaneously participating in Critical Role Campaign 4. - **2025-10-01** — Crossed 1 million subscribers with 31% YoY growth; launched $129.99/year superfan tier in response to fan requests to support platform +- **2025-10-01** — Crossed 1 million subscribers (31% YoY growth); launched $129.99/year superfan tier originated by fan request ## Relationship to KB - [[creator-owned-streaming-infrastructure-has-reached-commercial-scale-with-430M-annual-creator-revenue-across-13M-subscribers]] diff --git a/entities/internet-finance/p2p-me.md b/entities/internet-finance/p2p-me.md index c6b360a2..8540527e 100644 --- a/entities/internet-finance/p2p-me.md +++ b/entities/internet-finance/p2p-me.md @@ -53,3 +53,5 @@ Treasury controlled by token holders through futarchy-based governance. Team can - **February 2026** — Peak monthly volume of $3.95M - **March 15, 2026** — Pine Analytics publishes pre-ICO analysis identifying 182x gross profit multiple concern - **March 26, 2026** — ICO scheduled on MetaDAO + +- **2026-03-26** — [[p2p-me-metadao-ico]] Active: ICO scheduled, targeting $6M raise at $15.5M FDV with Pine Analytics identifying 182x gross profit multiple concerns \ No newline at end of file diff --git a/inbox/queue/2026-03-18-moonvillage-he3-power-mobility-dilemma.md b/inbox/archive/space-development/2026-03-18-moonvillage-he3-power-mobility-dilemma.md similarity index 100% rename from inbox/queue/2026-03-18-moonvillage-he3-power-mobility-dilemma.md rename to inbox/archive/space-development/2026-03-18-moonvillage-he3-power-mobility-dilemma.md diff --git a/inbox/queue/.extraction-debug/2026-02-28-demoura-when-ai-writes-software.json b/inbox/queue/.extraction-debug/2026-02-28-demoura-when-ai-writes-software.json new file mode 100644 index 00000000..b580e3f3 --- /dev/null +++ 
b/inbox/queue/.extraction-debug/2026-02-28-demoura-when-ai-writes-software.json @@ -0,0 +1,35 @@ +{ + "rejected_claims": [ + { + "filename": "ai-generated-code-overfits-test-suites-creating-adversarial-brittleness-that-testing-cannot-detect.md", + "issues": [ + "missing_attribution_extractor" + ] + }, + { + "filename": "supply-chain-poisoning-via-ai-training-data-enables-systematic-vulnerability-injection-at-ecosystem-scale.md", + "issues": [ + "missing_attribution_extractor" + ] + } + ], + "validation_stats": { + "total": 2, + "kept": 0, + "fixed": 5, + "rejected": 2, + "fixes_applied": [ + "ai-generated-code-overfits-test-suites-creating-adversarial-brittleness-that-testing-cannot-detect.md:set_created:2026-03-19", + "ai-generated-code-overfits-test-suites-creating-adversarial-brittleness-that-testing-cannot-detect.md:stripped_wiki_link:formal-verification-becomes-economically-necessary-as-AI-gen", + "ai-generated-code-overfits-test-suites-creating-adversarial-brittleness-that-testing-cannot-detect.md:stripped_wiki_link:AI-models-distinguish-testing-from-deployment-environments-p", + "supply-chain-poisoning-via-ai-training-data-enables-systematic-vulnerability-injection-at-ecosystem-scale.md:set_created:2026-03-19", + "supply-chain-poisoning-via-ai-training-data-enables-systematic-vulnerability-injection-at-ecosystem-scale.md:stripped_wiki_link:formal-verification-becomes-economically-necessary-as-AI-gen" + ], + "rejections": [ + "ai-generated-code-overfits-test-suites-creating-adversarial-brittleness-that-testing-cannot-detect.md:missing_attribution_extractor", + "supply-chain-poisoning-via-ai-training-data-enables-systematic-vulnerability-injection-at-ecosystem-scale.md:missing_attribution_extractor" + ] + }, + "model": "anthropic/claude-sonnet-4.5", + "date": "2026-03-19" +} \ No newline at end of file diff --git a/inbox/queue/.extraction-debug/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.json 
b/inbox/queue/.extraction-debug/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.json index 20ebfd0d..b2904bbd 100644 --- a/inbox/queue/.extraction-debug/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.json +++ b/inbox/queue/.extraction-debug/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.json @@ -10,16 +10,17 @@ "validation_stats": { "total": 1, "kept": 0, - "fixed": 2, + "fixed": 3, "rejected": 1, "fixes_applied": [ - "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:set_created:2026-03-18", - "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:glp-1-persistence-drops-to-15-percent-at-two-years-for-non-d" + "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:set_created:2026-03-19", + "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:glp-1-persistence-drops-to-15-percent-at-two-years-for-non-d", + "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:stripped_wiki_link:GLP-1 receptor agonists are the largest therapeutic category" ], "rejections": [ "glp-1-combined-with-structured-exercise-achieves-60-percent-better-weight-maintenance-than-medication-alone-after-discontinuation.md:missing_attribution_extractor" ] }, "model": "anthropic/claude-sonnet-4.5", - "date": "2026-03-18" + "date": "2026-03-19" } \ No newline at end of file diff --git a/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-coordination-governance-evidence.json b/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-coordination-governance-evidence.json new file mode 100644 index 00000000..cf203310 --- 
/dev/null +++ b/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-coordination-governance-evidence.json @@ -0,0 +1,46 @@ +{ + "rejected_claims": [ + { + "filename": "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md", + "issues": [ + "missing_attribution_extractor" + ] + }, + { + "filename": "compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md", + "issues": [ + "missing_attribution_extractor" + ] + }, + { + "filename": "third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md", + "issues": [ + "missing_attribution_extractor" + ] + } + ], + "validation_stats": { + "total": 3, + "kept": 0, + "fixed": 9, + "rejected": 3, + "fixes_applied": [ + "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:set_created:2026-03-19", + "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:only binding regulation with enforcement teeth changes front", + "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:voluntary safety commitments collapse under competitive pres", + "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:stripped_wiki_link:Anthropics RSP rollback under commercial pressure is the fir", + "compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:set_created:2026-03-19", + "compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:stripped_wiki_link:compute export controls are the most impactful AI governance", + 
"compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:stripped_wiki_link:nation-states will inevitably assert control over frontier A", + "third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:set_created:2026-03-19", + "third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:stripped_wiki_link:pre-deployment-AI-evaluations-do-not-predict-real-world-risk" + ], + "rejections": [ + "binding-regulation-with-enforcement-is-the-only-ai-governance-mechanism-that-changes-frontier-lab-behavior.md:missing_attribution_extractor", + "compute-governance-through-export-controls-works-but-targets-geopolitics-not-safety-leaving-capability-race-unconstrained.md:missing_attribution_extractor", + "third-party-ai-evaluation-ecosystem-is-fragile-without-regulatory-mandate-because-voluntary-participation-and-funding-instability-threaten-continuity.md:missing_attribution_extractor" + ] + }, + "model": "anthropic/claude-sonnet-4.5", + "date": "2026-03-19" +} \ No newline at end of file diff --git a/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-industry-landscape-briefing.json b/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-industry-landscape-briefing.json new file mode 100644 index 00000000..afc25a75 --- /dev/null +++ b/inbox/queue/.extraction-debug/2026-03-16-theseus-ai-industry-landscape-briefing.json @@ -0,0 +1,40 @@ +{ + "rejected_claims": [ + { + "filename": "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md", + "issues": [ + "missing_attribution_extractor", + "opsec_internal_deal_terms" + ] + }, + { + "filename": 
"frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md", + "issues": [ + "missing_attribution_extractor", + "opsec_internal_deal_terms" + ] + } + ], + "validation_stats": { + "total": 2, + "kept": 0, + "fixed": 6, + "rejected": 2, + "fixes_applied": [ + "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:set_created:2026-03-19", + "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:stripped_wiki_link:coding-agents-crossed-usability-threshold-december-2025-when", + "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:stripped_wiki_link:the-gap-between-theoretical-AI-capability-and-observed-deplo", + "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:set_created:2026-03-19", + "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:stripped_wiki_link:AI-talent-circulation-between-frontier-labs-transfers-alignm", + "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:stripped_wiki_link:Anthropics-RSP-rollback-under-commercial-pressure-is-the-fir" + ], + "rejections": [ + 
"enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:missing_attribution_extractor", + "enterprise-coding-agents-emerged-as-first-killer-app-category-for-frontier-ai-because-verifiable-output-and-immediate-roi-overcome-adoption-friction.md:opsec_internal_deal_terms", + "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:missing_attribution_extractor", + "frontier-ai-lab-talent-circulation-accelerated-dramatically-in-2025-2026-with-11-plus-google-executives-to-microsoft-and-multiple-openai-departures-indicating-competitive-pressure-on-retention.md:opsec_internal_deal_terms" + ] + }, + "model": "anthropic/claude-sonnet-4.5", + "date": "2026-03-19" +} \ No newline at end of file diff --git a/inbox/queue/2026-02-28-demoura-when-ai-writes-software.md b/inbox/queue/2026-02-28-demoura-when-ai-writes-software.md index 0070a125..2fc8a41a 100644 --- a/inbox/queue/2026-02-28-demoura-when-ai-writes-software.md +++ b/inbox/queue/2026-02-28-demoura-when-ai-writes-software.md @@ -7,11 +7,14 @@ date_published: 2026-02-28 date_archived: 2026-03-16 domain: ai-alignment secondary_domains: [teleological-economics] -status: unprocessed +status: enrichment processed_by: theseus tags: [formal-verification, lean, ai-generated-code, proof-verification, trust-infrastructure] sourced_via: "Alex Obadia (@ObadiaAlex) tweet, ARIA Research Scaling Trust programme" twitter_id: "712705562191011841" +processed_by: theseus +processed_date: 2026-03-19 +extraction_model: "anthropic/claude-sonnet-4.5" --- # When AI Writes the World's Software, Who Verifies It? @@ -33,3 +36,15 @@ Key arguments: - "The barrier to verified software is no longer AI capability. It is platform readiness." 
 Directly relevant to [[formal verification of AI-generated proofs provides scalable oversight that human review cannot match because machine-checked correctness scales with AI capability while human verification degrades]].
+
+
+## Key Facts
+- Google: >25% of new code is AI-generated as of 2026
+- Microsoft: ~30% of code is AI-generated as of 2026
+- Microsoft CTO predicts 95% AI-generated code by 2030
+- Anthropic built 100,000-line C compiler using AI agents in 2 weeks for <$20,000
+- Nearly half of AI-generated code fails basic security tests
+- Poor software quality costs US economy $2.41T/year (CSIQ 2022)
+- Lean has 200K+ formalized theorems
+- 5 Fields medalists have adopted Lean
+- AlphaProof uses Lean as verification platform
diff --git a/inbox/queue/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.md b/inbox/queue/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.md
index d9916cbb..b0e625d2 100644
--- a/inbox/queue/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.md
+++ b/inbox/queue/2026-03-01-glp1-lifestyle-modification-efficacy-combined-approach.md
@@ -7,13 +7,17 @@ date: 2026-03-01
 domain: health
 secondary_domains: []
 format: review
-status: unprocessed
+status: enrichment
 priority: high
 tags: [glp-1, lifestyle-modification, exercise, sarcopenia, muscle-preservation, adherence, weight-regain, obesity]
 processed_by: vida
 processed_date: 2026-03-18
 enrichments_applied: ["glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md", "GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md"]
 extraction_model: "anthropic/claude-sonnet-4.5"
+processed_by: vida
+processed_date: 2026-03-19
+enrichments_applied:
["glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md", "GLP-1 receptor agonists are the largest therapeutic category launch in pharmaceutical history but their chronic use model makes the net cost impact inflationary through 2035.md"]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---

 ## Content
@@ -88,3 +92,13 @@ EXTRACTION HINT: Focus on the GLP-1 alone vs. GLP-1+exercise regain comparison
 - Up to 50% of adults over 80 experience sarcopenia; aging reduces muscle mass 12-16% independent of weight loss interventions
 - Tirzepatide may have better muscle preservation profile than semaglutide (preliminary data, not FDA-approved for this indication)
 - BALANCE model includes lifestyle support component but specific exercise programming details not specified in source
+
+
+## Key Facts
+- WHO December 2025 guidelines specifically recommend GLP-1 therapies 'combined with intensive behavioral therapy to maximize and sustain benefits'
+- Meta-analysis of 22 RCTs with 2,258 participants found approximately 25% of GLP-1 weight loss is lean mass
+- Without exercise, 15-40% of GLP-1 weight loss is lean mass; with resistance training, lean mass loss is substantially reduced
+- Up to 50% of adults over 80 experience sarcopenia; aging reduces muscle mass 12-16% independent of weight loss interventions
+- At week 52 all intervention groups regained weight after stopping; by week 104: placebo +7.6 kg, liraglutide only +8.7 kg, exercise only +5.4 kg, combination +3.5 kg
+- Tirzepatide may have better muscle preservation profile than semaglutide (preliminary data, not FDA-approved for this indication)
+- ADA notes new therapies claiming 'enhanced quality of weight loss by improving muscle preservation' but no FDA-approved compounds with proven muscle preservation yet
diff --git a/inbox/queue/2026-03-16-theseus-ai-coordination-governance-evidence.md
b/inbox/queue/2026-03-16-theseus-ai-coordination-governance-evidence.md
index a6f19e7f..d684c85e 100644
--- a/inbox/queue/2026-03-16-theseus-ai-coordination-governance-evidence.md
+++ b/inbox/queue/2026-03-16-theseus-ai-coordination-governance-evidence.md
@@ -6,10 +6,14 @@ url: null
 date_published: 2026-03-16
 date_archived: 2026-03-16
 domain: ai-alignment
-status: unprocessed
+status: enrichment
 processed_by: theseus
 tags: [ai-governance, coordination, safety-commitments, regulation, enforcement, voluntary-pledges]
 sourced_via: "Theseus research agent — 45 web searches synthesized from Brookings, Stanford FMTI, EU legislation, OECD, government publications, TechCrunch, TIME, CNN, Fortune, academic papers"
+processed_by: theseus
+processed_date: 2026-03-19
+enrichments_applied: ["AI transparency is declining not improving because Stanford FMTI scores dropped 17 points in one year while frontier labs dissolved safety teams and removed safety language from mission statements.md", "Anthropics RSP rollback under commercial pressure is the first empirical confirmation that binding safety commitments cannot survive the competitive dynamics of frontier AI development.md", "voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md", "compute export controls are the most impactful AI governance mechanism but target geopolitical competition not safety leaving capability development unconstrained.md", "AI investment concentration where 58 percent of funding flows to megarounds and two companies capture 14 percent of all global venture capital creates a structural oligopoly that alignment governance must account for.md"]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---

 # Empirical Evidence: AI Coordination and Governance Mechanisms That Changed Behavior
@@ -51,3 +55,19 @@ Core finding: almost no international AI governance mechanism has produced verif
 -
Insurance/liability: market projected $29.7B by 2033. Creates market incentives aligned with safety.
 - Third-party auditing: METR, Apollo Research. Apollo warns ecosystem unsustainable without regulatory mandate.
 - Futarchy: implemented for DAO governance (MetaDAO, Optimism experiment) but not yet for AI governance.
+
+
+## Key Facts
+- EU AI Act: Apple paused Apple Intelligence in EU, Meta changed ads, EUR 500M+ fines under DMA
+- China implemented mandatory algorithm filing with criminal enforcement (August 2023)
+- US export controls: tiered country system, deployment caps, Nvidia compliance chips (H800, A800)
+- Stanford FMTI transparency scores: -17 points mean (2024→2025), Meta -29, Mistral -37, OpenAI -14
+- OpenAI removed 'safely' from mission statement (November 2025)
+- OpenAI dissolved Superalignment team (May 2024) and Mission Alignment team (February 2026)
+- Google accused by 60 UK lawmakers of violating Seoul commitments (Gemini 2.5 Pro, April 2025)
+- 450+ organizations lobbied on AI in 2025 (up from 6 in 2016), $92M in lobbying fees Q1-Q3 2025
+- California SB 1047 vetoed after industry lobbying
+- Watermarking: 38% implementation rate across frontier labs
+- US AISI defunded/rebranded after initial establishment
+- UK-US joint evaluation of OpenAI o1 model conducted
+- Insurance/liability market projected $29.7B by 2033
diff --git a/inbox/queue/2026-03-16-theseus-ai-industry-landscape-briefing.md b/inbox/queue/2026-03-16-theseus-ai-industry-landscape-briefing.md
index 4a58f571..b68a8b5e 100644
--- a/inbox/queue/2026-03-16-theseus-ai-industry-landscape-briefing.md
+++ b/inbox/queue/2026-03-16-theseus-ai-industry-landscape-briefing.md
@@ -7,10 +7,13 @@ date_published: 2026-03-16
 date_archived: 2026-03-16
 domain: ai-alignment
 secondary_domains: [internet-finance]
-status: unprocessed
+status: enrichment
 processed_by: theseus
 tags: [industry-landscape, ai-labs, funding, competitive-dynamics, startups, investors]
 sourced_via: "Theseus research agent —
33 web searches synthesized from MIT Tech Review, TechCrunch, Crunchbase, OECD, company announcements, CNBC, Fortune, etc."
+processed_by: theseus
+processed_date: 2026-03-19
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---

 # AI Industry Landscape Briefing — March 2026
@@ -54,3 +57,19 @@ Multi-source synthesis of the current AI industry state. Key data points:
 - Daniel Gross → left SSI for Meta superintelligence team
 - John Schulman → left OpenAI for Thinking Machines Lab
 - 11+ Google executives → Microsoft in 2025
+
+
+## Key Facts
+- xAI reached ~$230B valuation with Grok 4/4.1 leading LMArena, 1M+ H100 GPUs, $20B Series E Jan 2026
+- Mistral reached $13.8B valuation, EUR 300M ARR targeting EUR 1B, building European sovereign compute
+- Google DeepMind released Gemini 3/3.1 family, 21% enterprise LLM spend, $175-185B capex 2026, Deep Think achieved gold-medal Olympiad results
+- Sierra (Bret Taylor) reached $10B+ valuation in agentic customer service
+- Databricks reached $134B valuation, $5B Series L, filed for IPO Q2 2026
+- 2025 total AI VC: $259-270B (52-61% of all global VC)
+- Feb 2026 AI funding: $189B (largest single month ever)
+- 75-79% of AI funding to US companies
+- Inference cost deflation ~10x/year
+- Chinese open-source (Qwen, DeepSeek) capturing 50-60% of new open-model adoption
+- 95% of enterprise AI pilots fail to deliver ROI (MIT Project NANDA)
+- Big 5 AI capex: $660-690B planned 2026
+- US deregulating AI, EU softening regulations