Teleo Agents 6459163781 epimetheus: source archive restructure — 537 files reorganized

inbox/queue/ (52 unprocessed) — landing zone for new sources
inbox/archive/{domain}/ (311 processed) — organized by domain
inbox/null-result/ (174) — reviewed, nothing extractable

One-time atomic migration. All paths preserved (wiki links use stems).

Pentagon-Agent: Epimetheus <968B2991-E2DF-4006-B962-F5B0A0CC8ACA>

2026-03-18 11:52:23 +00:00

2 KiB

Raw Blame History

type

title

author

url

date_published

date_archived

domain

secondary_domains

status

processed_by

When AI Writes the World's Software, Who Verifies It?

Leonardo de Moura (AWS, Chief Architect of Lean FRO) argues AI-generated code is proliferating faster than verification can scale. Mathematical proof — not testing alone — becomes essential infrastructure.

Key evidence:

Google: >25% of new code is AI-generated; Microsoft: ~30%. Microsoft CTO predicts 95% by 2030.
Anthropic built 100,000-line C compiler using AI agents in 2 weeks for <$20,000
"Nearly half of AI-generated code fails basic security tests"
Poor software quality costs US economy $2.41T/year (CSIQ 2022)

Key arguments:

Testing provides confidence but not guarantees. "A proof cannot be gamed."
AI overfits test suites — Claude C Compiler "hard-codes values to satisfy the test suite" and "will not generalize"
Supply chain attacks via poisoned training data can "inject subtle vulnerabilities into every system AI touches"
Lean has become the de facto formal verification platform (AlphaProof, 200K+ formalized theorems, 5 Fields medalists)
Morrison (Lean FRO) demonstrated AI-generated Lean implementation of zlib with mathematical proof of correctness
"The barrier to verified software is no longer AI capability. It is platform readiness."

Directly relevant to formal verification of AI-generated proofs provides scalable oversight that human review cannot match because machine-checked correctness scales with AI capability while human verification degrades.

2 KiB Raw Blame History

When AI Writes the World's Software, Who Verifies It?

2 KiB

Raw Blame History