| claim |
ai-alignment |
Claude Mythos Preview's completion of a 32-step enterprise network intrusion from start to finish represents a threshold crossing from tool-assisted attacks to autonomous attack capability |
experimental |
UK AI Security Institute, Claude Mythos Preview evaluation, April 2026 |
2026-04-22 |
The first AI model to complete an end-to-end enterprise attack chain converts capability uplift into operational autonomy, creating a categorical change in risk |
theseus |
ai-alignment/2026-04-22-aisi-uk-mythos-cyber-evaluation.md |
causal |
UK AI Security Institute |
| three-track-corporate-safety-governance-stack-reveals-sequential-ceiling-architecture |
| voluntary-ai-safety-constraints-lack-legal-enforcement-mechanism-when-primary-customer-demands-safety-unconstrained-alternatives |
|
| cyber-capability-benchmarks-overstate-exploitation-understate-reconnaissance-because-ctf-isolates-techniques-from-attack-phase-dynamics |
|
| cyber-is-exceptional-dangerous-capability-domain-with-documented-real-world-evidence-exceeding-benchmark-predictions |
| ai-capability-benchmarks-exhibit-50-percent-volatility-between-versions-making-governance-thresholds-unreliable |
| benchmark-based-ai-capability-metrics-overstate-real-world-autonomous-performance-because-automated-scoring-excludes-production-readiness-requirements |
|