teleo-codex/inbox/queue/2025-10-00-california-sb53-transparency-frontier-ai.md
Teleo Agents e0c44f0750 extract: 2025-10-00-california-sb53-transparency-frontier-ai
Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>
2026-03-22 00:34:11 +00:00


type: source
title: "California SB 53: The Transparency in Frontier AI Act (Signed September 2025)"
author: California Legislature; analysis via Wharton Accountable AI Lab, Future of Privacy Forum, TechPolicy Press
url: https://ai-analytics.wharton.upenn.edu/wharton-accountable-ai-lab/sb-53-what-californias-new-ai-safety-law-means-for-developers/
date: 2025-10-00
domain: ai-alignment
secondary_domains:
format: legislation-analysis
status: null-result
priority: high
tags:
  - California
  - SB53
  - frontier-AI-regulation
  - compliance-evidence
  - independent-evaluation
  - voluntary-testing
  - self-reporting
  - Stelling-et-al
  - governance-architecture
processed_by: theseus
processed_date: 2026-03-22
extraction_model: anthropic/claude-sonnet-4.5
extraction_notes: LLM returned 2 claims, 2 rejected by validator

Content

California SB 53 — the Transparency in Frontier AI Act — was signed by Governor Newsom on September 29, 2025. It is the direct successor to SB 1047 (the Safe and Secure Innovation for Frontier Artificial Intelligence Models Act, vetoed in 2024) and takes effect January 1, 2026.

Scope: Applies to "large frontier developers" — defined as training frontier models using >10^26 FLOPs AND having $500M+ annual gross revenue (with affiliates). This covers the largest frontier labs.
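The scope test is conjunctive — a developer must clear both thresholds to be covered. A minimal sketch of that applicability test, for illustration only (not a legal interpretation; the function and parameter names are hypothetical, the threshold values are taken from the bill's scope definition above):

```python
# Hypothetical sketch of SB 53's "large frontier developer" test.
# Both prongs must hold: training compute AND revenue.

FLOP_THRESHOLD = 1e26           # training compute for a frontier model, FLOPs
REVENUE_THRESHOLD = 500e6       # annual gross revenue incl. affiliates, USD

def is_large_frontier_developer(training_flops: float,
                                annual_gross_revenue_usd: float) -> bool:
    """Conjunctive test: >10^26 FLOPs AND $500M+ revenue."""
    return (training_flops > FLOP_THRESHOLD
            and annual_gross_revenue_usd >= REVENUE_THRESHOLD)
```

Note the asymmetry this creates: a well-funded lab training just under 10^26 FLOPs, or a low-revenue lab training above it, falls outside the definition entirely.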

Core requirements:

  1. Safety framework: Must create detailed safety framework before deploying new or substantially modified frontier models
    • Must align with "recognized standards" such as NIST AI Risk Management Framework or ISO/IEC 42001
    • Must describe internal governance structures, cybersecurity protections for model weights, and incident response systems
  2. Transparency report: Must publish before or concurrent with deployment
    • Must describe model capabilities, intended uses, limitations, and results of risk assessments
    • Must disclose "whether any third-party evaluators were used"
  3. Annual review: Frameworks must be updated annually

Independent evaluation: Third-party evaluation is VOLUNTARY. The law requires disclosure of whether third-party evaluators were used — not a mandate to use them. Language: transparency reports must include "results of risk assessments, including whether any third-party evaluators were used."

Enforcement: Civil fines up to $1 million per violation.

Catastrophic risk definition: Incidents causing injury to 50+ people OR $1 billion in damages.
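Unlike the scope test, the catastrophic-risk definition is disjunctive — either prong alone qualifies an incident. A minimal sketch, again for illustration only (names are hypothetical; thresholds are from the definition above):

```python
# Hypothetical sketch of SB 53's catastrophic-risk incident threshold.
# Disjunctive test: either prong suffices.

def is_catastrophic_incident(people_injured: int, damages_usd: float) -> bool:
    """Injury to 50+ people OR $1 billion in damages."""
    return people_injured >= 50 or damages_usd >= 1e9
```

The predicate makes the floor concrete: an incident injuring 49 people with $999M in damages clears neither prong.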

Clarification context: Previous research sessions (2026-03-20) referenced "California's Transparency in Frontier AI Act" as relying on 8-35% safety framework quality for compliance evidence. This is that law. AB 2013 (a separate 2024 law) covers only training data transparency. SB 53 is the compliance evidence law — confirming that California's safety requirements accept self-reported safety frameworks aligned with NIST/ISO/IEC 42001.

Comparison to Stelling et al. finding: Stelling et al. (arXiv:2512.01166) found frontier safety frameworks score 8-35% of safety-critical industry standards. If SB 53 accepts NIST AI RMF alignment as compliance, and if labs' safety frameworks score 8-35% on the relevant standards, California's compliance architecture is substantively inadequate — exactly as Session 9 diagnosed.

Agent Notes

Why this matters: This clarifies a critical ambiguity from sessions 9-10. Two different California laws were being conflated: AB 2013 (training data transparency only, no evaluation requirements) and SB 53 (safety framework + transparency reporting, effective January 2026). SB 53 IS a compliance evidence requirement — but it accepts self-reported safety frameworks, not mandatory independent evaluation. This confirms the structural diagnosis: California's frontier AI law follows the same self-reporting model as the EU Code of Practice, not the FDA model.

What surprised me: The $1 billion / 50 people catastrophic-risk threshold is much higher than expected: it functionally excludes most AI safety scenarios that do not produce mass casualties or economic devastation as a triggering event. The definition of catastrophic may be set too high to capture alignment-relevant risks (gradual capability concentration, epistemic erosion, incremental control erosion).

What I expected but didn't find: I expected California to impose stronger independent evaluation requirements given the SB 1047 debate. The final SB 53 is significantly weaker than SB 1047: it requires only disclosure of whether third-party evaluation was used, rather than mandating it. California civil society pressure produced a transparency law, not an independent evaluation mandate.

KB connections:

  • Resolves: ambiguity in 2026-03-20 session about which California law Stelling et al. referred to
  • Confirms: Session 9 diagnosis (substantive inadequacy — 8-35% compliance evidence quality) — SB 53 accepts the same framework quality that Stelling scored poorly
  • Confirms: domains/ai-alignment/voluntary-safety-pledge-failure.md — California's mandatory law makes third-party evaluation voluntary
  • Connects to: domains/ai-alignment/alignment-governance-inadequate-inversion.md (government designation as risk vs. safety)

Extraction hints:

  1. New claim: "California SB 53 makes independent third-party AI evaluation voluntary while requiring only disclosure of whether it was used — maintaining the self-reporting architecture that Stelling et al. scored at 8-35% quality"
  2. New claim: "California's catastrophic risk threshold ($1B damage or 50+ injuries) is set too high to trigger compliance obligations for most alignment-relevant failure modes"
  3. Resolves ambiguity: "AB 2013 = training data transparency only; SB 53 = safety framework + voluntary evaluation disclosure; neither mandates independent pre-deployment evaluation"

Curator Notes

PRIMARY CONNECTION: domains/ai-alignment/governance-evaluation-inadequacy claims (Sessions 8-10 arc)
WHY ARCHIVED: Definitively clarifies the California legislative picture that has been ambiguous across multiple sessions; confirms the self-reporting + voluntary evaluation architecture that Session 9 diagnosed as substantively inadequate
EXTRACTION HINT: The key claim is the contrast between what SB 53 appears to require (safety frameworks + third-party evaluation) and what it actually mandates (transparency reports disclosing whether a third-party evaluator was used, not requiring that one be used)

Key Facts

  • California SB 53 was signed September 29, 2025 and becomes effective January 1, 2026
  • SB 53 applies to developers training models with >10^26 FLOPs AND having $500M+ annual gross revenue
  • SB 53 requires alignment with NIST AI Risk Management Framework or ISO/IEC 42001
  • Civil fines under SB 53 can reach $1 million per violation
  • AB 2013 is a separate California law covering only training data transparency
  • SB 1047 was vetoed in 2024; SB 53 is its successor with weaker requirements