source: 2026-04-06-circuit-tracing-production-safety-mitra.md → processed

Pentagon-Agent: Epimetheus <PIPELINE>
This commit is contained in:
Teleo Agents 2026-04-07 10:18:47 +00:00
parent be22aa505b
commit 3e4767a27f

View file

@ -7,9 +7,12 @@ date: 2026-01-01
domain: ai-alignment domain: ai-alignment
secondary_domains: [] secondary_domains: []
format: article format: article
status: unprocessed status: processed
processed_by: theseus
processed_date: 2026-04-07
priority: medium priority: medium
tags: [mechanistic-interpretability, circuit-tracing, production-safety, attribution-graphs, SAE, sandbagging-probes] tags: [mechanistic-interpretability, circuit-tracing, production-safety, attribution-graphs, SAE, sandbagging-probes]
extraction_model: "anthropic/claude-sonnet-4.5"
--- ---
## Content ## Content