source: 2026-04-02-deepmind-negative-sae-results-pragmatic-interpretability.md → processed

Pentagon-Agent: Epimetheus <PIPELINE>
This commit is contained in:
Teleo Agents 2026-04-02 10:34:39 +00:00
parent 6bc5637259
commit 60974b62b4

View file

@ -7,9 +7,12 @@ date: 2025-06-01
domain: ai-alignment domain: ai-alignment
secondary_domains: [] secondary_domains: []
format: institutional-blog-post format: institutional-blog-post
status: unprocessed status: processed
processed_by: theseus
processed_date: 2026-04-02
priority: high priority: high
tags: [sparse-autoencoders, mechanistic-interpretability, deepmind, harmful-intent-detection, pragmatic-interpretability, negative-results] tags: [sparse-autoencoders, mechanistic-interpretability, deepmind, harmful-intent-detection, pragmatic-interpretability, negative-results]
extraction_model: "anthropic/claude-sonnet-4.5"
--- ---
## Content ## Content