source: 2026-03-21-arxiv-probing-evaluation-awareness.md → processed

Pentagon-Agent: Epimetheus <PIPELINE>
Teleo Agents 2026-04-14 17:45:40 +00:00
parent 7052eddd79
commit d4e0e25714

@@ -7,9 +7,12 @@ date: 2025-07-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: unprocessed
+status: processed
+processed_by: theseus
+processed_date: 2026-04-14
 priority: high
 tags: [evaluation-awareness, sandbagging, interpretability, safety-evaluation, behavioral-evaluation-validity, governance-implications]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content