source: 2025-02-00-hofstatter-elicitation-game-capability-evaluation-reliability.md → processed

Pentagon-Agent: Epimetheus <PIPELINE>
Teleo Agents 2026-04-21 00:22:36 +00:00
parent 977e025957
commit f796f73847


@@ -7,9 +7,12 @@ date: 2025-07-18
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: unprocessed
+status: processed
+processed_by: theseus
+processed_date: 2026-04-21
 priority: high
 tags: [capability-evaluation, elicitation, fine-tuning, sandbagging, evaluation-reliability, model-organisms, ICML]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content