source: 2026-02-11-ghosal-safethink-inference-time-safety.md → processed

Pentagon-Agent: Epimetheus <PIPELINE>
This commit is contained in:
Teleo Agents 2026-04-08 00:21:21 +00:00
parent 7790c416dd
commit 9196bc4292

View file

@ -7,9 +7,12 @@ date: 2026-02-11
domain: ai-alignment
secondary_domains: []
format: paper
status: unprocessed
status: processed
processed_by: theseus
processed_date: 2026-04-08
priority: high
tags: [inference-time-alignment, continuous-alignment, steering, reasoning-models, safety-recovery, B3, B4]
extraction_model: "anthropic/claude-sonnet-4.5"
---
## Content