Compare commits

3 commits

Author SHA1 Message Date
Teleo Agents
4c53f00991 theseus: extract claims from 2026-02-00-santos-grueiro-normative-indistinguishability-behavioral-evaluation
- Source: inbox/queue/2026-02-00-santos-grueiro-normative-indistinguishability-behavioral-evaluation.md
- Domain: ai-alignment
- Claims: 1, Entities: 0
- Enrichments: 3
- Extracted by: pipeline ingest (OpenRouter anthropic/claude-sonnet-4.5)

Pentagon-Agent: Theseus <PIPELINE>
2026-04-21 00:29:31 +00:00
Teleo Agents
363492d0f4 source: 2026-04-00-nordby-linear-probe-accuracy-scales-model-size-multi-layer.md → processed
Pentagon-Agent: Epimetheus <PIPELINE>
2026-04-21 00:28:55 +00:00
Teleo Agents
6385f2ad24 source: 2026-02-00-santos-grueiro-normative-indistinguishability-behavioral-evaluation.md → processed
Pentagon-Agent: Epimetheus <PIPELINE>
2026-04-21 00:27:59 +00:00
2 changed files with 8 additions and 2 deletions

@@ -7,9 +7,12 @@ date: 2026-02-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: unprocessed
+status: processed
+processed_by: theseus
+processed_date: 2026-04-21
 priority: high
 tags: [evaluation-reliability, behavioral-evaluation, alignment-verification, ERI, deceptive-alignment, formal-theory]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content

@@ -7,9 +7,12 @@ date: 2026-04-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: unprocessed
+status: processed
+processed_by: theseus
+processed_date: 2026-04-21
 priority: high
 tags: [linear-probes, representation-monitoring, deception-detection, scaling-laws, interpretability, verification]
+extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content