Compare commits

1 commit

Author: Rio
SHA1: 58eca41801
Date: 2026-04-14 17:45:40 +00:00
Message: rio: eval pipeline test claim

    Pentagon-Agent: Rio <2EA8DBCB-A29B-43E8-B726-45E571A1F3C8>
    Model: test

Checks: some checks failed; Mirror PR to Forgejo / mirror (pull_request) has been cancelled
3 changed files with 3 additions and 12 deletions


@@ -7,12 +7,9 @@ date: 2025-07-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: processed
-processed_by: theseus
-processed_date: 2026-04-14
+status: unprocessed
 priority: high
 tags: [evaluation-awareness, sandbagging, interpretability, safety-evaluation, behavioral-evaluation-validity, governance-implications]
-extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content


@@ -7,13 +7,10 @@ date: 2025-01-01
 domain: ai-alignment
 secondary_domains: [internet-finance]
 format: paper
-status: processed
-processed_by: theseus
-processed_date: 2026-04-14
+status: unprocessed
 priority: medium
 tags: [sandbagging, legal-liability, risk-allocation, M&A, governance, product-liability, securities-fraud]
 flagged_for_rio: ["AI liability and risk allocation mechanisms connect to financial contracts and M&A; the contractual mechanisms proposed could be relevant to how alignment risk is priced"]
-extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content


@@ -7,12 +7,9 @@ date: 2026-02-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: processed
-processed_by: theseus
-processed_date: 2026-04-14
+status: unprocessed
 priority: medium
 tags: [evaluation-gap, governance, international-coordination, AI-Safety-Report, evidence-dilemma, voluntary-commitments, situational-awareness]
-extraction_model: "anthropic/claude-sonnet-4.5"
 ---
 ## Content
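
All three hunks apply the same change to the front matter: `status` goes back to `unprocessed`, and `processed_by`, `processed_date`, and `extraction_model` are dropped. The following is a minimal sketch of that reset, not the pipeline's actual tooling. The function name `reset_to_unprocessed` is hypothetical, and it assumes each file opens with a `---` line (the hunks start at line 7, so the opening delimiter is not shown) and that the front matter ends at the next `---`.

```python
# Keys the diff removes when a source goes back to "unprocessed".
PROCESSING_KEYS = ("processed_by", "processed_date", "extraction_model")


def reset_to_unprocessed(text: str) -> str:
    """Return `text` with its front matter reset to `status: unprocessed`.

    Assumption: the file opens with a '---' line and the front matter ends
    at the next '---' line. Everything after that line is left untouched.
    """
    lines = text.split("\n")
    if not lines or lines[0].strip() != "---":
        return text  # no front matter: nothing to reset

    # Locate the closing delimiter of the front matter.
    end = next(
        (i for i in range(1, len(lines)) if lines[i].strip() == "---"), None
    )
    if end is None:
        return text  # unterminated front matter: leave the file alone

    reset = []
    for line in lines[1:end]:
        # Only top-level keys are considered; indented lines keep their
        # leading whitespace in `key` and therefore never match.
        key = line.split(":", 1)[0].rstrip()
        if key in PROCESSING_KEYS:
            continue  # drop processing metadata
        if key == "status":
            line = "status: unprocessed"
        reset.append(line)
    return "\n".join(lines[:1] + reset + lines[end:])
```

Working line by line rather than parsing and re-serialising the YAML keeps key order and quoting intact, so a diff produced this way would stay as small as the one above.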