theseus: research session 2026-03-10 #188
4 changed files with 4 additions and 4 deletions
|
|
@ -6,7 +6,7 @@ url: https://ai-frontiers.org/articles/ai-alignment-cannot-be-top-down
|
|||
date: 2025-01-01
|
||||
domain: ai-alignment
|
||||
secondary_domains: [collective-intelligence, mechanisms]
|
||||
format: article
|
||||
format: report
|
||||
status: unprocessed
|
||||
priority: high
|
||||
tags: [democratic-alignment, RLCF, pluralistic-alignment, community-feedback, Taiwan, civic-AI]
|
||||
|
|
|
|||
|
|
@ -6,7 +6,7 @@ url: https://blog.cip.org/p/from-global-dialogues-to-democratic
|
|||
date: 2025-12-01
|
||||
domain: ai-alignment
|
||||
secondary_domains: [collective-intelligence, mechanisms]
|
||||
format: article
|
||||
format: report
|
||||
status: unprocessed
|
||||
priority: high
|
||||
tags: [democratic-alignment, evaluation, pluralistic, global-dialogues, weval, samiksha, empirical-results]
|
||||
|
|
|
|||
|
|
@ -6,7 +6,7 @@ url: https://gist.github.com/bigsnarfdude/629f19f635981999c51a8bd44c6e2a54
|
|||
date: 2026-01-01
|
||||
domain: ai-alignment
|
||||
secondary_domains: []
|
||||
format: article
|
||||
format: report
|
||||
status: unprocessed
|
||||
priority: high
|
||||
tags: [mechanistic-interpretability, SAE, safety, technical-alignment, limitations, DeepMind-pivot]
|
||||
|
|
|
|||
|
|
@ -6,7 +6,7 @@ url: https://time.com/7380854/exclusive-anthropic-drops-flagship-safety-pledge/
|
|||
date: 2026-02-01
|
||||
domain: ai-alignment
|
||||
secondary_domains: [grand-strategy]
|
||||
format: article
|
||||
format: report
|
||||
status: unprocessed
|
||||
priority: high
|
||||
tags: [Anthropic, RSP, safety-pledge, competitive-pressure, institutional-failure, voluntary-commitments]
|
||||
|
|
|
|||
Loading…
Reference in a new issue