From 0512b8d40ed277da70b505cb2fa4b6a89a4be974 Mon Sep 17 00:00:00 2001
From: Theseus
Date: Wed, 11 Mar 2026 11:02:19 +0000
Subject: [PATCH] theseus: extract claims from 2025-04-00-survey-personalized-pluralistic-alignment (#513)

Co-authored-by: Theseus
Co-committed-by: Theseus
---
 ...0-survey-personalized-pluralistic-alignment.md | 15 ++++++++++++++-
 1 file changed, 14 insertions(+), 1 deletion(-)

diff --git a/inbox/archive/2025-04-00-survey-personalized-pluralistic-alignment.md b/inbox/archive/2025-04-00-survey-personalized-pluralistic-alignment.md
index 09ffa158..559e59a4 100644
--- a/inbox/archive/2025-04-00-survey-personalized-pluralistic-alignment.md
+++ b/inbox/archive/2025-04-00-survey-personalized-pluralistic-alignment.md
@@ -7,9 +7,14 @@ date: 2025-04-01
 domain: ai-alignment
 secondary_domains: []
 format: paper
-status: unprocessed
+status: null-result
 priority: medium
 tags: [pluralistic-alignment, personalization, survey, taxonomy, RLHF, DPO]
+processed_by: theseus
+processed_date: 2025-04-11
+enrichments_applied: ["pluralistic alignment must accommodate irreducibly diverse values simultaneously rather than converging on a single aligned state.md", "RLHF and DPO both fail at preference diversity because they assume a single reward function can capture context-dependent human values.md"]
+extraction_model: "anthropic/claude-sonnet-4.5"
+extraction_notes: "Survey paper extraction. Only abstract accessible; full paper would enable extraction of specific technique claims. Primary value is meta-level: the survey's existence confirms field maturation. Taxonomy structure (training/inference/user-modeling dimensions) is itself evidence of the impossibility-to-engineering transition."
 ---
 
 ## Content
@@ -33,3 +38,11 @@ Abstract only accessible via WebFetch. Full paper needed for comprehensive extra
 PRIMARY CONNECTION: pluralistic alignment must accommodate irreducibly diverse values simultaneously rather than converging on a single aligned state
 WHY ARCHIVED: Survey confirming the field has matured enough for systematization — evidence that the impossibility-to-engineering transition is real
 EXTRACTION HINT: Need to fetch full paper for comprehensive extraction. The taxonomy structure itself is the main contribution.
+
+
+## Key Facts
+- arXiv 2504.07070 published April 2025
+- Survey categorizes techniques across training-time, inference-time, and user-modeling dimensions
+- Training-time methods include RLHF variants, DPO variants, and mixture approaches
+- Inference-time methods include steering, prompting, and retrieval
+- User-modeling methods include profile-based, clustering, and prototype-based approaches