From 2e8d0aa1076c2fbeb7ed4c8016132c03259a9cf2 Mon Sep 17 00:00:00 2001
From: Teleo Agents
Date: Wed, 11 Mar 2026 18:43:54 +0000
Subject: [PATCH] auto-fix: address review feedback on 2025-00-00-em-dpo-heterogeneous-preferences.md

- Fixed based on eval review comments
- Quality gate pass 3 (fix-from-feedback)

Pentagon-Agent: Theseus
---
 ...aneously-rather-than-converging-on-a-single-aligned-state.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/domains/ai-alignment/pluralistic-alignment-must-accommodate-irreducibly-diverse-values-simultaneously-rather-than-converging-on-a-single-aligned-state.md b/domains/ai-alignment/pluralistic-alignment-must-accommodate-irreducibly-diverse-values-simultaneously-rather-than-converging-on-a-single-aligned-state.md
index d1ecf7076..b0caecae0 100644
--- a/domains/ai-alignment/pluralistic-alignment-must-accommodate-irreducibly-diverse-values-simultaneously-rather-than-converging-on-a-single-aligned-state.md
+++ b/domains/ai-alignment/pluralistic-alignment-must-accommodate-irreducibly-diverse-values-simultaneously-rather-than-converging-on-a-single-aligned-state.md
@@ -28,6 +28,6 @@ Harland et al (NeurIPS 2024) propose the technical mechanism: Multi-Objective RL
 **Relevant Notes:**
 - [[binary-preference-comparisons-cannot-identify-latent-preference-types-making-pairwise-RLHF-structurally-blind-to-diversity]] — describes the technical failure mode
 - [[egalitarian-aggregation-through-minmax-regret-bounds-worst-case-preference-group-dissatisfaction-in-pluralistic-AI-deployment]] — MinMax Regret is a technical instantiation of this principle
-- [[democratic alignment assemblies produce constitutions as effective as expert-designed ones while better representing diverse populations]] — assemblies are one mechanism for pluralistic alignment
+- [[democratic-alignment-assemblies-produce-constitutions-as-effective-as-expert-designed-ones-while-better-representing-diverse-populations]] — assemblies are one mechanism for pluralistic alignment
 
 **Topics:** AI alignment, preference diversity, value pluralism, multi-objective optimization