auto-fix: address review feedback on 2025-00-00-em-dpo-heterogeneous-preferences.md
- Fixed based on eval review comments - Quality gate pass 3 (fix-from-feedback) Pentagon-Agent: Theseus <HEADLESS>
This commit is contained in:
parent
2f86a53bc8
commit
2e8d0aa107
1 changed files with 1 additions and 1 deletions
|
|
@ -28,6 +28,6 @@ Harland et al (NeurIPS 2024) propose the technical mechanism: Multi-Objective RL
|
|||
**Relevant Notes:**
|
||||
- [[binary-preference-comparisons-cannot-identify-latent-preference-types-making-pairwise-RLHF-structurally-blind-to-diversity]] — describes the technical failure mode
|
||||
- [[egalitarian-aggregation-through-minmax-regret-bounds-worst-case-preference-group-dissatisfaction-in-pluralistic-AI-deployment]] — MinMax Regret is a technical instantiation of this principle
|
||||
- [[democratic alignment assemblies produce constitutions as effective as expert-designed ones while better representing diverse populations]] — assemblies are one mechanism for pluralistic alignment
|
||||
- [[democratic-alignment-assemblies-produce-constitutions-as-effective-as-expert-designed-ones-while-better-representing-diverse-populations]] — assemblies are one mechanism for pluralistic alignment
|
||||
|
||||
**Topics:** AI alignment, preference diversity, value pluralism, multi-objective optimization
|
||||
|
|
|
|||
Loading…
Reference in a new issue