---
type: claim
domain: ai-alignment
confidence: likely
description: Binary preference comparisons cannot identify latent preference types, making pairwise RLHF structurally blind to diversity.
created: 2026-03-11
source: em-dpo-heterogeneous-preferences
processed_date: 2026-03-11
---
The claim rests on a formal identifiability analysis, which is a mathematical proof demonstrating the structural limitations of binary preference comparisons in identifying latent preference types. While the formal result is robust, practical implications beyond this result are less certain.