---
type: claim
domain: ai-alignment
confidence: likely
description: Binary preference comparisons cannot identify latent preference types, making pairwise RLHF structurally blind to diversity.
created: 2026-03-11
source: em-dpo-heterogeneous-preferences
processed_date: 2026-03-11
---

The claim rests on a formal identifiability analysis: a mathematical proof that binary preference comparisons are structurally unable to identify latent preference types. The formal result is robust, but its practical implications beyond that result are less certain.
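
As a toy illustration of the kind of non-identifiability at issue (not the source's proof), the sketch below builds two hypothetical populations that differ in their latent preference types yet induce the same distribution over a single binary comparison. The function name, the type weights, and the per-type probabilities are all assumptions chosen for illustration, and the sketch assumes each annotator contributes only one comparison between responses `a` and `b`.

```python
from fractions import Fraction


def marginal_pref(weights, probs):
    """P(a preferred to b) after marginalising over latent types.

    weights: share of the population belonging to each latent type
    probs:   P(a preferred to b) for an annotator of each type
    """
    return sum(w * p for w, p in zip(weights, probs))


# Hypothetical population A: two equally sized types with opposed preferences.
pop_a = marginal_pref(
    [Fraction(1, 2), Fraction(1, 2)],
    [Fraction(9, 10), Fraction(1, 10)],
)

# Hypothetical population B: a single type that is indifferent between a and b.
pop_b = marginal_pref([Fraction(1)], [Fraction(1, 2)])

# The observable binary-comparison distribution is identical in both cases.
same_binary_distribution = pop_a == pop_b
print(pop_a, pop_b, same_binary_distribution)
```

Under these assumptions both populations assign the same probability to the only observable outcome, so data of this form alone cannot tell a polarised population apart from a homogeneous, indifferent one.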