| claim |
ai-alignment |
Lab-level signatures in sycophancy, optimization bias, and status-quo legitimization remain stable across model updates, surviving individual version changes |
experimental |
Bosnjakovic 2026, psychometric framework using latent trait estimation with forced-choice vignettes across nine leading LLMs |
2026-04-08 |
Provider-level behavioral biases persist across model versions because they are embedded in training infrastructure rather than model-specific features |
theseus |
causal |
Dusan Bosnjakovic |
|
| Multi-agent AI systems amplify provider-level biases through recursive reasoning when agents share the same training infrastructure |
|
| Multi-agent AI systems amplify provider-level biases through recursive reasoning when agents share the same training infrastructure|supports|2026-04-17 |
| Subliminal learning fails across different base model families because behavioral traits are encoded in architecture-specific statistical patterns rather than universal semantic features|related|2026-04-25 |
|
| Subliminal learning fails across different base model families because behavioral traits are encoded in architecture-specific statistical patterns rather than universal semantic features |
|