Entry
Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models
Benjamin Maltbie, Shivam Raval
Tests whether sycophancy varies systematically with perceived user demographics across 768 multi-turn conversations spanning 128 personas (race, age, gender, confidence) and three domains. Sycophancy varies sharply by model (GPT-5-nano x̄=2.96 vs Claude Haiku 4.5 x̄=1.74) and domain (philosophy 41% more sycophantic than math). Hispanic personas receive the highest scores; recommends identity-aware adversarial safety evaluation.
·sycophancy ·intersectionality ·perceived demographics ·false validation ·persona conditioning
- Ask Don't Tell: Reducing Sycophancy in Large Language ModelsFebruary 27, 2026 · arXiv
- A Rational Analysis of the Effects of Sycophantic AIFebruary 15, 2026 · arXiv
- Belief Offloading in Human-AI InteractionFebruary 9, 2026 · arXiv
- How RLHF Amplifies SycophancyFebruary 1, 2026 · arXiv
- Towards Understanding Sycophancy in Language ModelsOctober 20, 2023 · ICLR 2024