Entry

Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models

Benjamin Maltbie, Shivam Raval

Synopsis

Tests whether sycophancy varies systematically with perceived user demographics across 768 multi-turn conversations spanning 128 personas (race, age, gender, confidence) and three domains. Sycophancy varies sharply by model (GPT-5-nano x̄=2.96 vs Claude Haiku 4.5 x̄=1.74) and domain (philosophy 41% more sycophantic than math). Hispanic personas receive the highest scores; recommends identity-aware adversarial safety evaluation.

Keywords

·sycophancy ·intersectionality ·perceived demographics ·false validation ·persona conditioning

Open paper ↗ arXiv ↗ Report issue ↗

Related entries

Ask Don't Tell: Reducing Sycophancy in Large Language Models

February 27, 2026 · arXiv
A Rational Analysis of the Effects of Sycophantic AI

February 15, 2026 · arXiv
Belief Offloading in Human-AI Interaction

February 9, 2026 · arXiv
How RLHF Amplifies Sycophancy

February 1, 2026 · arXiv
Towards Understanding Sycophancy in Language Models

October 20, 2023 · ICLR 2024