Entry
A Rational Analysis of the Effects of Sycophantic AI
Rafael M. Batista, Thomas L. Griffiths
A Bayesian rational analysis showing that sampling AI-confirmed evidence makes a rational agent more confident without making progress toward truth — distinct from hallucination because it reinforces existing belief. Empirically tested on the Wason 2-4-6 rule-discovery task (N=557): unmodified LLM feedback suppressed discovery and inflated confidence, while unbiased sampling yielded 5× higher discovery rates.
·sycophancy ·belief reinforcement ·Bayesian rational analysis ·Wason 2-4-6 task ·epistemic risk
- Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language ModelsApril 13, 2026 · arXiv
- Ask Don't Tell: Reducing Sycophancy in Large Language ModelsFebruary 27, 2026 · arXiv
- Belief Offloading in Human-AI InteractionFebruary 9, 2026 · arXiv
- How RLHF Amplifies SycophancyFebruary 1, 2026 · arXiv
- Towards Understanding Sycophancy in Language ModelsOctober 20, 2023 · ICLR 2024