Entry

A Rational Analysis of the Effects of Sycophantic AI

Rafael M. Batista, Thomas L. Griffiths

Synopsis

A Bayesian rational analysis showing that when the evidence an agent receives is sampled to confirm its current hypothesis, even a rational agent grows more confident without making progress toward the truth. This risk is distinct from hallucination because it reinforces an existing belief. The prediction was tested empirically on the Wason 2-4-6 rule-discovery task (N=557): unmodified LLM feedback suppressed rule discovery and inflated confidence, while unbiased sampling yielded 5× higher discovery rates.
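The core prediction can be illustrated with a toy simulation. The sketch below is not the authors' model: the three-rule hypothesis space, the 1 to 10 number range, the size-principle likelihood, and the "sample from the agent's favored rule" definition of a sycophantic source are all assumptions chosen for illustration. Every triple the sycophantic source returns is truthful (it really does satisfy the true rule), yet the agent's confidence in the wrong, narrower rule climbs toward 1.

```python
import random
from itertools import combinations

# Assumed toy domain: ascending triples drawn from the integers 1..10.
ASCENDING = list(combinations(range(1, 11), 3))

# Hypothetical hypothesis space, from narrowest to broadest rule.
HYPOTHESES = {
    "plus_two": [t for t in ASCENDING if t[1] - t[0] == 2 and t[2] - t[1] == 2],
    "equal_steps": [t for t in ASCENDING if t[1] - t[0] == t[2] - t[1]],
    "any_ascending": ASCENDING,
}
TRUE_RULE = "any_ascending"  # the classic 2-4-6 rule


def update(posterior, triple):
    """One Bayesian update on a positive example.

    Likelihood is 1/|h| if the triple fits hypothesis h, else 0
    (the size principle; an assumption of this sketch).
    """
    unnorm = {}
    for name, p in posterior.items():
        extension = HYPOTHESES[name]
        likelihood = 1.0 / len(extension) if triple in extension else 0.0
        unnorm[name] = p * likelihood
    total = sum(unnorm.values())
    return {name: v / total for name, v in unnorm.items()}


def run(source, n_samples=10, seed=0):
    """Return the posterior after n_samples examples from the given source."""
    rng = random.Random(seed)
    posterior = {name: 1.0 / len(HYPOTHESES) for name in HYPOTHESES}
    posterior = update(posterior, (2, 4, 6))  # the seed example of the task
    for _ in range(n_samples):
        if source == "sycophantic":
            # Examples are drawn from the rule the agent currently favors.
            favored = max(posterior, key=posterior.get)
            triple = rng.choice(HYPOTHESES[favored])
        else:
            # Unbiased source: examples drawn from the true rule.
            triple = rng.choice(HYPOTHESES[TRUE_RULE])
        posterior = update(posterior, triple)
    return posterior


for source in ("sycophantic", "unbiased"):
    result = run(source)
    best = max(result, key=result.get)
    print(f"{source}: favored rule = {best}, confidence = {result[best]:.3f}")
```

Under these assumptions the sycophantic source never shows the agent a triple that could separate the narrow rule from the true one, so confidence rises while the favored rule stays wrong; the unbiased source eliminates the narrow rules as soon as one example falls outside them.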

Keywords

sycophancy · belief reinforcement · Bayesian rational analysis · Wason 2-4-6 task · epistemic risk

Related entries

Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models
April 13, 2026 · arXiv

Ask Don't Tell: Reducing Sycophancy in Large Language Models
February 27, 2026 · arXiv

Belief Offloading in Human-AI Interaction
February 9, 2026 · arXiv

How RLHF Amplifies Sycophancy
February 1, 2026 · arXiv

Towards Understanding Sycophancy in Language Models
October 20, 2023 · ICLR 2024