Human–AI Coevolution

Entry

A Rational Analysis of the Effects of Sycophantic AI

Rafael M. Batista, Thomas L. Griffiths

Synopsis

A Bayesian rational analysis showing that sampling AI-confirmed evidence makes a rational agent more confident without bringing it any closer to the truth. The effect is distinct from hallucination because it reinforces an existing belief. The analysis was tested empirically on the Wason 2-4-6 rule-discovery task (N=557): unmodified LLM feedback suppressed rule discovery and inflated confidence, while unbiased sampling yielded 5× higher discovery rates.
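
The mechanism in the synopsis can be illustrated with a toy Bayesian learner. The sketch below is not the paper's model: the two-hypothesis space, the domain size `N`, the uniform prior, the size-principle likelihood (each example assumed to be drawn uniformly from the true rule), and the names `plus_two`, `ascending`, and `posterior_after` are all assumptions made for illustration. It contrasts examples drawn only from the learner's favoured narrow rule with examples drawn from the broader rule.

```python
from itertools import combinations
import random

N = 20  # assumed toy domain: integers 1..N

# All strictly ascending triples; stands in for the true rule in this toy.
ascending = list(combinations(range(1, N + 1), 3))
# Narrower rule the learner initially favours: steps of exactly 2.
plus_two = [t for t in ascending if t[1] - t[0] == 2 and t[2] - t[1] == 2]

HYPOTHESES = {"plus_two": set(plus_two), "ascending": set(ascending)}


def posterior_after(observations, prior=None):
    """Update a posterior over HYPOTHESES from a list of positive examples.

    Assumed likelihood: 1/|h| if the triple satisfies h, else 0.
    Observations are assumed to be ascending triples within 1..N, so at
    least one hypothesis always keeps non-zero mass.
    """
    post = dict(prior or {"plus_two": 0.5, "ascending": 0.5})
    for triple in observations:
        unnorm = {
            name: p * (1.0 / len(HYPOTHESES[name]) if triple in HYPOTHESES[name] else 0.0)
            for name, p in post.items()
        }
        total = sum(unnorm.values())
        post = {name: v / total for name, v in unnorm.items()}
    return post


rng = random.Random(0)
confirming = rng.choices(plus_two, k=5)   # examples that only confirm the narrow rule
unbiased = rng.choices(ascending, k=5)    # examples drawn from the broad rule
print("confirming:", posterior_after(confirming))
print("unbiased:  ", posterior_after(unbiased))
```

Under these assumptions, confirming examples are consistent with both rules, so they can never falsify the narrow one, and the size principle pushes the learner's confidence in it upward. A single ascending triple that does not step by 2 is enough to rule it out.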

Keywords

sycophancy · belief reinforcement · Bayesian rational analysis · Wason 2-4-6 task · epistemic risk

Related entries