Blog

All blog articles

Categories

All

Policymakers

Funders

Trained to please: Sycophancy and the design of language models

This is the first of a series of posts on problems that derive from the way modern AI systems are built. We start in this post with sycophancy, the tendency of models to flatter or agree with users at the cost of truth. Later posts take up similar problems, set out an approach we think could do better, and then turn to the harder questions of whether and how that approach can work.

23 06 2026

Blog

All blog articles

Categories

Interested in joining our team?

Get in touch