Blog

Categories
This is the first of a series of posts on problems that derive from the way modern AI systems are built. We start in this post with sycophancy, the tendency of models to flatter or agree with users at the cost of truth. Later posts take up similar problems, set out an approach we think could do better, and then turn to the harder questions of whether and how that approach can work.