•  
  •  
 

Faculty Advisor

Jonathan Barlow

Faculty Advisor Email

barlow@datascience.msstate.edu

Abstract

As large language models (LLMs) usage grows across different domains, sycophancy, the tendency for output to align with users, is increasingly being recognized as a primary issue arising from applying LLMs into critical areas. Current research has provided a variety of theoretical definitions, mitigation techniques, and quantification for sycophancy. However, there is little to no consistency across different papers. This scoping review seeks to connect different works on LLM sycophancy by identifying themes in theoretical definitions, measurement methods, and inducement techniques of sycophancy. By analyzing 26 papers (preprints, conference proceedings, and journal articles) from arXiv, ACL Anthology, and Scopus, this review has found that currently, LLM sycophancy is being recognized more as an operational tradeoff than manipulation done through flattery. Additionally, this review has found that researchers generally opt to induce sycophancy through a combination of user-authority framing and user-preference signaling. This review provides a standardized taxonomy for research procedures which creates a stronger foundation for future LLM research to be more cross-compatible.

Rights

Permission for reproduction must be obtained from the publisher. Additional rights must be obtained from the authors.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.