Sycophancy Claims about Language Models: The Missing Human-in-the-Loop

2512.00656v1 cs.CL, cs.CY 2025-12-04

Авторы:

Jan Batzner, Volker Stocker, Stefan Schmid, Gjergji Kasneci

Abstract

Sycophantic response patterns in Large Language Models (LLMs) have been increasingly claimed in the literature. We review methodological challenges in measuring LLM sycophancy and identify five core operationalizations. Despite sycophancy being inherently human-centric, current research does not evaluate human perception. Our analysis highlights the difficulties in distinguishing sycophantic responses from related concepts in AI alignment and offers actionable recommendations for future research.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Найти цитирования в Google Scholar
Поиск в Semantic Scholar
Другие статьи категории cs.CL, cs.CY

Sycophancy Claims about Language Models: The Missing Human-in-the-Loop

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Identifying attributions of causality in political text

CAIRNS: Balancing Readability and Scientific Accuracy in Climate Adaptation Ques...

Gender Bias in Emotion Recognition by Large Language Models

Analysing Personal Attacks in U.S. Presidential Debates

PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reas...

Навигация