Critical or Compliant? The Double-Edged Sword of Reasoning in Chain-of-Thought Explanations

arXiv:2511.12001v2 [cs.CL, cs.HC], 2025-11-20

Authors:

Eunkyu Park, Wesley Hanwen Deng, Vasudha Varadarajan, Mingxi Yan, Gunhee Kim, Maarten Sap, Motahhare Eslami

Abstract

Explanations are often promoted as tools for transparency, but they can also foster confirmation bias: users may assume reasoning is correct whenever outputs appear acceptable. We study this double-edged role of Chain-of-Thought (CoT) explanations in multimodal moral scenarios by systematically perturbing reasoning chains and manipulating delivery tones. Specifically, we analyze reasoning errors in vision-language models (VLMs) and how they affect user trust and the ability to detect errors. Our findings reveal two key effects: (1) users often equate trust with outcome agreement, sustaining reliance even when reasoning is flawed, and (2) a confident tone suppresses error detection while maintaining reliance, showing that delivery style can override correctness. These results highlight how CoT explanations can simultaneously clarify and mislead, underscoring the need for NLP systems to provide explanations that encourage scrutiny and critical thinking rather than blind trust. All code will be released publicly.
