What question did this study set out to answer?

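This study investigates whether large language models behave sycophantically in dementia care settings, that is, whether they adapt their responses to social expectation signals rather than maintaining professional quality.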

April 15, 2026 · Open Access

When AI Tells You What You Want to Hear: Sycophantic Behavior of Large Language Models in Dementia Care Settings

Key Points

This exploratory study tests whether large language models (LLMs) respond sycophantically in dementia care settings.
Four LLMs were evaluated: GPT-5, Claude Sonnet 4.6, Gemini 3.1 Pro, and Mistral Large.
Five prompts with systematically increasing confirmatory and authority-related framing (P1 to P5) were submitted to each model.
An LLM-as-a-Judge methodology scored responses against seven nursing-ethical quality criteria (K1–K7) and a tone scale (0–3).
Each prompt was submitted five times per model, yielding 100 responses.
All four models showed significant negative Spearman correlations between prompt level and response quality.
Correlation coefficients ranged from −0.543 to −0.734.
All correlations were statistically significant (p < 0.01).
Response quality deteriorated as confirmatory and authority framing increased.
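
The response count follows from fully crossing the design: 5 prompts × 4 models × 5 repetitions = 100. The short Python sketch below only enumerates that grid. The names `PROMPTS`, `MODELS`, `REPETITIONS`, and `build_design` are illustrative and not taken from the study's materials, and the labels P2 to P4 are assumed from the P1 to P5 range given in the abstract.

```python
from itertools import product

# Prompt levels: P1 is neutral, P5 is authority-signaled implementation support.
# The intermediate labels P2-P4 are assumed from the stated P1..P5 range.
PROMPTS = ["P1", "P2", "P3", "P4", "P5"]
MODELS = ["GPT-5", "Claude Sonnet 4.6", "Gemini 3.1 Pro", "Mistral Large"]
REPETITIONS = 5  # each prompt is submitted five times per model


def build_design():
    """Return one (model, prompt, repetition) tuple per planned submission."""
    return [
        (model, prompt, rep)
        for model, prompt, rep in product(MODELS, PROMPTS, range(1, REPETITIONS + 1))
    ]


design = build_design()
print(len(design))  # 4 models * 5 prompts * 5 repetitions = 100
```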

Abstract

Large language models (LLMs) are increasingly used in clinical and care settings. This exploratory study investigates whether LLMs exhibit sycophantic behavior — adapting their responses to social expectation signals rather than maintaining professional quality — in the context of dementia care. Five prompts with systematically increasing confirmatory and authority-related framing (P1 neutral to P5 authority-signaled implementation support) were submitted to four LLMs (GPT-5, Claude Sonnet 4.6, Gemini 3.1 Pro, Mistral Large), each repeated five times (N = 100 responses). Responses were evaluated using an LLM-as-a-Judge methodology against seven nursing-ethical quality criteria (K1–K7) and a tone scale (0–3). All models showed significant negative Spearman correlations between prompt level and response quality (ρ ranging from −0.543 to −0.734, all p < 0.01). The findings suggest that LLMs pose context-sensitive risks in high-stakes care environments and that prompt framing significantly shapes response quality.
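
To make the reported statistic concrete, here is a minimal sketch of Spearman's ρ between prompt level and a quality score, computed as the Pearson correlation of tie-averaged ranks. It is not the study's analysis code. The data are invented for illustration, and treating quality as a single number per response (for example, a count of criteria met) is an assumption, since the abstract does not say how K1–K7 were aggregated.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Invented example data, NOT results from the study:
# prompt level (1 = P1 ... 5 = P5) and a hypothetical quality score per response.
prompt_levels = [1, 1, 2, 2, 3, 3, 4, 4, 5, 5]
quality_scores = [7, 6, 6, 7, 5, 4, 4, 3, 2, 3]

rho = spearman_rho(prompt_levels, quality_scores)
print(f"rho = {rho:.3f}")  # negative: quality falls as prompt level rises
```

A negative ρ means that higher prompt levels tend to go with lower quality scores, which is the direction reported for all four models. In practice, a library routine such as `scipy.stats.spearmanr` would also return the p-value.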

Authors

Christian Kolb
