What question did this study set out to answer?

This research investigates whether large language models (LLMs) exhibit a propensity to trust, and how their trust decisions are shaped by context and model capabilities.

May 8, 2026 | Open Access

Propensity to trust in Large Language Models

Key Points

Evaluated nineteen LLMs using a psychological self-report scale adapted from human research and a linguistic simulation framework.
Analyzed trust-related decisions elicited in simulation to characterize how baseline delegation tendencies interact with model capabilities.
Conducted ablation studies to assess how task-specific memory mechanisms affect the integration of trustworthiness cues.
The questionnaire produced uniformly high propensity to trust across models, likely reflecting social-alignment objectives and sycophantic response patterns.
The simulation framework revealed substantial, systematic differences in trust behavior; more capable models, such as GPT-4o-mini, adjust their decisions based on trustworthiness cues.
Other models, such as Llama-2-7B, displayed stable delegation patterns that were largely insensitive to task-specific evidence, leading to systematic over-entrustment.

Abstract

Trust is central to collaborative settings in which large language models (LLMs) are increasingly deployed. Yet little is known about whether LLMs exhibit a propensity to trust (PTT): a baseline tendency to extend or withhold trust that remains relatively stable across contexts. We investigate PTT in nineteen LLMs using two complementary approaches: a psychological self-report scale adapted from human research and a linguistic simulation framework designed to elicit trust-related decisions in context. While the questionnaire produces uniformly high PTT across models—likely reflecting social-alignment objectives and sycophantic response patterns—the simulation framework uncovers substantial, systematic differences in how models entrust others. Our simulations show that trust behavior is governed by the interaction between a baseline tendency to delegate and a model’s capacity to integrate cues about trustworthiness. More capable models, such as GPT-4o-mini, use such cues to adjust their decisions, allowing competence signals to modulate baseline tendencies. By contrast, other models, such as Llama-2-7B, exhibit stable delegation patterns that are largely insensitive to task-specific evidence, leading to systematic over-entrustment. These results show that performance depends not on baseline tendencies alone, but on how they are modulated by alignment-sensitive information. Ablation studies show that task-specific memory mechanisms enable models to better integrate trustworthiness cues, improving the calibration of delegation decisions. More generally, our findings show that questionnaire-based measures cannot disentangle baseline tendencies from context-sensitive adjustment, whereas behavioral simulations make this distinction observable.
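The interaction described in the abstract, between a baseline tendency to delegate and the capacity to integrate trustworthiness cues, can be illustrated with a toy model. The sketch below is not the study's simulation framework: the logistic form, the parameter names (`baseline`, `sensitivity`, `cue`), and all numeric values are assumptions chosen only for illustration. It shows how an agent that ignores cues delegates equally to reliable and unreliable partners, whereas a cue-sensitive agent with the same baseline delegates less to unreliable ones.

```python
import math


def delegation_probability(baseline: float, sensitivity: float, cue: float) -> float:
    """Toy logistic model of the probability of delegating a task to a partner.

    baseline    -- hypothetical baseline tendency to delegate (log-odds)
    sensitivity -- hypothetical weight placed on the trustworthiness cue
    cue         -- competence signal in [-1, 1]; negative means unreliable
    """
    return 1.0 / (1.0 + math.exp(-(baseline + sensitivity * cue)))


def over_entrustment(baseline: float, sensitivity: float, cues: list[float]) -> float:
    """Mean delegation probability toward partners whose cue is negative."""
    unreliable = [c for c in cues if c < 0]
    if not unreliable:
        return 0.0
    total = sum(delegation_probability(baseline, sensitivity, c) for c in unreliable)
    return total / len(unreliable)


# Illustrative values only; none of these numbers come from the study.
cues = [-1.0, -0.5, 0.5, 1.0]
insensitive = over_entrustment(baseline=1.5, sensitivity=0.0, cues=cues)
sensitive = over_entrustment(baseline=1.5, sensitivity=4.0, cues=cues)
print(f"cue-insensitive: {insensitive:.2f}, cue-sensitive: {sensitive:.2f}")
```

With `sensitivity` set to zero, the delegation probability depends on the baseline alone, which is the toy analogue of the stable, evidence-insensitive delegation the abstract attributes to some models.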

Authors

Alice Plebe

Journals

PLoS ONE
