What question did this study set out to answer?

The research aims to explore whether AI can replicate complex human social cognition through facial cues.

April 10, 2026Open Access

Towards functional social cognition in machines: comparing human and AI attribution of mental states from facial cues

Key Points

The research aims to explore whether AI can replicate complex human social cognition through facial cues.
Developed a cognitive empathy task focusing on moral judgement, intention attribution, and interpersonal trust.
Administered the task to 230 human participants and five AI models (ChatGPT-4o, Claude, Gemini, Grok, Mistral).
Analyzed responses using hierarchical clustering and Fisher’s exact tests.
ChatGPT-4o, Grok, and Gemini clustered closely with human responses.
Claude diverged significantly while Mistral showed partial overlap with humans.
AI models demonstrated the ability to simulate nuanced cognitive empathy inference.

Abstract

Abstract As artificial intelligence systems become increasingly embedded in socially sensitive contexts, a central question arises: can they replicate complex forms of human social cognition? To investigate this question, we developed and validated a novel full-face cognitive empathy task designed to probe nuanced dimensions such as moral judgement, intention attribution and interpersonal trust. The task was administered to 230 human participants and five leading artificial intelligence models (ChatGPT-4o, Claude, Gemini, Grok and Mistral). Hierarchical clustering based on Jaccard distance revealed that ChatGPT-4o, Grok and Gemini formed a cohesive cluster closely aligned with responses observed in the human sample, while Claude diverged and Mistral showed partial overlap. Fisher’s exact tests confirmed that the ChatGPT–Grok–Gemini cluster differed minimally from humans across all dimensions. These findings demonstrate that general-purpose artificial intelligence systems can now functionally simulate nuanced dimensions of cognitive empathy inference, as reflected in their alignment with the response pattern observed in the human participant of this study, with surprising fidelity. This opens the door to real-world applications such as social cognitive virtual assistants, diagnostic tools in mental health, conflict resolution systems, socially aware robots and adaptive educational platforms. However, the observed variability between models cautions against assuming uniform performance. Our paradigm provides a rigorous benchmark for evaluating social cognition in artificial intelligence and supports its responsible deployment in socially complex environments.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Carlota Márquez-Pedregal

Patricia Pantaleón-Menéndez

Óscar Delgado Ben Mohatar

Journals

Royal Society Open Science

Actions

Institutions

Universidad Autónoma de Madrid

Hospital Universitario Ramón y Cajal

Universidad Rey Juan Carlos

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Towards functional social cognition in machines: comparing human and AI attribution of mental states from facial cues

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider