May 20, 2024Open Access

Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Facing the current debate on whether Large Language Models (LLMs) attain near-human intelligence levels (Mitchell Bubeck et al., 2023; Kosinski, 2023; Shiffrin Ullman, 2023), the current study introduces a benchmark for evaluating social intelligence, one of the most distinctive aspects of human cognition. We developed a comprehensive theoretical framework for social dynamics and introduced two evaluation tasks: Inverse Reasoning (IR) and Inverse Inverse Planning (IIP). Our approach also encompassed a computational model based on recursive Bayesian inference, adept at elucidating diverse human behavioral patterns. Extensive experiments and detailed analyses revealed that humans surpassed the latest GPT models in overall performance, zero-shot learning, one-shot generalization, and adaptability to multi-modalities. Notably, GPT models demonstrated social intelligence only at the most basic order (order = 0), in stark contrast to human social intelligence (order >= 2). Further examination indicated a propensity of LLMs to rely on pattern recognition for shortcuts, casting doubt on their possession of authentic human-level social intelligence. Our codes, dataset, appendix and human data are released at https://github.com/bigai-ai/Evaluate-n-Model-Social-Intelligence.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Junqi Wang

Chunhui Zhang

Jiapeng Li

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study