What question did this study set out to answer?

This study assesses how robust reported findings on the neural predictivity of large language models are, across a wide range of models, methodological approaches, and three widely used neural datasets.

April 29, 2026

Open Access

Spurious alignment between large language models and brains can emerge from non-robust methods and overlooked confounds

Key Points

Analyzed a wide range of large language models and methodological approaches
Examined three popular neural datasets for robustness
Evaluated the impact of shuffled train-test splits and extraction methods on results
Found that shuffled train-test splits contributed to spurious findings
Identified biases in results related to how activations are extracted from models
Determined that confounding variables, particularly positional signals and word rate, perform competitively with trained LLMs and fully account for the neural predictivity of untrained LLMs

Abstract

Emerging research seeks to draw neuroscientific insights from the neural predictivity of large language models (LLMs). However, as results rapidly proliferate, there is a growing need for large-scale assessments of their robustness. Here, we analyze a wide range of models and methodological approaches across three widely used neural datasets. We find that the use of shuffled train-test splits has contributed to findings that are influential but spurious. Furthermore, how activations are extracted from LLMs can bias results in favor of specific model classes. Lastly, we find that confounding variables, particularly positional signals and word rate, perform competitively with trained LLMs and fully account for the neural predictivity of untrained LLMs on these neural datasets. Although many studies in the field avoid these pitfalls, our results indicate that some apparent alignment between LLMs and brains has emerged from non-robust methods and overlooked confounds.
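The concern about shuffled train-test splits can be illustrated with a toy simulation. The sketch below is not the authors' analysis and uses none of their datasets: it assumes a random walk as a stand-in for a temporally autocorrelated neural response, and a hypothetical "predictor" that knows only each sample's position in time and copies the response of the nearest training sample. All function names are illustrative.

```python
import numpy as np


def r2_score(y_true, y_pred):
    """Coefficient of determination relative to the test-set mean."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


def nearest_in_time_predict(train_idx, y_train, test_idx):
    """Position-only predictor: copy the response of the closest training sample in time."""
    dist = np.abs(test_idx[:, None] - train_idx[None, :])
    return y_train[dist.argmin(axis=1)]


def compare_splits(n=1000, test_frac=0.2, seed=0):
    rng = np.random.default_rng(seed)
    # Assumption: a random walk stands in for a slowly drifting, autocorrelated signal.
    y = np.cumsum(rng.standard_normal(n))
    idx = np.arange(n)
    n_test = int(n * test_frac)

    # Shuffled split: test samples are interleaved with training samples in time.
    perm = rng.permutation(n)
    shuffled = (np.sort(perm[n_test:]), np.sort(perm[:n_test]))

    # Contiguous split: the final block of the recording is held out.
    contiguous = (idx[:-n_test], idx[-n_test:])

    scores = {}
    for name, (train, test) in {"shuffled": shuffled, "contiguous": contiguous}.items():
        pred = nearest_in_time_predict(train, y[train], test)
        scores[name] = r2_score(y[test], pred)
    return scores


if __name__ == "__main__":
    for name, score in compare_splits().items():
        print(f"{name:>10}: R^2 = {score:.3f}")
```

Under these assumptions the position-only predictor scores well on the shuffled split, because every held-out sample has a training neighbor a step or two away, and scores no better than the test-set mean on the contiguous split. This is one way a feature with no linguistic content could appear predictive when evaluation splits are shuffled.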

Authors

Nima Hadidi

Ebrahim Feghhi

Bryan H Song

Journals

Nature Communications

Institutions

University of California, Los Angeles
