June 29, 2024Open Access

Shared functional specialization in transformer-based language models and the human brain

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract When processing language, the brain is thought to deploy specialized computations to construct meaning from complex linguistic structures. Recently, artificial neural networks based on the Transformer architecture have revolutionized the field of natural language processing. Transformers integrate contextual information across words via structured circuit computations. Prior work has focused on the internal representations (“embeddings”) generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into the functionally-specialized “transformations” that integrate contextual information across words. Using functional MRI data acquired while participants listened to naturalistic stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent computations performed by individual, functionally-specialized “attention heads” differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers and context lengths in a low-dimensional cortical space.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Sreejan Kumar

Theodore R. Sumers

Takateru Yamakoshi

Journals

Nature Communications

Actions

Institutions

Princeton University

The University of Tokyo

Hebrew University of Jerusalem

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Shared functional specialization in transformer-based language models and the human brain

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider