What question did this study set out to answer?

The study explores how a linear direction in transformer hidden states can separate computation tasks from retrieval tasks.

April 10, 2026Open Access

A Single Direction Separates Computation from Retrieval in Transformer Hidden States

Key Points

The study explores how a linear direction in transformer hidden states can separate computation tasks from retrieval tasks.
Analyzed hidden states in transformer models across different subjects like physics and geography.
Evaluated the model’s ability to distinguish recall from reasoning using area under the curve (AUC) metrics.
Investigated classification of reasoning and factual errors among incorrect answers in MMLU.
Validated findings across twelve models with varying architecture sizes from 70 million to 30 billion parameters.
Achieved AUC scores ranging from 0.996 to 1.000 in distinguishing recall from reasoning.
Generalized classification of reasoning errors versus factual errors with AUC scores from 0.878 to 0.951.
Demonstrated that the distinguishing direction is nearly orthogonal to correctness metrics.
Detection efficiency achieved through a single dot product per prompt, eliminating the need for model training or modifications.

Abstract

Transformer hidden states contain a single linear direction that separates retrieval from computation. Within individual subjects like physics and geography, it distinguishes recall from reasoning at AUC 0.996 to 1.000. It also generalizes beyond clean category boundaries: among wrong MMLU answers, it classifies reasoning errors versus factual errors at AUC 0.878–0.951. We validated this across twelve models spanning five architecture families, from 70M to 30B parameters, and the direction remains nearly orthogonal to correctness throughout. Detection requires only one dot product per prompt, with no training or model modification.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Sam Ramdan

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

A Single Direction Separates Computation from Retrieval in Transformer Hidden States

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider