What type of study is this?

This is a Quantitative Study study.

October 10, 2025Open Access

Learning Linear Regression with Low-Rank Tasks in-Context

Key Points

The study reveals that the generalization error shows a sharp phase transition influenced by task structure.
Statistical fluctuations in pre-training data create implicit regularization that affects prediction quality.
A linear attention model effectively characterizes predictions and identifies distribution patterns in high-dimensional settings.
The work provides a theoretical framework for understanding how transformers learn from structured tasks in real-world applications.

Abstract

In-context learning (ICL) is a key building block of modern large language models, yet its theoretical mechanisms remain poorly understood. It is particularly mysterious how ICL operates in real-world applications where tasks have a common structure. In this work, we address this problem by analyzing a linear attention model trained on low-rank regression tasks. Within this setting, we precisely characterize the distribution of predictions and the generalization error in the high-dimensional limit. Moreover, we find that statistical fluctuations in finite pre-training data induce an implicit regularization. Finally, we identify a sharp phase transition of the generalization error governed by task structure. These results provide a framework for understanding how transformers learn to learn the task structure.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Kaito Takanami

T. Takahashi

Yoshiyuki Kabashima

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Learning Linear Regression with Low-Rank Tasks in-Context

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study