June 5, 2024Open Access

How Truncating Weights Improves Reasoning in Language Models

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

In addition to the ability to generate fluent text in various languages, large language models have been successful at tasks that involve basic forms of logical "reasoning" over their context. Recent work found that selectively removing certain components from weight matrices in pre-trained models can improve such reasoning capabilities. We investigate this phenomenon further by carefully studying how certain global associations tend to be stored in specific weight components or Transformer blocks, in particular feed-forward layers. Such associations may hurt predictions in reasoning tasks, and removing the corresponding components may then improve performance. We analyze how this arises during training, both empirically and theoretically, on a two-layer Transformer trained on a basic reasoning task with noise, a toy associative memory model, and on the Pythia family of pre-trained models tested on simple reasoning tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Chen et al. (Wed,) studied this question.

www.synapsesocial.com/papers/68e660e5b6db6435875ef44c — DOI: https://doi.org/10.48550/arxiv.2406.03068

Authors

Lei Chen

Joan Bruna

Alberto Bietti

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

How Truncating Weights Improves Reasoning in Language Models

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider