What type of study is this?

This is a Quantitative Study study.

October 5, 2025Open Access

Support Basis: Fast Attention Beyond Bounded Entries

Puntos clave

Support-basis decomposition achieves sub-quadratic attention approximation in large language models.
The approach splits query and key matrix entries based on their behavior, enabling efficient computation.
Theoretical guarantees prove the method's runtime efficiency without distributional assumptions.
Polynomial attention approximates softmax attention successfully, expanding applicability beyond bounded entries.

Resumen

The quadratic complexity of softmax attention remains a central bottleneck in scaling large language models (LLMs). Alman and Song, NeurIPS 2023 proposed a sub-quadratic attention approximation algorithm, but it works only under the restrictive bounded-entry assumption. Since this assumption rarely holds in practice, its applicability to modern LLMs is limited. In this paper, we introduce support-basis decomposition, a new framework for efficient attention approximation beyond bounded entries. We empirically demonstrate that the entries of the query and key matrices exhibit sub-Gaussian behavior. Our approach uses this property to split large and small entries, enabling exact computation on sparse components and polynomial approximation on dense components. We establish rigorous theoretical guarantees, proving a sub-quadratic runtime, and extend the method to a multi-threshold setting that eliminates all distributional assumptions. Furthermore, we provide the first theoretical justification for the empirical success of polynomial attention Kacham, Mirrokni, and Zhong, ICML 2024, showing that softmax attention can be closely approximated by a combination of multiple polynomial attentions with sketching.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Aliakbarpour et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68e25385d6d66a53c2474ca3 — DOI: https://doi.org/10.48550/arxiv.2510.01643

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Authors

Maryam Aliakbarpour

Vladimir Braverman

Junze Yin

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Support Basis: Fast Attention Beyond Bounded Entries

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion