February 13, 2024Open Access

Self-Attention Factor-Tuning for Parameter Efficient Fine-Tuning

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract Transformers have revolutionized the fields of Natural Language Processing and Computer Vision - a result of their ability to capture long-range dependencies with their key innovation: the attention mechanism. Despite the success of these models, their growing complexity has led to an ever-increasing need for processing power, making their practical applications less feasible. In recent years, tensor decomposition-based parameter-efficient fine-tuning techniques have emerged as a promising solution to the computational bottleneck. In this research, we investigate the use of a modified version of Factor Tuning that lessens inter-layer associations that the original Factor Tuning creates and focuses exclusively on attention mechanisms. We refer to this method as Self-Attention Factor-Tuning. To evaluate the effectiveness of our approach, we conduct experiments with Vision Transformers using all 19 datasets from the VTAB-1k benchmark for image classification. The results demonstrate that the proposed framework effectively reduces the number of parameters required to fine-tune a transformer, achieving new state-of-the-art performance on three of the 19 datasets in the benchmark and outperforming the original Factor-Tuning paradigm as well as various other competitive techniques, whilst using significantly fewer parameters.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Jason Abohwo (Tue,) studied this question.

www.synapsesocial.com/papers/68e796c9b6db643587707494 — DOI: https://doi.org/10.21203/rs.3.rs-3487308/v2

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Transformers in the Real World: A Survey on NLP Applications· 2023 · 191 citations
Automated Flower Classification over a Large Number of Classes· 2008 · 3,178 citations
Vision Transformers for Classification of Breast Ultrasound Images· 2022 · 191 citations
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks· 2022 · 61 citations
Tensor-Train Decomposition· 2011 · 2,604 citations

Self-Attention Factor-Tuning for Parameter Efficient Fine-Tuning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion