June 16, 2024Open Access

Mixture-of-Subspaces in Low-Rank Adaptation

Key Points

Key points are not available for this paper at this time.

Abstract

In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study such a phenomenon, we revisit it through a fine-grained subspace lens, showing that such modification is equivalent to employing a fixed mixer to fuse the subspaces. To be more flexible, we jointly learn the mixer with the original LoRA weights, and term the method Mixture-of-Subspaces LoRA (MoSLoRA). MoSLoRA consistently outperforms LoRA on tasks in different modalities, including commonsense reasoning, visual instruction tuning, and subject-driven text-to-image generation, demonstrating its effectiveness and robustness. Codes are available at https: //github. com/wutaiqiang/MoSLoRAgithub.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Wu et al. (Sun,) studied this question.

www.synapsesocial.com/papers/68e64883b6db6435875d9f29 — DOI: https://doi.org/10.48550/arxiv.2406.11909

Authors

Taiqiang Wu

Jiahao Wang

Zhe Zhao

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Mixture-of-Subspaces in Low-Rank Adaptation

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion