
Mixture-of-Subspaces in Low-Rank Adaptation

Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong

2024-06-16

Tasks: Question Answering, Text-to-Image Generation, Sentence Completion, Common Sense Reasoning, Image Generation, Visual Question Answering

Paper · PDF · Code (official)

Abstract

In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study this phenomenon, we revisit it through a fine-grained subspace lens, showing that such modification is equivalent to employing a fixed mixer to fuse the subspaces. To be more flexible, we jointly learn the mixer with the original LoRA weights, and term the method Mixture-of-Subspaces LoRA (MoSLoRA). MoSLoRA consistently outperforms LoRA on tasks in different modalities, including commonsense reasoning, visual instruction tuning, and subject-driven text-to-image generation, demonstrating its effectiveness and robustness. Code is available at https://github.com/wutaiqiang/MoSLoRA.
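The core change relative to vanilla LoRA is small enough to sketch directly. Vanilla LoRA learns a low-rank update ΔW = BA; per the abstract, MoSLoRA inserts a learnable r×r mixer M between the two projections, ΔW = BMA, so the subspaces spanned by A and B can be fused flexibly (plain LoRA corresponds to fixing M to the identity). The PyTorch sketch below is an illustrative reading of the abstract, not the official code (that lives in the linked repository); the class name, initialization choices, and hyperparameters are assumptions.

```python
import torch
import torch.nn as nn


class MoSLoRALinear(nn.Module):
    """Sketch of a LoRA layer with a learnable r x r mixer M between the
    down-projection A and the up-projection B, i.e. delta_W = B @ M @ A.
    Plain LoRA corresponds to freezing M as the identity matrix.
    Class name, init, and hyperparameters are illustrative assumptions."""

    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)          # frozen pretrained weight
        self.lora_A = nn.Parameter(torch.empty(r, in_features))
        self.mixer = nn.Parameter(torch.empty(r, r))    # learnable subspace mixer M
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))  # zeros => delta_W = 0 at start
        nn.init.kaiming_uniform_(self.lora_A, a=5 ** 0.5)
        nn.init.kaiming_uniform_(self.mixer, a=5 ** 0.5)
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = x W^T + s * (x A^T M^T B^T), equivalent to delta_W = B @ M @ A
        delta = x @ self.lora_A.T @ self.mixer.T @ self.lora_B.T
        return self.base(x) + self.scaling * delta


# Usage: adapt a 4096-d projection with a rank-16 update.
layer = MoSLoRALinear(4096, 4096, r=16, alpha=32)
out = layer(torch.randn(2, 4096))   # shape (2, 4096)
```

Note that the mixer adds only r² extra trainable parameters per adapted layer, which is negligible next to the r·(d_in + d_out) parameters of A and B, which is why the method stays computationally cheap.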

Results

| Task | Dataset | Metric | Value | Model |
|------|---------|--------|-------|-------|
| Question Answering | SIQA | Accuracy | 81 | LLaMA-3 8B + MoSLoRA (fine-tuned) |
| Question Answering | PIQA | Accuracy | 89.7 | LLaMA-3 8B + MoSLoRA |
| Question Answering | BoolQ | Accuracy | 74.6 | LLaMA-3 + MoSLoRA |
| Question Answering | OpenBookQA | Accuracy | 86.8 | LLaMA-3 8B + MoSLoRA |
| Visual Question Answering (VQA) | MMBench | GPT-3.5 score | 73.8 | LLaVA-InternLM2-ViT + MoSLoRA |
| Visual Question Answering (VQA) | MMBench | GPT-3.5 score | 73 | LLaVA-LLaMA3-8B-ViT + MoSLoRA |
| Visual Question Answering (VQA) | MM-Vet | GPT-4 score | 35.2 | LLaVA-InternLM2-7B-ViT + MoSLoRA |
| Visual Question Answering (VQA) | MM-Vet | GPT-4 score | 35.2 | InternLM2 + ViT (QMoSLoRA) |
| Common Sense Reasoning | WinoGrande | Accuracy | 85.8 | LLaMA-3 8B + MoSLoRA |
| Common Sense Reasoning | ARC (Challenge) | Accuracy | 81.5 | LLaMA-3 8B + MoSLoRA (fine-tuned) |
| Common Sense Reasoning | ARC (Easy) | Accuracy | 90.5 | LLaMA-3 8B + MoSLoRA (fine-tuned) |
| Sentence Completion | HellaSwag | Accuracy | 95 | LLaMA-3 + MoSLoRA |
