Abstract
SingLoRA reformulates low-rank adaptation to learn a single low-rank matrix, forming the weight update from that matrix and its transpose, which stabilizes optimization and roughly halves the parameter count.
Low-Rank Adaptation (LoRA) has significantly advanced parameter-efficient fine-tuning of large pretrained models. LoRA augments the pretrained weights of a model by adding the product of two smaller matrices that together form a low-rank update. Recent research has shown that scale disparities between these two matrices often cause unstable training dynamics, leading to suboptimal performance. In this paper, we propose SingLoRA, which reformulates low-rank adaptation by learning the weight update as a decomposition of a single low-rank matrix multiplied by its transpose. This simple design inherently removes inter-matrix scale conflicts, ensures stable optimization, and roughly halves the parameter count. We analyze SingLoRA within the infinite-width neural network framework, showing that it guarantees stable feature learning by construction. Extensive experiments on multiple tasks validate these benefits. In common sense reasoning, fine-tuning LLaMA 7B on MNLI with SingLoRA achieves 91.3% accuracy, surpassing LoRA (89.1%) and LoRA+ (90.2%), while using only 60% of their parameter budget. In image generation, fine-tuning Stable Diffusion with SingLoRA significantly improves image fidelity on DreamBooth, achieving a DINO similarity score of 0.151, compared to 0.148 for DoRA and 0.143 for LoRA.
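To make the distinction concrete (the notation and the alpha/r scaling below follow the standard LoRA convention and are not quoted from the paper): for a square pretrained weight matrix, LoRA learns two factors of different shapes, while SingLoRA learns a single factor and reuses its transpose.

```latex
% LoRA: two trainable factors with different shapes
W = W_0 + \tfrac{\alpha}{r}\, B A, \qquad B \in \mathbb{R}^{n \times r},\quad A \in \mathbb{R}^{r \times n}

% SingLoRA: a single trainable factor combined with its own transpose
W = W_0 + \tfrac{\alpha}{r}\, A A^{\top}, \qquad A \in \mathbb{R}^{n \times r}
```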
Community
SingLoRA simplifies low-rank adaptation by using a single matrix instead of two, which avoids scale mismatches between the factors and reduces the parameter count. This simpler design yields more stable training and outperforms LoRA and its variants on both language and image tasks.
Made a simple implementation of this paper because there was no official GitHub repository.
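For reference, here is a minimal PyTorch sketch of a SingLoRA-style linear layer (an illustration under assumptions, not the commenter's code or the authors' release): the class name, initialization, and alpha/rank scaling are my choices, only square weight matrices are handled, and the paper's full formulation is not reproduced.

```python
import math

import torch
import torch.nn as nn


class SingLoRALinear(nn.Module):
    """Frozen linear layer plus a SingLoRA-style update: W = W0 + (alpha / r) * A @ A.T."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 8.0):
        super().__init__()
        if base.in_features != base.out_features:
            raise ValueError("this sketch only handles square weight matrices")
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # keep the pretrained weights frozen
        self.scale = alpha / rank
        # Single trainable low-rank factor A of shape (n, r); init scale is a guess
        self.A = nn.Parameter(torch.randn(base.in_features, rank) / math.sqrt(rank))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen base output plus the symmetric low-rank update x @ (A @ A.T), scaled
        return self.base(x) + self.scale * ((x @ self.A) @ self.A.t())


# Usage: wrap an existing square projection and fine-tune only A.
layer = SingLoRALinear(nn.Linear(768, 768), rank=8)
out = layer(torch.randn(2, 768))
```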
arXiv explained breakdown of this paper 👉 https://arxivexplained.com/papers/singlora-low-rank-adaptation-using-a-single-matrix
I would like to know how this method performs compared to LoRI and FFA-LoRA (i.e., freezing A and optimizing only B).
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API:
- Localized LoRA: A Structured Low-Rank Approximation for Efficient Fine-Tuning (2025)
- Singular Value Decomposition on Kronecker Adaptation for Large Language Model (2025)
- Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation (2025)
- LoRMA: Low-Rank Multiplicative Adaptation for LLMs (2025)
- DiaBlo: Diagonal Blocks Are Sufficient For Finetuning (2025)
- AFLoRA: Adaptive Federated Fine-Tuning of Large Language Models with Resource-Aware Low-Rank Adaption (2025)
- MLorc: Momentum Low-rank Compression for Large Language Model Adaptation (2025)
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend