BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity
Abstract
Low-rank adaptation (LoRA) is a parameter-efficient fine-tuning (PEFT) method widely used in large language models (LLMs). It approximates the update of a pretrained weight matrix W∈Rm× n by the product of two low-rank matrices, BA, where A ∈Rr× n and B∈Rm× r (r\m,n\). Increasing the dimension r can raise the rank of LoRA weights (i.e., BA), which typically improves fine-tuning performance but also significantly increases the number of trainable parameters. In this paper, we propose Block Diversified Low-Rank Adaptation (BoRA), which improves the rank of LoRA weights with a small number of additional parameters. Specifically, BoRA treats the product BA as a block matrix multiplication, where A and B are partitioned into b blocks along the columns and rows, respectively (i.e., A=[A1,…,Ab] and B=[B1,…,Bb]). Consequently, the product BA becomes the concatenation of the block products BiAj for i,j∈[b]. To enhance the diversity of different block products, BoRA introduces a unique diagonal matrix i,j ∈ Rr× r for each block multiplication, resulting in Bi i,j Aj. By leveraging these block-wise diagonal matrices, BoRA increases the rank of LoRA weights by a factor of b while only requiring b2r additional parameters. Extensive experiments across multiple datasets and models demonstrate the superiority of BoRA, and ablation studies further validate its scalability.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.