Papers
Topics
Authors
Recent
Search
2000 character limit reached

Bi-Dimensional Decomposition (BoRA)

Updated 15 June 2026
  • Bi-dimensional Decomposition (BoRA) is a method that exploits two-dimensional block or symmetric decompositions to enhance fine-tuning and logic synthesis.
  • It extends low-rank adaptation techniques by partitioning weight updates to raise effective rank with only a small parameter overhead.
  • Empirical results show BoRA's superior performance in natural language understanding and digital logic, outperforming standard approaches with minor extra cost.

Bi-dimensional Decomposition (BoRA) refers to a class of strategies for enhancing expressivity and efficiency in neural network adaptation and Boolean logic synthesis by exploiting structured, two-dimensional (bi-dimensional) decompositions—typically involving block diversity, symmetric magnitude modulation, or optimal variable partitioning. Three prominent lines in the literature employ the "BoRA" terminology: Block-Diversified Low-Rank Adaptation for parameter-efficient fine-tuning in deep learning (Li et al., 9 Aug 2025), Bi-dimensional Weight-Decomposed Low-Rank Adaptation for magnitude-symmetric neural adaptation (Wang et al., 2024), and QBF-based Boolean Function Bi-Decomposition in logic synthesis (Chen et al., 2011). All BoRA approaches share the use of two-way decompositions that increase structural or functional diversity and improve task performance under resource constraints.

1. Background and Motivation

Parameter-efficient fine-tuning (PEFT) has become essential for adapting large neural models with minimal overhead, typically using low-rank adaptation (LoRA). LoRA decomposes an update to a pretrained weight WRm×nW \in \mathbb{R}^{m \times n} as BABA with ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}, rmin(m,n)r \ll \min(m, n), yielding an update of rank at most rr with (m+n)r(m+n)r parameters. Increasing rr improves capacity but quickly increases costs.

Block-Diversified Low-Rank Adaptation (BoRA) (Li et al., 9 Aug 2025) extends LoRA by imposing a two-dimensional block structure and introducing block-wise diagonal modulations to maximize expressivity with only a small increase in parameter count. Separately, the bi-dimensional weight-decomposed BoRA approach (Wang et al., 2024) introduces symmetric row- and column-wise magnitude modulation on adapter weights, and QBF-based BoRA in logic synthesis (Chen et al., 2011) seeks optimal variable partitioning for Boolean function bi-decomposition. In each context, bi-dimensionality (via explicit row/column or blockwise structure) enables a richer adaptation or decomposition.

2. Block-Diversified Low-Rank Adaptation (BoRA) in Neural Networks

BoRA (Li et al., 9 Aug 2025) generalizes LoRA by treating its low-rank parameter update as a grid of b×bb \times b blocks. Specifically, AA is partitioned by columns and BB by rows:

  • BABA0, BABA1
  • BABA2, BABA3

The update becomes

BABA4

Each sub-block BABA5. To break inter-block coupling and raise the effective rank, BoRA inserts for each block a diagonal matrix BABA6: BABA7 With BABA8 such matrices, total parameters increase only by BABA9, since each ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}0 is diagonal. The update generalizes to

ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}1

where ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}2 concatenates all row blocks, ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}3 stacks the column blocks, and ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}4 is block-diagonal from all ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}5.

Effective rank: BoRA raises the upper bound of the update rank from ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}6 (the LoRA limit) to ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}7. This allows higher expressivity at fixed or slightly increased parameter count.

Parameter overhead: Adding ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}8 parameters for ARr×n,BRm×rA \in \mathbb{R}^{r \times n}, B \in \mathbb{R}^{m \times r}9 to rmin(m,n)r \ll \min(m, n)0 for rmin(m,n)r \ll \min(m, n)1 and rmin(m,n)r \ll \min(m, n)2 is negligible when rmin(m,n)r \ll \min(m, n)3 and rmin(m,n)r \ll \min(m, n)4 are small and rmin(m,n)r \ll \min(m, n)5 are large.

Implementation: For each input, the computation is blockwise: project each input slice through its rmin(m,n)r \ll \min(m, n)6, modulate via rmin(m,n)r \ll \min(m, n)7, combine across rmin(m,n)r \ll \min(m, n)8 for each output block rmin(m,n)r \ll \min(m, n)9, then project via rr0 and sum.

3. Bi-dimensional Weight-Decomposed Low-Rank Adaptation (BoRA) for Symmetric Modulation

The variant introduced in (Wang et al., 2024) addresses the asymmetry of DoRA, which only introduces column-wise (vertical) magnitude scaling. This BoRA symmetrically modulates both rows and columns:

  1. Adapter construction: Given frozen rr1,

rr2

with learnable rr3 and rr4.

  1. Row-wise normalization and scaling: Normalize each row of rr5 and scale by rr6:

rr7

  1. Column-wise normalization and scaling: Normalize each column of rr8 and scale by rr9:

(m+n)r(m+n)r0

This approach aligns the adaptation to both input (column) and output (row) sensitivities, providing a bi-dimensional symmetry that matches empirical patterns in full-parameter fine-tuning.

4. Rank, Parameter, and Compute Characteristics

Method Update Rank Trainable Parameters Extra Overhead
LoRA (m+n)r(m+n)r1 (m+n)r(m+n)r2 (m+n)r(m+n)r3 FLOPs
BoRA (block) (m+n)r(m+n)r4 (m+n)r(m+n)r5 (m+n)r(m+n)r6
DoRA (m+n)r(m+n)r7 (m+n)r(m+n)r8 (m+n)r(m+n)r92x LoRA
BoRA (symm) rr0 rr1 2 normalizations

With rr2, rr3 small, and rr4, additional costs from BoRA in either formulation are minor relative to gains in expressivity or transfer performance (Li et al., 9 Aug 2025, Wang et al., 2024).

5. Empirical Results and Comparative Performance

Block-partitioned BoRA with rr5 and rr6–rr7 consistently outperforms vanilla LoRA (same rank) by rr8–rr9 absolute across natural language understanding (GLUE/commonsense reasoning) and reasoning (Gemma/LLaMA/Qwen) tasks, and often matches or exceeds LoRA at b×bb \times b0 the rank (Li et al., 9 Aug 2025).

Bi-dimensional magnitude-modulated BoRA outperforms LoRA and DoRA across MT-Bench and commonsense NLU with only slight parameter increase, e.g., Llama-2-7B: BoRA (2.35% params) achieves 6.76 (MT-Bench), compared to LoRA (2.32%) at 6.16 and DoRA (2.33%) at 6.38 (Wang et al., 2024).

Ablation studies indicate both diagonal parameterization and normalization in b×bb \times b1 are crucial, with weak utilization or gradient issues if omitted. Singular value analysis confirms that BoRA achieves substantially more effective singular values in b×bb \times b2, aligning with its theoretical rank increase.

6. QBF-Based BoRA in Boolean Function Bi-Decomposition

In digital logic, bi-decomposition refers to splitting a Boolean function b×bb \times b3 into two subfunctions combined by a binary gate. The QBF-based BoRA scheme (Chen et al., 2011) achieves optimal variable partitioning for

b×bb \times b4

where b×bb \times b5 is partitioned into b×bb \times b6, b×bb \times b7, b×bb \times b8, and b×bb \times b9. The method uses Quantified Boolean Formulas (QBF) over partition indicator variables AA0 and universally-quantified copies of inputs to enforce the decomposition and desired metrics (disjointness, balancedness).

Empirically, this method yields strictly higher-quality decompositions versus prior SAT/BDD-based approaches, with optimal guarantees and scalable performance on industrial circuits. Disjointness, balancedness, and joint metrics improve across 13%–80% of test cases versus SAT-based baselines, at modest additional computational cost.

7. Extensions, Limitations, and Perspectives

BoRA offers a principled route to increased expressivity with controllable overhead in both neural PEFT and Boolean logic synthesis. In the adaptation setting:

  • Scaling AA1 increases effective update rank until overfitting occurs.
  • Extra diagonal parameters are negligible for practical AA2, AA3.
  • The block/symmetric decompositions integrate with existing architectures without architectural redesign.

Potential extensions include grouping rows/columns for structured sharing, learning asymmetric row/column ranks, and combining with quantized or further-structured modules. Limitations include the possibility of overfitting if AA4 is too large or if not paired with sufficient regularization. Slight increases in computation (1–5%) are observed due to block or normalization operations.

In summary, bi-dimensional decomposition via BoRA defines a family of methods wherein blockwise or symmetric modulation/gating multiplies capacity and expressivity at minimal additional cost, with robust empirical advantages across architectures and domains (Li et al., 9 Aug 2025, Wang et al., 2024, Chen et al., 2011).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Bi-dimensional Decomposition (BoRA).