
Linear Gradient Matching in Deep Learning

Updated 23 November 2025
  • Linear gradient matching is a technique that formulates per-layer linear systems with gradient constraints to reconstruct training data or merge models effectively.
  • It employs least-squares solutions and gradient-matching corrections to resolve underdetermined systems, addressing privacy leakage and model aggregation challenges.
  • Empirical studies show that uncertainty-based gradient matching enhances accuracy and robustness in vision and language models through optimized gradient alignment.

Linear gradient matching refers to a family of methodologies in deep learning that leverage linear system formulations and gradient alignment constraints for either reconstructing training data from gradients or linearly merging models by optimizing gradient agreement. Two prominent frameworks—training-data reconstruction via linear systems with gradient matching (Chen et al., 2022) and model merging via uncertainty-based gradient matching (Daheim et al., 2023)—illustrate the theoretical and practical landscape of linear gradient matching.

1. Formulation of Per-Layer Linear Systems in Deep Networks

In the context of training-data leakage, linear gradient matching exploits the fact that both the forward and backward passes of a neural network layer yield linear relationships between the network inputs and gradients. Consider a deep network with $d$ layers, where for the $i$-th layer:

  • $\bm{w}^{(i)} \in \mathbb{R}^{m_i \times n_i}$: layer weights
  • $\bm{b}^{(i)} \in \mathbb{R}^{m_i}$: biases
  • $\bm{x}^{(i)} \in \mathbb{R}^{n_i}$: input to layer
  • $\bm{z}^{(i)} = \bm{w}^{(i)}\bm{x}^{(i)} + \bm{b}^{(i)}$: pre-activation
  • $\alpha^{(i)}$: invertible activation

The forward constraint is $\bm{w}^{(i)}\bm{x}^{(i)} + \bm{b}^{(i)} = \bm{z}^{(i)}$, and the gradient constraint is $\nabla_{\bm{z}^{(i)}} L \, \bm{x}^{(i)\top} = \nabla_{\bm{w}^{(i)}} L$, vectorized appropriately. These are jointly represented as the linear system $\bm{u}^{(i)} \bm{x}^{(i)} = \bm{v}^{(i)}$, where $\bm{u}^{(i)}$ and $\bm{v}^{(i)}$ stack the forward and backward constraints. The least-squares solution is used when the coefficient matrix is not square or is rank-deficient, which is common in convolutional layers (Chen et al., 2022).
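As an illustrative sketch (not the paper's implementation), the per-layer system for a toy fully-connected layer can be assembled and solved with ordinary least squares. Here `g_z` denotes the pre-activation gradient $\nabla_{\bm{z}} L$, and all shapes and values are assumptions for the demo:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 3, 4                                  # toy layer: m outputs, n inputs
w = rng.normal(size=(m, n))
b = rng.normal(size=m)
x_true = rng.normal(size=n)                  # the input we try to recover
z = w @ x_true + b                           # pre-activation (forward pass)
g_z = rng.normal(size=m)                     # pre-activation gradient dL/dz
dW = np.outer(g_z, x_true)                   # weight gradient dL/dw = g_z x^T

# Stack the forward constraint (w x = z - b) and the gradient constraints
# (g_z[k] * x equals the k-th row of dL/dw) into one linear system u x = v.
u = np.vstack([w] + [g_z[k] * np.eye(n) for k in range(m)])
v = np.concatenate([z - b] + [dW[k] for k in range(m)])

x_rec, *_ = np.linalg.lstsq(u, v, rcond=None)
print(np.allclose(x_rec, x_true))            # → True
```

Because the stacked system here has full column rank, the least-squares solution recovers the input exactly; in convolutional layers the analogous system is typically rank-deficient and only determines the input up to a null-space component.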

2. Gradient-Matching Correction for Reconstruction or Merging

The space of solutions to the underdetermined linear system often contains a nontrivial null-space, particularly for convolutional architectures. To resolve this ambiguity, a gradient-matching correction step is performed.

In reconstruction, one seeks $\bm{x}^{(i)}$ that matches not only the linear constraints but also yields the true downstream gradients on all weights, typically by minimizing the cosine distance between true and candidate gradients:

$$\min_{\bm{x}^{(i)}} \, \mathcal{D}\big(\bm{g}(\bm{x}^{(i)}), \bm{g}_\text{true}^{(i)}\big) \quad \text{s.t.} \quad \bm{u}^{(i)} \bm{x}^{(i)} = \bm{v}^{(i)}$$

where $\mathcal{D}$ denotes cosine distance. In practice, the constraint can be relaxed via a penalty, and regularized with total variation if needed. Optimization leverages automatic differentiation and standard optimizers (e.g., Adam) over a small number of iterations (Chen et al., 2022).
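A minimal numerical sketch of the relaxed (penalized) objective follows. Plain finite-difference gradient descent stands in for Adam with autodiff, and the penalty weight `lam`, the learning rate, and the toy dimensions are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 3, 4
w = rng.normal(size=(m, n))
x_true = rng.normal(size=n)
g_z = rng.normal(size=m)                       # pre-activation gradient dL/dz
g_true = np.outer(g_z, x_true).ravel()         # true weight gradients (flattened)

# Linear system u x = v from the forward and gradient constraints.
u = np.vstack([w] + [g_z[k] * np.eye(n) for k in range(m)])
v = np.concatenate([w @ x_true, g_true])

def cos_dist(a, b):
    return 1.0 - a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)

lam = 1.0                                      # penalty weight (assumed)
def J(x):
    g = np.outer(g_z, x).ravel()               # candidate weight gradients
    return cos_dist(g, g_true) + lam * np.sum((u @ x - v) ** 2)

x = rng.normal(size=n)                         # random initialization
J0 = J(x)
eps, lr = 1e-6, 0.02
for _ in range(500):                           # gradient descent, finite differences
    grad = np.array([(J(x + eps * e) - J(x - eps * e)) / (2 * eps)
                     for e in np.eye(n)])
    x -= lr * grad
```

In this toy setting the penalized objective decreases and the iterate approaches `x_true`; in the underdetermined convolutional case the penalty only pins down the component of the input inside the row space, and the cosine term resolves the remaining null-space ambiguity.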

For model merging, the gradient-matching objective instead seeks linear combinations of model weights yielding minimal mismatch between the merged model’s gradients and those of the constituent models, typically expressed as:

$$D(\alpha) = \Big\| \nabla L(\bar{\theta}) - \sum_{t=1}^T \alpha_t \nabla L_t(\theta_t) \Big\|^2$$

Reducing this metric correlates with improved merged-model accuracy (Daheim et al., 2023).
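The mismatch metric itself is straightforward to compute; the following sketch (with made-up toy gradients) shows that it vanishes exactly when the merged model's gradient equals the $\alpha$-weighted combination:

```python
import numpy as np

def gradient_mismatch(alpha, grad_merged, task_grads):
    """D(alpha) = || grad L(theta_bar) - sum_t alpha_t * grad L_t(theta_t) ||^2."""
    combo = sum(a * g for a, g in zip(alpha, task_grads))
    return float(np.sum((grad_merged - combo) ** 2))

g1, g2 = np.array([1.0, 0.0]), np.array([0.0, 2.0])
alpha = [0.5, 0.5]
merged = 0.5 * g1 + 0.5 * g2                 # merged gradient matches the combination
print(gradient_mismatch(alpha, merged, [g1, g2]))   # → 0.0
```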

3. Solvability, Set-Identifiability, and Security Metrics

A linear system $\bm{u}^{(i)} \bm{x}^{(i)} = \bm{v}^{(i)}$ is solvable if and only if $\mathrm{rank}(\bm{u}^{(i)}) = \mathrm{rank}([\bm{u}^{(i)} \mid \bm{v}^{(i)}])$. Solution uniqueness holds when $\mathrm{rank}(\bm{u}^{(i)}) = n_i$; otherwise, null-space ambiguity persists.
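These two rank conditions translate directly into code; a small sketch with assumed toy systems covering the three regimes:

```python
import numpy as np

def system_status(u, v):
    """Return (solvable, unique) for u x = v via the rank conditions."""
    r = np.linalg.matrix_rank(u)
    r_aug = np.linalg.matrix_rank(np.column_stack([u, v]))
    solvable = bool(r == r_aug)
    unique = bool(solvable and r == u.shape[1])
    return solvable, unique

u = np.array([[1.0, 0.0], [0.0, 1.0]])       # full column rank, consistent
print(system_status(u, np.array([1.0, 2.0])))        # (True, True)
wide = np.array([[1.0, 1.0]])                # rank 1 < n = 2: null-space ambiguity
print(system_status(wide, np.array([3.0])))          # (True, False)
bad = np.array([[1.0, 0.0], [2.0, 0.0]])     # inconsistent right-hand side
print(system_status(bad, np.array([1.0, 3.0])))      # (False, False)
```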

To systematically quantify a network’s vulnerability to gradient leakage, the security index is defined as

$$c(M) = \sum_{i=1}^d \frac{d-(i-1)}{d}\,\big(\mathrm{rank}(\bm{u}^{(i)}) - n_i\big)$$

with $c(M) \le 0$. Here $c(M) = 0$ implies full rank and maximal leakage risk; increasingly negative values indicate greater intrinsic security due to widespread underdeterminedness (Chen et al., 2022).
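Given per-layer ranks and input dimensions, the index is a one-line weighted sum; the ranks and dimensions below are assumed toy values:

```python
def security_index(ranks, input_dims):
    """c(M) = sum_i (d-(i-1))/d * (rank(u_i) - n_i); 0 means maximal leakage risk."""
    d = len(ranks)
    return sum((d - i) / d * (r - n)
               for i, (r, n) in enumerate(zip(ranks, input_dims)))

# Full-rank layers: every layer is uniquely invertible, so c(M) = 0.
print(security_index([4, 8], [4, 8]))        # → 0.0
# A rank-deficient first layer: negative index, weighted by depth position.
print(security_index([2, 8], [4, 8]))        # → -2.0
```

Note the depth weighting $(d-(i-1))/d$: rank deficiency in early layers contributes more to security than the same deficiency late in the network.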

4. Relationship to Network Architecture

Analytic invertibility is structurally possible for fully-connected layers. Specifically, whenever the bias gradient $\partial L/\partial b_k^{(i)}$ is nonzero for some output unit $k$, all input components can be exactly inferred from the gradients:

$$x_\ell^{(i)} = \frac{\partial L/\partial w_{k\ell}^{(i)}}{\partial L/\partial b_k^{(i)}}$$

Convolutional layers, due to weight-sharing and architectural redundancy, produce highly rectangular systems with large null-spaces. Architectures with deep, high-dimensional early layers and few filters further exacerbate this rank deficiency, increasing effective security (i.e., reducing leakage) (Chen et al., 2022). Slimmer early layers and aggressive pooling yield higher per-layer ranks and increased vulnerability.
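This ratio identity follows from $\partial L/\partial \bm{w} = \bm{g}_z \bm{x}^\top$ and $\partial L/\partial \bm{b} = \bm{g}_z$, and can be checked numerically on a toy fully-connected layer (all values below are assumed for the demo):

```python
import numpy as np

rng = np.random.default_rng(1)
m, n = 5, 3
x = rng.normal(size=n)                 # layer input to be reconstructed
g_z = rng.normal(size=m)               # pre-activation gradient dL/dz
dW = np.outer(g_z, x)                  # dL/dw = g_z x^T
db = g_z                               # dL/db = g_z

k = int(np.argmax(np.abs(db)))         # any row with nonzero bias gradient works
x_rec = dW[k] / db[k]                  # elementwise ratio recovers the input
print(np.allclose(x_rec, x))           # → True
```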

5. Linear Gradient Matching in Model Merging

Gradient matching underpins a unifying analytical framework for linearly merging models. The merged model $\bar{\theta}$ is parameterized as a linear combination of base and fine-tuned models,

$$\bar{\theta} = \theta_0 + \sum_{t=1}^T \alpha_t (\theta_t - \theta_0)$$

and the weights $\alpha_t$ are chosen to minimize the gradient mismatch. An explicit Taylor expansion connects the optimality conditions for joint training to the mismatch between merged and optimal parameters, revealing that minimizing gradient mismatch reduces error:

$$D(\alpha) = \Big\| \nabla L(\bar{\theta}) - \sum_{t=1}^T \alpha_t \nabla L_t(\theta_t) \Big\|^2$$

Curvature information (Hessian or Fisher) is leveraged to define “preconditioners,” yielding a closed-form merging rule:

$$\hat{\theta} = \theta_0 + \sum_{t=1}^T \alpha_t \left[ \bar{H}^{-1} H_{0+t} \right] (\theta_t - \theta_0)$$

where $H_{0+t}$ and $\bar{H}$ are combinations of prior and task curvatures. This scheme generalizes averaging, task arithmetic, and Fisher-averaging as special cases via curvature choices (Daheim et al., 2023).
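A hedged sketch of the closed-form rule with diagonal curvature: here `lam` plays the role of a prior precision and `fishers` are per-task diagonal Fisher estimates, with the particular combinations $H_{0+t} = \lambda + F_t$ and $\bar{H} = \lambda + \sum_t \alpha_t F_t$ assumed for illustration (the paper's exact combinations may differ). With zero curvature the rule collapses to task arithmetic:

```python
import numpy as np

def merge(theta0, thetas, alphas, fishers, lam=1.0):
    """theta_hat = theta0 + sum_t alpha_t * Hbar^{-1} H_{0+t} (theta_t - theta0),
    with assumed diagonal H_{0+t} = lam + F_t and Hbar = lam + sum_t alpha_t F_t."""
    H_bar = lam + sum(a * F for a, F in zip(alphas, fishers))
    delta = sum(a * (lam + F) * (th - theta0)
                for a, F, th in zip(alphas, fishers, thetas))
    return theta0 + delta / H_bar            # diagonal preconditioning = division

theta0 = np.zeros(3)
thetas = [np.array([1.0, 0.0, 0.0]), np.array([0.0, 2.0, 0.0])]
alphas = [1.0, 1.0]
flat = [np.zeros(3), np.zeros(3)]            # zero curvature -> task arithmetic
print(merge(theta0, thetas, alphas, flat))   # → [1. 2. 0.]
```

Because the curvatures are diagonal, the preconditioner reduces to elementwise division, which is why diagonal Fisher estimates make the rule cheap enough for large models.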

6. Empirical Observations and Limitations

Empirically, reducing gradient mismatch strongly correlates with improved model performance in vision and language tasks. Uncertainty-based gradient matching consistently delivers gains in accuracy and robustness to hyperparameter variations, outperforming arithmetic-mean and task-arithmetic merging, especially when merging vision transformers or LLMs. This advantage persists even in data-removal and debiasing scenarios.

Linear merging is most successful when constituent models are mode-connected (i.e., interpolable in parameter space) and local curvature approximations are valid. Failure modes include incompatibility in representations or highly non-Gaussian, non-smooth curvature, where Taylor and Laplace approximations become invalid. The choice and estimation of curvature via diagonal Fisher matrices are critical for practical application (Daheim et al., 2023).

7. Synthesis and Theoretical Unification

Linear gradient matching provides a principled, theoretically grounded connection between the analytic solvability of gradient-based data or parameter reconstruction and both the architectural properties of deep networks and model aggregation protocols. The core insight is that, whether reconstructing training data or merging task-specialized models, the alignment and rank properties of linear gradient systems dictate information recoverability and the efficacy of linear operations. The explicit use of gradient-matching optimization, fortified by architectural or Hessian-derived regularization, unifies and enhances the analytic approaches to both privacy attacks and distributed model merging (Chen et al., 2022, Daheim et al., 2023).
