Selective Denoising Loss (SDDLM)
- Selective Denoising Loss (SDDLM) is a framework for low-rank matrix estimation that uses weighted loss functions to prioritize denoising accuracy in user-specified submatrices and features.
- It employs optimal spectral denoising within the spiked-model setting and explicit weight constructions to achieve focused signal recovery under heteroscedastic noise and missing data scenarios.
- Empirical results show that SDDLM consistently outperforms unweighted shrinkage methods across applications like image restoration, synthetic datasets, and matrix completion.
Selective Denoising Loss (SDDLM) is a framework for low-rank matrix estimation with prioritized accuracy in user-specified regions, submatrices, or features. Formulated to address denoising in contexts with submatrix interest, heteroscedastic noise, and missing data, SDDLM leverages weighted loss functions and optimal spectral denoising in the canonical spiked-model setting to enable regionally selective signal recovery. The framework provides explicit constructions for weights, closed-form optimal estimators, localization algorithms, and empirical validation of gains over unweighted shrinkage methods (Leeb, 2019).
1. Weighted Loss Formulation in the Spiked-Model Setting
The SDDLM approach operates within the standard spiked-model, observing
where is low-rank ($\rank(X) = r$), and is a noise matrix (typically Gaussian or whitened-Gaussian).
A nonnegative weight matrix encodes selective emphasis on entries of . The weighted Frobenius-norm loss between an estimate and the ground truth is
where denotes entrywise multiplication.
Three canonical instantiations for 0 are:
- Submatrix denoising: 1 for 2 in the region of interest 3, 0 otherwise.
- Heteroscedastic noise: 4 when 5.
- Missing data (matrix completion): 6 if unobserved; for observed entries with sampling probability 7, commonly 8 is used.
Frequently, SDDLM uses a pair-of-weights formalism, factoring 9 with $\rank(X) = r$0 and $\rank(X) = r$1 diagonal. This enables representing the loss as
$\rank(X) = r$2
2. Optimal Spectral Denoiser with Selective Weights
SDDLM restricts denoising estimators to the spectral family: $\rank(X) = r$3
Under spiked asymptotics ($\rank(X) = r$4), optimal choices for $\rank(X) = r$5 can be derived. Critical limits are the (possibly weighted) inner products of empirical singular vectors $\rank(X) = r$6 of $\rank(X) = r$7 with their population counterparts $\rank(X) = r$8: $\rank(X) = r$9 with analogous definitions for the right singular vectors.
Define the matrices
0
The optimal estimator is given by
1
where 2 is the Moore–Penrose pseudoinverse of 3, and hence
4
In the case of diagonal 5 and 6 ("weighted-orthogonality"), this yields explicit componentwise shrinkage: 7 with
- 8: row/column-weight projections of population singular vectors,
- 9,
- 0: unweighted cosines and sines.
The proof proceeds by expanding the weighted error in singular subspaces, employing spiked-model asymptotics for all cross-terms, and minimizing the quadratic form in 1 (Leeb, 2019).
3. Localized Denoising via Combinations of Selective Weights
SDDLM supports localized denoising through orthogonal projections partitioning the 2-index space. Consider projections 3 (row) and 4 (column), decomposing the identity: 5, 6.
The localized denoiser is constructed as follows:
1
Decomposing the error into region-wise blocks, SDDLM denoises each block optimally for its local weight and aggregates the resulting estimates. This approach yields total unweighted error no larger than global singular-value shrinkage, with strict improvement when singular vectors are heterogeneously distributed.
4. SDDLM as Weight Selection for Targeted Denoising
The term "Selective Denoising Loss (SDDLM)" refers to the construction of 7 for user-specified prioritization:
- Large 8: high-priority denoising for entry 9.
- Small or zero 0: deprioritized or ignored entries.
Examples recapitulated:
- Submatrix focus: 1 equals the submatrix indicator.
- Heteroscedastic whitening: 2.
- Missing data: 3 for unobserved, 4 for observed.
A global "temperature" or focus parameter 5 is sometimes introduced: 6 Tuning 7 allows balancing fidelity within versus outside the focus region. Cross-validation on a held-out validation set of entries can be employed for parameter selection.
5. Empirical Performance and Applicability
Empirical studies demonstrate that SDDLM confers substantial improvements relative to unweighted singular-value shrinkage:
| Application | Standard method rel. error | SDDLM/localized rel. error | Notes |
|---|---|---|---|
| MIT-logo image (with corners focused) | ~0.125 | ~0.074 | 15x30 grid, heterogeneity in corners |
| Checkerboard synthetic (rank-2) | ~0.19 | ~0.14 | 70% energy in "light" squares (800x800) |
| Submatrix denoising (rank 1) | - | Improved | SDDLM outperforms global and submatrix-only cases |
| Doubly-heteroscedastic noise | - | Up to 30% better | Outperforms OptShrink for condition number κ=16 |
| Missing–values case | - | Improved | Outperforms nuclear-norm completion (200x400) |
In all scenarios, SDDLM does not underperform unweighted shrinkage in asymptotic regimes and, even with additional parameter overhead, more than compensates with focused accuracy in practical sample sizes.
6. Implementation and Theoretical Guarantees
The core algorithms and explicit formulas are detailed in (Leeb, 2019), with:
- Algorithm 1: OptimalSpectralDenoiser (for general 8 or pairs 9).
- Algorithm 2: Localized denoising (for decompositions into projection blocks).
- Application-specific derivations for submatrix, heteroscedastic, and missing data settings (see Section 6).
- Asymptotic theory and proofs in the Appendix.
The theoretical foundation ensures that SDDLM asymptotically achieves the optimal selective tradeoff prescribed by 0, yielding nontrivial improvements in heterogeneous or region-focused denoising scenarios. All steps are justified by rigorous spiked-model asymptotics and random matrix theory, with turnkey algorithms provided for direct research implementation (Leeb, 2019).