Plug-and-Play Bregman Proximal Gradient
- The paper demonstrates that PnP–BPGM replace quadratic norms with Bregman divergences, yielding explicit convergence and efficient algorithms for inverse problems.
- The method integrates learned denoisers tuned to the problem geometry, achieving robust noise and blur removal in Poisson deblurring tasks.
- The framework generalizes traditional plug‐and‐play methods, combining rigorous theory with strong empirical performance validated by competitive PSNR metrics.
Plug-and-Play Bregman Proximal Gradient Methods (PnP–BPGM) constitute a class of optimization algorithms designed for inverse problems with non-Euclidean data-fidelity terms, particularly those arising in Poisson and other non-Gaussian noise models. By systematically replacing the quadratic norm in traditional Proximal Gradient Methods with a Bregman divergence adapted to the problem geometry, and integrating learned denoisers parametrized to this geometry, PnP–BPGM generalizes plug-and-play frameworks for a broader class of signal restoration settings. The approach is grounded in convex and nonconvex optimization theory and features explicit convergence guarantees, efficient algorithms, and strong empirical performance, especially for Poisson inverse problems (Hurault et al., 2023, Al-Shabili et al., 2022).
1. Mathematical Foundations and Bregman Geometry
The Bregman divergence , defined for a strictly convex, differentiable function , captures non-Euclidean geometry intrinsic to many inverse problems. For -strongly convex with -Lipschitz , the divergence is
admitting lower and upper quadratic bounds: If , specializes to the standard Euclidean norm. For Poisson noise and non-negativity constraints (0), Burg entropy 1 yields 2, closely tracking the local structure of the Kullback–Leibler data fidelity (Hurault et al., 2023, Al-Shabili et al., 2022).
2. Problem Formulation and Classical Methods
Inverse problems are formulated as minimizations of composite functionals,
3
where 4 encodes the data-fidelity term—often a negative log-likelihood such as, for Poisson statistics,
5
with 6—and 7 imposes a (possibly nonconvex) regularizer.
Classical Proximal Gradient Methods (PGM) assume 8 is 9-smooth (Lipschitz gradient), leading to a majorize–minimize update,
0
In the Plug-and-Play (PnP) paradigm, proximal operators are replaced by powerful Gaussian denoisers, yielding iterates 1 with 2. However, this Euclidean setting is ill-suited for non-Gaussian models or settings lacking efficient proximal maps (Al-Shabili et al., 2022).
3. Bregman Proximal Gradient and Plug-and-Play Integration
The Bregman Proximal Gradient Method (BPGM) generalizes PGM by using 3-adapted geometry. Assuming 4 is 5-smooth relative to 6 (7 convex), the update is
8
This decomposes into a mirror-descent step 9, followed by a Bregman proximal step 0.
Plug-and-Play BPGM (PnP–BPGM) replaces the Bregman proximal operation with a learned denoiser 1 tuned to the geometry,
2
For Poisson models, the denoiser is parameterized to respect the Bregman noise model: 3, with 4 a learnable potential, ensuring 5 approximates the MMSE estimator under Bregman noise (Hurault et al., 2023).
4. Algorithmic Framework and Variants
Two principal algorithms are established:
- Bregman RED (B-RED): Employs a Bregman descent with backtracking and explicit update, 2
- Bregman PnP (B-PnP): Fixed-step implementation with plug-and-play denoising, 3 B-RED allows adaptive step-size adjustment; B-PnP enforces 6 for fixed-τ convergence (Hurault et al., 2023). Both leverage Burg entropy for Poisson problems but can accommodate other choices of 7 for different noise or constraints (Al-Shabili et al., 2022).
5. Theoretical Analysis: Convergence Guarantees
Convergence is established under the following:
- 8 is Legendre, 9, strongly convex;
- 0 is semialgebraic, proper, coercive, and 1 on 2;
- The “NoLip” property: 3 convex;
- 4 and 5 Lipschitz on sublevel sets;
- Learnable potentials 6, 7 lower bounded, semialgebraic.
Invoking the Kurdyka–Łojasiewicz property, the following is shown:
- The sequence 8 is nonincreasing and converges.
- Iterates 9 remain bounded in a compact sublevel set.
- 0, with 1.
- All accumulation points are stationary for the explicit global functional,
- 2 for B-RED,
- 3 for B-PnP (Hurault et al., 2023).
In the strongly convex case, Lipschitz-averaged denoiser conditions yield global contraction, and step-size bounds can be explicitly characterized (Al-Shabili et al., 2022).
6. Implementation and Empirical Evaluation
When specialized to Poisson deblurring, the forward model 4 is paired with Burg entropy, yielding closed-form mirror steps. For practical denoising, DnCNN-type architectures are trained according to Bregman noise and integrated as 5 (Hurault et al., 2023, Al-Shabili et al., 2022).
Experimental evaluation on CBSD68 (color images) with four blur kernels (real camera shakes, uniform 6, Gaussian 7) demonstrates:
| Noise Level (α) | B-RED PSNR (dB) | B-PnP PSNR (dB) |
|---|---|---|
| 20 | 23.58 | 23.29 |
| 40 | 24.54 | 24.54 |
| 60 | 24.90 | 24.80 |
Both methods exhibit monotonic decrease of 8 and convergence rate 9, achieving effective noise and blur removal without visual artifacts. In comparative studies, PnP–BPGM outperforms Euclidean PnP and direct U-Net baselines for Poisson denoising (Hurault et al., 2023, Al-Shabili et al., 2022).
7. Broader Significance and Research Directions
PnP–BPGM generalizes plug-and-play inverse problem solvers beyond the Euclidean setting, adapting the optimization geometry to the statistical structure of the data-fidelity term and the implied signal constraints. This flexibility is critical for Poisson models, nonnegativity, simplex constraints, and other scenarios where the standard quadratic proximal geometry is poorly matched. Theoretical results cover both strongly convex and certain nonconvex settings, but extensions to fully general convex/nonconvex cases, automated selection or learning of 0, and integration with more complex noise or signal geometries are active research topics. Limitations include the need for tractable forms of 1 and strong convexity assumptions in current theory (Al-Shabili et al., 2022).
PnP–BPGM thus represents a principled framework for optimizing and learning in non-Euclidean statistical inverse problems, with rigorous theoretical guarantees and practical efficacy validated in Poisson linear imaging and beyond (Hurault et al., 2023, Al-Shabili et al., 2022).