Papers
Topics
Authors
Recent
Search
2000 character limit reached

Plug-and-Play Bregman Proximal Gradient

Updated 28 April 2026
  • The paper demonstrates that PnP–BPGM replace quadratic norms with Bregman divergences, yielding explicit convergence and efficient algorithms for inverse problems.
  • The method integrates learned denoisers tuned to the problem geometry, achieving robust noise and blur removal in Poisson deblurring tasks.
  • The framework generalizes traditional plug‐and‐play methods, combining rigorous theory with strong empirical performance validated by competitive PSNR metrics.

Plug-and-Play Bregman Proximal Gradient Methods (PnP–BPGM) constitute a class of optimization algorithms designed for inverse problems with non-Euclidean data-fidelity terms, particularly those arising in Poisson and other non-Gaussian noise models. By systematically replacing the quadratic norm in traditional Proximal Gradient Methods with a Bregman divergence adapted to the problem geometry, and integrating learned denoisers parametrized to this geometry, PnP–BPGM generalizes plug-and-play frameworks for a broader class of signal restoration settings. The approach is grounded in convex and nonconvex optimization theory and features explicit convergence guarantees, efficient algorithms, and strong empirical performance, especially for Poisson inverse problems (Hurault et al., 2023, Al-Shabili et al., 2022).

1. Mathematical Foundations and Bregman Geometry

The Bregman divergence Dh(x,y)D_h(x, y), defined for a strictly convex, differentiable function h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}, captures non-Euclidean geometry intrinsic to many inverse problems. For hh μ\mu-strongly convex with LhL_h-Lipschitz h\nabla h, the divergence is

Dh(x,y)=h(x)h(y)h(y),xy,x,ydomh,D_h(x, y) = h(x) - h(y) - \langle \nabla h(y),\, x-y \rangle, \quad x, y \in \operatorname{dom} h,

admitting lower and upper quadratic bounds: μh2xy2Dh(x,y)Lh2xy2.\frac{\mu_h}{2}\|x-y\|^2 \le D_h(x, y) \le \frac{L_h}{2}\|x-y\|^2. If h(x)=12x2h(x) = \frac12\|x\|^2, DhD_h specializes to the standard Euclidean norm. For Poisson noise and non-negativity constraints (h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}0), Burg entropy h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}1 yields h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}2, closely tracking the local structure of the Kullback–Leibler data fidelity (Hurault et al., 2023, Al-Shabili et al., 2022).

2. Problem Formulation and Classical Methods

Inverse problems are formulated as minimizations of composite functionals,

h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}3

where h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}4 encodes the data-fidelity term—often a negative log-likelihood such as, for Poisson statistics,

h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}5

with h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}6—and h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}7 imposes a (possibly nonconvex) regularizer.

Classical Proximal Gradient Methods (PGM) assume h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}8 is h:CRnRh: C \subseteq \mathbb{R}^n \to \mathbb{R}9-smooth (Lipschitz gradient), leading to a majorize–minimize update,

hh0

In the Plug-and-Play (PnP) paradigm, proximal operators are replaced by powerful Gaussian denoisers, yielding iterates hh1 with hh2. However, this Euclidean setting is ill-suited for non-Gaussian models or settings lacking efficient proximal maps (Al-Shabili et al., 2022).

3. Bregman Proximal Gradient and Plug-and-Play Integration

The Bregman Proximal Gradient Method (BPGM) generalizes PGM by using hh3-adapted geometry. Assuming hh4 is hh5-smooth relative to hh6 (hh7 convex), the update is

hh8

This decomposes into a mirror-descent step hh9, followed by a Bregman proximal step μ\mu0.

Plug-and-Play BPGM (PnP–BPGM) replaces the Bregman proximal operation with a learned denoiser μ\mu1 tuned to the geometry,

μ\mu2

For Poisson models, the denoiser is parameterized to respect the Bregman noise model: μ\mu3, with μ\mu4 a learnable potential, ensuring μ\mu5 approximates the MMSE estimator under Bregman noise (Hurault et al., 2023).

4. Algorithmic Framework and Variants

Two principal algorithms are established:

  • Bregman RED (B-RED): Employs a Bregman descent with backtracking and explicit update, Dh(x,y)=h(x)h(y)h(y),xy,x,ydomh,D_h(x, y) = h(x) - h(y) - \langle \nabla h(y),\, x-y \rangle, \quad x, y \in \operatorname{dom} h,2
  • Bregman PnP (B-PnP): Fixed-step implementation with plug-and-play denoising, Dh(x,y)=h(x)h(y)h(y),xy,x,ydomh,D_h(x, y) = h(x) - h(y) - \langle \nabla h(y),\, x-y \rangle, \quad x, y \in \operatorname{dom} h,3 B-RED allows adaptive step-size adjustment; B-PnP enforces μ\mu6 for fixed-τ convergence (Hurault et al., 2023). Both leverage Burg entropy for Poisson problems but can accommodate other choices of μ\mu7 for different noise or constraints (Al-Shabili et al., 2022).

5. Theoretical Analysis: Convergence Guarantees

Convergence is established under the following:

  • μ\mu8 is Legendre, μ\mu9, strongly convex;
  • LhL_h0 is semialgebraic, proper, coercive, and LhL_h1 on LhL_h2;
  • The “NoLip” property: LhL_h3 convex;
  • LhL_h4 and LhL_h5 Lipschitz on sublevel sets;
  • Learnable potentials LhL_h6, LhL_h7 lower bounded, semialgebraic.

Invoking the Kurdyka–Łojasiewicz property, the following is shown:

  • The sequence LhL_h8 is nonincreasing and converges.
  • Iterates LhL_h9 remain bounded in a compact sublevel set.
  • h\nabla h0, with h\nabla h1.
  • All accumulation points are stationary for the explicit global functional,

In the strongly convex case, Lipschitz-averaged denoiser conditions yield global contraction, and step-size bounds can be explicitly characterized (Al-Shabili et al., 2022).

6. Implementation and Empirical Evaluation

When specialized to Poisson deblurring, the forward model h\nabla h4 is paired with Burg entropy, yielding closed-form mirror steps. For practical denoising, DnCNN-type architectures are trained according to Bregman noise and integrated as h\nabla h5 (Hurault et al., 2023, Al-Shabili et al., 2022).

Experimental evaluation on CBSD68 (color images) with four blur kernels (real camera shakes, uniform h\nabla h6, Gaussian h\nabla h7) demonstrates:

Noise Level (α) B-RED PSNR (dB) B-PnP PSNR (dB)
20 23.58 23.29
40 24.54 24.54
60 24.90 24.80

Both methods exhibit monotonic decrease of h\nabla h8 and convergence rate h\nabla h9, achieving effective noise and blur removal without visual artifacts. In comparative studies, PnP–BPGM outperforms Euclidean PnP and direct U-Net baselines for Poisson denoising (Hurault et al., 2023, Al-Shabili et al., 2022).

7. Broader Significance and Research Directions

PnP–BPGM generalizes plug-and-play inverse problem solvers beyond the Euclidean setting, adapting the optimization geometry to the statistical structure of the data-fidelity term and the implied signal constraints. This flexibility is critical for Poisson models, nonnegativity, simplex constraints, and other scenarios where the standard quadratic proximal geometry is poorly matched. Theoretical results cover both strongly convex and certain nonconvex settings, but extensions to fully general convex/nonconvex cases, automated selection or learning of Dh(x,y)=h(x)h(y)h(y),xy,x,ydomh,D_h(x, y) = h(x) - h(y) - \langle \nabla h(y),\, x-y \rangle, \quad x, y \in \operatorname{dom} h,0, and integration with more complex noise or signal geometries are active research topics. Limitations include the need for tractable forms of Dh(x,y)=h(x)h(y)h(y),xy,x,ydomh,D_h(x, y) = h(x) - h(y) - \langle \nabla h(y),\, x-y \rangle, \quad x, y \in \operatorname{dom} h,1 and strong convexity assumptions in current theory (Al-Shabili et al., 2022).

PnP–BPGM thus represents a principled framework for optimizing and learning in non-Euclidean statistical inverse problems, with rigorous theoretical guarantees and practical efficacy validated in Poisson linear imaging and beyond (Hurault et al., 2023, Al-Shabili et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Plug-and-Play Bregman Proximal Gradient (PnP–BPGM).