Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 175 tok/s
Gemini 2.5 Pro 54 tok/s Pro
GPT-5 Medium 38 tok/s Pro
GPT-5 High 37 tok/s Pro
GPT-4o 108 tok/s Pro
Kimi K2 180 tok/s Pro
GPT OSS 120B 447 tok/s Pro
Claude Sonnet 4.5 36 tok/s Pro
2000 character limit reached

SurgiATM: Surgical Smoke Removal Technique

Updated 15 November 2025
  • SurgiATM is a physics-guided module that combines atmospheric scattering models with deep neural architectures for effective surgical smoke removal.
  • It leverages the dark channel prior and a mixture-of-experts method to improve accuracy, stability, and generalizability without adding extra trainable parameters.
  • The approach uses only two hyperparameters and minimal computational overhead, enabling plug-and-play integration for real-time intraoperative applications.

The Surgical Atmospheric Model (SurgiATM) is a physics-guided, parameter-efficient module for surgical smoke removal that bridges the strengths of physics-based scattering models and deep neural architectures. Designed as a lightweight, plug-and-play wrapper, SurgiATM enhances the accuracy, stability, and generalizability of existing deep learning–based desmoking methods without introducing extra trainable weights or significant computational overhead. SurgiATM leverages the dark channel prior from classic dehazing, reframes model outputs via a statistically principled mixture-of-experts approach, and operates with only two hyperparameters, thereby facilitating real-time intraoperative deployment and broad compatibility across network architectures and surgical procedures (Sheng et al., 7 Nov 2025).

1. Theoretical Foundation: Physics-Based Atmospheric Scattering

SurgiATM is grounded in the classic Atmospheric Scattering Model (ASM), which analytically characterizes the formation of smoky or hazy images as a convex combination of direct scene radiance attenuated by the medium and an additive atmospheric veil. The image I(x,c)I(\mathbf{x},c) at pixel x\mathbf{x} and color channel cc is modeled as:

I(x,c)=J(x,c)t(x)+A(c)(1t(x))I(\mathbf{x},c) = J(\mathbf{x},c) \cdot t(\mathbf{x}) + A(c) \cdot (1 - t(\mathbf{x}))

where J(x,c)J(\mathbf{x},c) is true scene radiance, t(x)=exp(β(λ)d(x))t(\mathbf{x}) = \exp(-\beta(\lambda) \cdot d(\mathbf{x})) is the transmission determined by scattering coefficient β(λ)\beta(\lambda) and scene depth d(x)d(\mathbf{x}), and A(c)A(c) is the smoke veil intensity. The normalized radiance ρ(x,c)\rho(\mathbf{x},c) is such that J(x,c)=A(c)ρ(x,c)J(\mathbf{x},c) = A(c) \cdot \rho(\mathbf{x},c).

To estimate t(x)t(\mathbf{x}), SurgiATM employs the Dark Channel Prior (DCP):

D(x)=minc{R,G,B}[minuΩ(x;z)I(u,c)A(c)]D(\mathbf{x}) = \min_{c \in \{R,G,B\}} \Big[ \min_{\mathbf{u} \in \Omega(\mathbf{x};z)} \frac{I(\mathbf{u},c)}{A(c)} \Big]

with t(x)=1D(x)t(\mathbf{x}) = 1 - D(\mathbf{x}), and scene restoration via:

JDCP(x,c)=I(x,c)A(c)1D(x)+A(c)J_{DCP}(\mathbf{x},c) = \frac{I(\mathbf{x},c) - A(c)}{1 - D(\mathbf{x})} + A(c)

2. Mixture-of-Experts: Statistical Bridge Between Physics and Deep Learning

Rather than relying exclusively on the DCP or a deep learning (DNN) prediction JDNN(x,c)J_{DNN}(\mathbf{x},c), SurgiATM treats both as "experts" in a mixture-of-experts (MoE) framework. Each predictor's error (denoted ϵDCP\epsilon_{DCP}, ϵDNN\epsilon_{DNN}) is modeled as a zero-mean Laplacian, conditioned on D(x)D(\mathbf{x}). An optimal pixelwise gate W(x;D)W^*(\mathbf{x}; D) is derived by minimizing the variance and squared bias of the mixture error:

JMoE(x,c)=W(x)JDCP(x,c)+(1W(x))JDNN(x,c)J_{MoE}(\mathbf{x}, c) = W(\mathbf{x}) J_{DCP}(\mathbf{x}, c) + (1 - W(\mathbf{x})) J_{DNN}(\mathbf{x}, c)

with the closed-form optimal gate:

W(x;D)=2b22μ2(μ1μ2)2(b12+b22)+(μ1μ2)2W^*(\mathbf{x};D) = \frac{2 b_2^2 - \mu_2 (\mu_1 - \mu_2)}{2(b_1^2 + b_2^2) + (\mu_1 - \mu_2)^2}

where μi\mu_i, bib_i are mean and scale parameters for DCP and DNN, respectively.

Empirically, W(x)W^*(\mathbf{x}) is strongly anti-correlated with D(x)D(\mathbf{x}) across architectures and datasets, motivating the practical approximation:

W^(x)=1D(x)\widehat{W}(\mathbf{x}) = 1 - D(\mathbf{x})

This removes the need for explicit veil estimation and simplifies computation.

3. Restoration Formula and Plug-in Integration

Substituting the approximated gate and leveraging JDNN=A(c)ρDNNJ_{DNN} = A(c) \rho_{DNN} yields the SurgiATM restoration formula:

JSurgiATM(x,c)=I(x,c)A(c)D(x)+D(x)JDNN(x,c)J_{SurgiATM}(\mathbf{x},c) = I(\mathbf{x},c) - A(c) D(\mathbf{x}) + D(\mathbf{x}) J_{DNN}(\mathbf{x},c)

which can be re-expressed as:

JSurgiATM(x,c)=I(x,c)D(x)A(c)[1ρDNN(x,c)]J_{SurgiATM}(\mathbf{x},c) = I(\mathbf{x},c) - D(\mathbf{x}) A(c) [1 - \rho_{DNN}(\mathbf{x},c)]

Defining the "denormalized dark channel" D(x)=D(x)A(c)=minuΩ(x;z)mincI(u,c)\mathcal{D}(\mathbf{x}) = D(\mathbf{x}) A(c) = \min_{\mathbf{u} \in \Omega(\mathbf{x};z)} \min_c I(\mathbf{u},c), computable by a single min-filter operation, the update becomes:

JSurgiATM(x,c)=I(x,c)D(x)[1ρDNN(x,c)]J_{SurgiATM}(\mathbf{x},c) = I(\mathbf{x},c) - \mathcal{D}(\mathbf{x}) [1 - \rho_{DNN}(\mathbf{x},c)]

In practice, given a pretrained or trainable desmoking network Φθ(I;Z)\Phi_\theta(I; Z) that estimates ρDNN\rho_{DNN} (typically via sigmoid activation), SurgiATM is appended as a post-processing arithmetic layer. The loss function becomes:

L(J,J^)=L(J(x,c),I(x,c)D(x)[1ρDNN(x,c)])\mathcal{L}(J, \hat{J}) = \mathcal{L}( J(\mathbf{x},c), I(\mathbf{x},c) - \mathcal{D}(\mathbf{x}) [1 - \rho_{DNN}(\mathbf{x},c)] )

No extra trainable weights or modifications to network topology are introduced, aside from the minimal addition of D(x)\mathcal{D}(\mathbf{x}) computation and a pointwise multiply-subtract operation. This enables negligible latency overhead even in real-time clinical settings.

4. Hyperparameters: Window Size and Smoothing

SurgiATM introduces two hyperparameters:

  • Window size zz: Governs the local patch size when computing D(x)\mathcal{D}(\mathbf{x}) or D(x)D(\mathbf{x}). The default is z=15z=15 (as in He et al.'s DCP). Plausibly, larger windows may better smooth over noise or small variations for some architectures, while smaller windows can enhance localization of smoke for others.
  • Smoothing factor η\eta: Prevents zero gradients through redefinition:

D^(x)=D(x)+η1+η\hat{\mathcal{D}}(\mathbf{x}) = \frac{\mathcal{D}(\mathbf{x}) + \eta}{1 + \eta}

As η\eta \to \infty, D^1\hat{\mathcal{D}} \to 1 and the restoration formula reduces to a residual network. Empirical ablation over η{0,0.1,1,5,10}\eta \in \{0, 0.1, 1, 5, 10\} on VASST-desmoke indicates that η=0.1\eta = 0.1 yields the lowest RMSE.

5. Experimental Validation and Impact

SurgiATM's efficacy is demonstrated across three public surgical datasets:

  • Cholec80: Synthetic smoke blended with clean frames from laparoscopic cholecystectomy ($500$ frames/video).
  • VASST-desmoke: Real smoke, covering cholecystectomy, partial nephrectomy, and diaphragm dissection.
  • Hamlyn: External, unpaired test set with diverse lighting and procedure conditions.

Ten baseline desmoking methods encompassing CNNs, U-Nets, cGANs, CycleGANs, GANs with perceptual/structural losses (e.g., SSIM-PAN), and Transformer-based models were evaluated. Training used five-fold cross-validation with Adam optimizer (learning rate 2×1042\times 10^{-4}), for 50 epochs per split.

Quantitative results on VASST-desmoke show systematic improvements when applying SurgiATM as a postprocessing wrapper:

Method CIEDE2000↓ PSNR↑ RMSE↓ SSIM↑
AODNet 10.35→6.33 17.38→21.99 0.147→0.090 0.693→0.789
GCANet 7.89→5.37 21.09→23.91 0.095→0.073 0.729→0.806
De-smokeGCN 9.58→7.35 19.78→22.13 0.115→0.094 0.550→0.717
RSTN 5.39→4.98 23.26→24.31 0.074→0.071 0.750→0.824

Performance improvements hold across synthetic-to-real (Cholec80→VASST) transfer, with CIEDE2000 reduced by 121{\text{–}}2 units and PSNR increased by 232{\text{–}}3 dB for all architectures, indicating enhanced generalizability.

Non-reference dehazing metrics (FADE) improve on Hamlyn, while BRISQUE/NIQE show mixed trends, plausibly influenced by domain gap with natural images.

6. Qualitative Observations and Operational Considerations

SurgiATM yields tangible improvements in stability and perceptual quality: grid artifacts in U-Net and Transformer models are suppressed, color casts in CNN architectures are corrected, and GAN-based outputs evidence less residual smoke. The generalizability of performance gains across both synthetic and real smoke domains, procedures, and institutions is empirically supported.

The min-filter and elementwise operations required by SurgiATM introduce negligible GPU latency, thus supporting deployment in time-sensitive, intraoperative applications. No architectural modifications to existing backbone networks are necessary.

7. Limitations and Future Directions

SurgiATM's gating mechanism W^(x)=1D(x)\widehat{W}(\mathbf{x}) = 1 - D(\mathbf{x}) constitutes a global approximation. Method-specific or learned gating functions—provided they avoid significant increases in parameter count or computational demand—may further improve restoration accuracy. The min-filter employed to estimate D(x)\mathcal{D}(\mathbf{x}) is inherently coarse; incorporating finer spatial priors (e.g., via guided filtering or learned spatial attention) could yield more accurate smoke localization and tighter coupling with DNN reflectance predictions.

Clinical evaluation in live surgical workflows and integration of temporal consistency for video remain open research areas.

SurgiATM establishes a generalizable, computationally lightweight approach for surgical smoke removal, systematically reducing restoration error (e.g., PSNR\uparrow4–6 dB, RMSE\downarrow30–40%, SSIM\uparrow0.05–0.10) while being compatible with diverse deep learning desmoking paradigms and surgical imaging conditions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Surgical Atmospheric Model (SurgiATM).