Grad-CAM: Visual Explanations in CNNs

Updated 1 December 2025

Grad-CAM is a gradient-based localization technique that generates coarse, class-specific heatmaps to highlight key regions influencing CNN decisions.
It computes feature importance by spatially averaging gradients over convolutional layers, ensuring model-agnostic applicability across diverse neural network architectures.
Extensions like Grad-CAM++, Smooth Grad-CAM++, and Winsor-CAM enhance resolution, robustness, and interpretability, supporting high-stakes applications in fields such as medical imaging and object detection.

Gradient-weighted Class Activation Mapping (Grad-CAM) is a class-discriminative, gradient-based localization technique for visualizing the regions of a given input that drive deep neural network (DNN) decisions. Originally motivated by the limitations of both pixel-space sensitivity analyses and model architectures restricted to global average pooling, Grad-CAM generalizes class activation mapping to arbitrary convolutional neural network (CNN) structures and outputs, leveraging the gradient signal flowing into any chosen convolutional layer to provide coarse but class-specific localization maps.

1. Mathematical Basis and Derivation

Let $y^c$ denote the pre-softmax score for class $c$ , and let $A^k \in \mathbb{R}^{u \times v}$ be the $k$ -th feature map from a convolutional layer of interest. Grad-CAM computes the importance of each feature map $k$ for class $c$ by spatially global average pooling the partial derivatives of $y^c$ with respect to $A^k$ :

$\alpha_k^c = \frac{1}{uv} \sum_{i=1}^{u} \sum_{j=1}^{v} \frac{\partial y^c}{\partial A^k_{ij}}$

Given these weights, the class activation map is then

$L^c_{\mathrm{Grad\text{-}CAM}}(i,j) = \mathrm{ReLU} \Bigl( \sum_k \alpha_k^c A^k_{ij} \Bigr )$

This produces a low-resolution heatmap indicating supportive evidence for class $c$ . The map is usually upsampled (e.g., bilinear interpolation) to match input resolution for visualization and interpretation. Use of the ReLU removes locations with net negative influence on $y^c$ .

Variants such as Grad-CAM++ introduce higher-order weighting coefficients at the spatial level, but practical studies show that, for most standard architectures, gradients need only be positively thresholded to achieve near-identical results to these second-order corrections, leading to the so-called “Grad-CAM⁺” formulation (Lerma et al., 2022).

2. Algorithmic Formulation and Implementation Details

The standard Grad-CAM pipeline is as follows (Selvaraju et al., 2016, Selvaraju et al., 2016):

Forward pass: Input $x$ is forwarded to obtain activations $\{A^k\}$ and class scores $\{y^c\}$ .
Backward pass: All gradients except $\partial y^c/\partial y^c$ are set to zero. Backpropagate to obtain $\partial y^c/\partial A^k_{ij}$ .
Global average pooling: Compute $\alpha_k^c$ via spatial averaging.
Map computation: Form the coarse map $L^c_{\mathrm{Grad\text{-}CAM}}$ using a weighted combination of $A^k$ and apply ReLU.
Upsampling: Rescale the map to input size for display.
(Optional) Enhancement: Multiply with guided backpropagation for a high-resolution, class-discriminative Guided Grad-CAM.

This algorithm is differentiable and requires only one forward and one backward pass through the CNN. Grad-CAM is model-agnostic and functions with all architectures supporting differentiation through convolutional layers, including networks employing fully connected, recurrent, residual, or multimodal structures (Selvaraju et al., 2016, Selvaraju et al., 2016, Yang et al., 2019).

A variety of extensions and enhancements to Grad-CAM have been proposed to address its resolution, faithfulness, and interpretability:

Grad-CAM++: Incorporates location-sensitive second- and third-order gradient terms ( $\alpha_{ij}^{kc}$ ), but analysis demonstrates these coefficients are close to constant for typical models, making Grad-CAM with positive-gradient pooling sufficient (Lerma et al., 2022).
Smooth Grad-CAM++: Averages higher-order derivatives across multiple Gaussian-perturbed copies of the input, yielding sharper, less noisy maps with improved object coverage, particularly in images with multiple instances (Omeiza et al., 2019).
FD-CAM: Combines gradient-based weights with grouped channel-switching derived from activation-similarity and perturbations to optimize both faithfulness (quantified by insertion/deletion AUC) and discriminability (pointing-game accuracy), outperforming both Grad-CAM and Ablation-CAM in comprehensive benchmarks (Li et al., 2022).
SEG-GRAD-CAM: Extends Grad-CAM to semantic segmentation by locally aggregating gradients for arbitrary pixel sets (instances, classes, or regions), facilitating spatially resolved attribution across networks producing dense output (Vinogradova et al., 2020).
Axiom-based (XGrad-CAM): Formalizes attribution by enforcing sensitivity (the change in output from ablation should match the attribution) and conservation (attributions sum to the original score), leading to an activation-normalized weighting that substantially reduces axiom violation relative to vanilla Grad-CAM (Fu et al., 2020).
Layer Aggregation (Winsor-CAM, Integrative CAM): Overcomes loss of information at deep layers by aggregating class activation maps across all convolutional layers, using layer-importance scoring, winsorization to control outlier weights, and bias correction, resulting in saliency maps with higher spatial fidelity and semantic tunability (Wall et al., 14 Jul 2025, Singh et al., 2 Dec 2024).

A comparison of representative variants is summarized in the following table:

Variant	Channel Weighting	Notable Modification
Grad-CAM	avg. gradient	final conv layer, ReLU postcombination
Grad-CAM++	spatial, higher-order	location-aware $\alpha_{ij}^{kc}$
Smooth Grad-CAM++	smoothed gradients	average derivatives over noisy copies
XGrad-CAM	activation-normalized	weights enforce sensitivity/conservation
FD-CAM	gradient + score-based	grouped channel on/off perturbation
Winsor-CAM	layer-wise, scored/trimmed	multi-layer aggregation, winsorization

4. Practical Applications and Case Studies

Grad-CAM and its derivatives are deployed in a spectrum of domains:

Image classification and captioning: Grad-CAM explanations highlight objects and scene regions supporting class labels or words, aiding debugging, fairness, and model trust (Selvaraju et al., 2016).
Facial attribute classification: User-guided Grad-CAM allows manual ROI specification (e.g., mouth for "Wearing Lipstick"), supporting domain-informed fine-tuning that mitigates co-occurrence bias and improves transfer performance (Yang et al., 2019).
Medical imaging: Grad-CAM maps over MRI slices provide clinician-interpretable explanations for dementia stage prediction, with overlayed maps indicating anatomical region focus corresponding to clinical knowledge (Yuen, 20 Aug 2024).
Meteorological event detection: Hybrid Grad-CAM schemes, integrating deep-layer focusing with shallow-layer precision, yield subpixel object localization in meteor frames, reducing manual postprocessing (Peña-Asensio et al., 2023).
Time-series interpretation: Grad-CAM over temporal CNNs reveals which trajectory segments are crucial for mechanism classification in anomalous diffusion, enabling targeted augmentation for robustness (Bae et al., 21 Oct 2024).
Semantic segmentation: Region-wise Grad-CAM offers pixel/instance-specific explanations for dense prediction models (Vinogradova et al., 2020).
Embedding networks: Aggregated multi-triplet Grad-CAM establishes stable and interpretable region attribution for triplet-loss–based recognition and metric learning tasks (Chen et al., 2020).

5. Quantitative Evaluation, Metrics, and Limitations

Evaluation of Grad-CAM methods relies on several metrics:

Localization (IoU, Dice, Pointing-Game): Measure overlap or accuracy of predicted regions versus ground truth (Wall et al., 14 Jul 2025, Li et al., 2022, Fahim et al., 2022).
Faithfulness (Insertion/Deletion AUC): Quantify probability change as salient/context regions are masked/revealed (Li et al., 2022, Fahim et al., 2022).
Perturbation Sensitivity: Drop in class score upon removal of the most salient pixels (Fu et al., 2020).
Similarity under adversarial attack: MOD/VID metrics (mean observed/variation in dissimilarity) track Grad-CAM attention shift under adversarial perturbations (Chakraborty et al., 2022).

Noted limitations and caveats include: intrinsic resolution bounds due to spatial size of chosen convolutional layers, potential gradient saturation issues, possible misleading focus under adversarial or spurious-correlation conditions (e.g., passive fooling (Piland et al., 10 Jun 2025)), and absence of negative attributions unless specifically extracted.

6. Recent Developments and Ongoing Directions

Recent literature has focused on overcoming both theoretical weaknesses and practical shortcomings of original Grad-CAM:

Robustness to adversarial manipulation: Contrastive formulations such as DiffGradCAM address alignment of explanations with the true softmax decision boundary, resisting entropy-aware passive fooling attacks and stabilizing heatmaps without accuracy tradeoff (Piland et al., 10 Jun 2025).
Multi-layer, bias-corrected fusion: Integrative CAM and Winsor-CAM synthesize activation, gradient, and bias terms across CNN hierarchies to provide comprehensive multi-scale attributions, including semantic-depth tunability (Wall et al., 14 Jul 2025, Singh et al., 2 Dec 2024).
Layer-wise and Riemann-Stieltjes integration: Techniques such as RSI-Grad-CAM numerically integrate gradient fields along activation paths at arbitrary depths, improving numerical stability (especially in saturated or deep layers) and offering sharper, more reliable localization (Lucas et al., 2022).
Axiomatic and perturbative approaches: Explicit satisfaction of faithfulness and conservation axioms, as well as guidance maps that preserve local gradient information, advance the theoretical basis and empirical selectivity of CAM-based visualizations (Fu et al., 2020, Fahim et al., 2022).

These directions collectively enhance the fidelity, discriminability, and interpretability of visual explanations for DNNs, extending Grad-CAM far beyond its original formulation and supporting deployment in critical, high-stakes applications.