MVQN: Multi-Valued Quantum Neurons
- MVQN are advanced neural models that encode multiple quantum states per neuron, enhancing representational capacity and nuanced decision-making.
- They leverage game-theoretic attribution methods, such as Shapley values, to rigorously quantify individual neuron contributions to accuracy, fairness, and robustness.
- MVQN facilitate effective pruning and targeted model repair, leading to improved efficiency, interpretability, and resilience in complex network architectures.
The Neuron Shapley approach is a class of game-theoretic attribution methods designed to quantify the importance of individual neurons, filters, or layers within deep neural networks. By leveraging the Shapley value from cooperative game theory, Neuron Shapley measures the marginal contribution of each network component to various performance metrics—such as accuracy, fairness, or robustness—while fully accounting for interactions and redundancy among components. This methodology supports principled neuron ranking, model interpretability, fairness diagnostics, adversarial vulnerability analysis, and targeted model repair across a range of architectures and tasks (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019, Li et al., 2019, Sun et al., 1 Apr 2026).
1. Formalization of the Neuron Shapley Value
Let denote the set of neurons (or filters) in a neural network. Define as a black-box performance metric (e.g., accuracy, loss, class recall, fairness) measured on the subnetwork retaining only neurons in , with all other neurons zeroed out. The Neuron Shapley value is then:
Alternatively, in permutation form:
where is the set of neurons preceding in permutation of . This framework treats each neuron as a player in a coalitional game, with the characteristic function 0 specifying the payout to each coalition (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019).
This setup extends to layers, filters, or even activation groups, depending on architectural granularity (Adamczewski et al., 2019, Sun et al., 1 Apr 2026).
2. Capturing High-Order Neuron Interactions
The Shapley framework captures all higher-order interactions, including synergy, redundancy, and conditional indispensability. Marginal contributions 1 are aggregated over all subsets 2 of other neurons, thus ensuring sensitivity to contexts where:
- Individual neurons are only valuable jointly (e.g., 3 while 4).
- Redundant or antagonistic behavior is appropriately distributed across interacting units.
- Pruning or reweighting can be done while respecting both unique and overlapping contributions.
Shapley’s axioms (null-player, symmetry, additivity/linearity) uniquely define this “fair” attribution and guarantee consistency in neuron ranking (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019).
3. Efficient Estimation Algorithms
Exact computation of Neuron Shapley values is exponential in 5. Key estimation methods include:
- Monte Carlo Permutation Sampling: Draw 6 random permutations of neurons. For each permutation, accumulate the marginal contribution of each neuron as it is “added” during the forward scan. Averaging over 7 samples gives an unbiased estimate. Typically 8–9 yields adequate accuracy for moderate 0 (Stier et al., 2019, Adamczewski et al., 2019).
- Truncated Multi-Armed Bandit (TMAB): Adaptively samples only the top-k candidate neurons by focusing computation where Shapley values are close to the k-th largest. Early truncates permutation trajectories when performance drops below a threshold 1, reducing the number of forward passes needed. Empirical Bernstein bounds guarantee error control to within 2 with probability 3 (Ghorbani et al., 2020).
- Deep SHAP Approximation: In large models or when tight approximations suffice, Deep SHAP linearizes layers around a background set and propagates attributions, yielding approximate neuron-level or layer-level Shapley values efficiently in a few backward passes (Sun et al., 1 Apr 2026).
Sample Pseudocode: TMAB-Shapley Algorithm (Ghorbani et al., 2020)
4. Empirical Properties and Diagnostic Applications
Model Destruction via Neuron Removal
- Removing the top 30 filters by 4 in Inception-v3 (ImageNet) collapses accuracy from 74% to near-random (1–2%). Removal of random filters leaves performance largely intact. The top 10, 20, and 30 most-important filters drop accuracy to 38%, 8%, and random guessing, respectively (Ghorbani et al., 2020).
- The majority of high-5 filters for accuracy reside in early or mid layers, matching the intuition that early features are reused across classes (Ghorbani et al., 2020).
Interpretability and Visualization
- Visualization (e.g., DeepDream, activation maximization) of highest-6 filters highlights patterns such as color bands, edges, or class-specific motifs (e.g., zebra stripes, bar shapes) (Ghorbani et al., 2020, Adamczewski et al., 2019).
- Shapley-guided feature selection aligns closely with variational Bayes-based “importance-switch” approaches; both identify overlapping sets of critical filters (Adamczewski et al., 2019).
Targeted Model Repair
- In facial gender classification (SqueezeNet on CelebA/PPB), removing 50–100 most negative-7 filters—those harming fairness—raises Black Female accuracy from 54.7% to 81.9%, with negligible drop in overall test accuracy (Ghorbani et al., 2020).
- For adversarial robustness (Inception-v3 under PGD 8 attacks), zeroing top 16 highly exploited filters reduces white-box attack success from ≈100% to 0.1%, with only moderate clean accuracy loss (Ghorbani et al., 2020).
- SHARPEN applies Deep SHAP-based coarse-to-fine ranking and returns a list of “suspicious” neurons prioritized for repair by derivative-free algorithms such as CMA-ES, enabling targeted, architecture-agnostic repair of vulnerabilities or biases (Sun et al., 1 Apr 2026).
Compression and Pruning
- Shapley-driven pruning drastically reduces parameter count with minimal accuracy loss. On MNIST, pruning by cumulative Shapley mass (SVbucket) achieves >90% accuracy with 4–8× fewer neurons compared to random or weight-norm heuristics (Stier et al., 2019).
- On LeNet-5 and VGG-16, removing low-SV filters and retraining achieves error rates comparable to original models, confirming SV’s utility for structured pruning (Adamczewski et al., 2019).
5. Extensions, Variations, and Theoretical Properties
- Analytical Approximations: For ReLU activations, closed-form Gaussian approximations of input Shapley value yield fast, layer-wise relevance propagation and a smooth, nonvanishing Shapley gradient. The “Shapley Activation” (SA) implements this differentiation for improved optimization stability and convergence (Li et al., 2019).
- Layer/Region Attribution: The methodology generalizes to layers, spatial regions, or composite groups—relevant for hierarchical localization or structured pruning (Sun et al., 1 Apr 2026).
- Efficient Repair Pipeline: SHARPEN demonstrates a hierarchical attribution approach integrating Deep SHAP for localization and (optionally) CMA-ES for parameter repair, applicable to backdoors, adversarial vulnerability, and unfairness across architectures (Sun et al., 1 Apr 2026).
- Interpretability Guarantees: By construction, the Shapley value uniquely satisfies fairness axioms. While the characteristic function need not be monotonic (in which case “efficiency” is not strict), empirical rankings remain robust and unbiased under Monte Carlo approximation (Stier et al., 2019, Adamczewski et al., 2019).
Table: Key Neuron Shapley Method Variations
| Variant | Core Mechanism | Application Focus |
|---|---|---|
| Standard Neuron Shapley | Permutation Monte Carlo, TMAB | Importance, pruning, fairness, adversarial robustness (Ghorbani et al., 2020) |
| Deep SHAP Approximation | Linearization, SHAP multipliers | Fault localization, scalable repair (Sun et al., 1 Apr 2026) |
| Analytical Shapley | Gaussian approx., closed-form LRP | ReLU training/interpretation (Li et al., 2019) |
6. Limitations and Considerations
- Shapley-based methods assume the performance metric can be efficiently evaluated on subnetworks, necessitating massive computation for large 9 if not for sampling-based or approximate methods (Ghorbani et al., 2020, Adamczewski et al., 2019).
- Deep SHAP and Gaussian approximations may degrade when input correlations are strong, or in networks architectures not amenable to linearization (Li et al., 2019, Sun et al., 1 Apr 2026).
- In practice, higher-order interactions beyond 0-sized coalitions are ignored in some approximations, though empirical results show robust performance for moderate 1 (Adamczewski et al., 2019).
- Choice of performance metric 2 is critical; non-monotonic or ill-conditioned 3 can yield negative or otherwise pathological attributions, necessitating careful interpretation (Ghorbani et al., 2020).
7. Theoretical and Practical Significance
Neuron Shapley extends the formal machinery of cooperative game theory to network analysis, providing a principled, axiomatically grounded tool for neuron ranking, structured pruning, bias and vulnerability remediation, and interpretable neural diagnostics. Empirically, a small set of neurons or filters dominate performance in large networks; their identification and neutralization (via zeroing or repair) yield substantial gains in interpretability, fairness, robustness, and model size reduction—all without requiring retraining or model redesign. The core methods—permutation sampling, multi-armed bandit acceleration, Deep SHAP approximations, and activation-level analytical proxies—cover a range of practical requirements from full attribution to real-time repair (Ghorbani et al., 2020, Sun et al., 1 Apr 2026, Stier et al., 2019, Adamczewski et al., 2019, Li et al., 2019).