Papers
Topics
Authors
Recent
Search
2000 character limit reached

MVQN: Multi-Valued Quantum Neurons

Updated 16 May 2026
  • MVQN are advanced neural models that encode multiple quantum states per neuron, enhancing representational capacity and nuanced decision-making.
  • They leverage game-theoretic attribution methods, such as Shapley values, to rigorously quantify individual neuron contributions to accuracy, fairness, and robustness.
  • MVQN facilitate effective pruning and targeted model repair, leading to improved efficiency, interpretability, and resilience in complex network architectures.

The Neuron Shapley approach is a class of game-theoretic attribution methods designed to quantify the importance of individual neurons, filters, or layers within deep neural networks. By leveraging the Shapley value from cooperative game theory, Neuron Shapley measures the marginal contribution of each network component to various performance metrics—such as accuracy, fairness, or robustness—while fully accounting for interactions and redundancy among components. This methodology supports principled neuron ranking, model interpretability, fairness diagnostics, adversarial vulnerability analysis, and targeted model repair across a range of architectures and tasks (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019, Li et al., 2019, Sun et al., 1 Apr 2026).

1. Formalization of the Neuron Shapley Value

Let N={1,2,,n}N = \{1, 2, \ldots, n\} denote the set of neurons (or filters) in a neural network. Define V(S)V(S) as a black-box performance metric (e.g., accuracy, loss, class recall, fairness) measured on the subnetwork retaining only neurons in SNS \subseteq N, with all other neurons zeroed out. The Neuron Shapley value ϕi(V,N)\phi_i(V, N) is then:

ϕi(V,N)=1NSN{i}V(S{i})V(S)(N1S)\phi_i(V, N) = \frac{1}{|N|} \sum_{S \subseteq N \setminus \{i\}} \frac{V(S \cup \{i\}) - V(S)}{\binom{|N|-1}{|S|}}

Alternatively, in permutation form:

ϕi=Eπ[V(Sπi{i})V(Sπi)]\phi_i = \mathbb{E}_{\pi}\Big[V\left(S^i_{\pi} \cup \{i\}\right) - V(S^i_{\pi})\Big]

where SπiS^i_\pi is the set of neurons preceding ii in permutation π\pi of NN. This framework treats each neuron as a player in a coalitional game, with the characteristic function V(S)V(S)0 specifying the payout to each coalition (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019).

This setup extends to layers, filters, or even activation groups, depending on architectural granularity (Adamczewski et al., 2019, Sun et al., 1 Apr 2026).

2. Capturing High-Order Neuron Interactions

The Shapley framework captures all higher-order interactions, including synergy, redundancy, and conditional indispensability. Marginal contributions V(S)V(S)1 are aggregated over all subsets V(S)V(S)2 of other neurons, thus ensuring sensitivity to contexts where:

  • Individual neurons are only valuable jointly (e.g., V(S)V(S)3 while V(S)V(S)4).
  • Redundant or antagonistic behavior is appropriately distributed across interacting units.
  • Pruning or reweighting can be done while respecting both unique and overlapping contributions.

Shapley’s axioms (null-player, symmetry, additivity/linearity) uniquely define this “fair” attribution and guarantee consistency in neuron ranking (Ghorbani et al., 2020, Stier et al., 2019, Adamczewski et al., 2019).

3. Efficient Estimation Algorithms

Exact computation of Neuron Shapley values is exponential in V(S)V(S)5. Key estimation methods include:

  • Monte Carlo Permutation Sampling: Draw V(S)V(S)6 random permutations of neurons. For each permutation, accumulate the marginal contribution of each neuron as it is “added” during the forward scan. Averaging over V(S)V(S)7 samples gives an unbiased estimate. Typically V(S)V(S)8–V(S)V(S)9 yields adequate accuracy for moderate SNS \subseteq N0 (Stier et al., 2019, Adamczewski et al., 2019).
  • Truncated Multi-Armed Bandit (TMAB): Adaptively samples only the top-k candidate neurons by focusing computation where Shapley values are close to the k-th largest. Early truncates permutation trajectories when performance drops below a threshold SNS \subseteq N1, reducing the number of forward passes needed. Empirical Bernstein bounds guarantee error control to within SNS \subseteq N2 with probability SNS \subseteq N3 (Ghorbani et al., 2020).
  • Deep SHAP Approximation: In large models or when tight approximations suffice, Deep SHAP linearizes layers around a background set and propagates attributions, yielding approximate neuron-level or layer-level Shapley values efficiently in a few backward passes (Sun et al., 1 Apr 2026).

ϕi(V,N)\phi_i(V, N)4 (Ghorbani et al., 2020)

4. Empirical Properties and Diagnostic Applications

Model Destruction via Neuron Removal

  • Removing the top 30 filters by SNS \subseteq N4 in Inception-v3 (ImageNet) collapses accuracy from 74% to near-random (1–2%). Removal of random filters leaves performance largely intact. The top 10, 20, and 30 most-important filters drop accuracy to 38%, 8%, and random guessing, respectively (Ghorbani et al., 2020).
  • The majority of high-SNS \subseteq N5 filters for accuracy reside in early or mid layers, matching the intuition that early features are reused across classes (Ghorbani et al., 2020).

Interpretability and Visualization

  • Visualization (e.g., DeepDream, activation maximization) of highest-SNS \subseteq N6 filters highlights patterns such as color bands, edges, or class-specific motifs (e.g., zebra stripes, bar shapes) (Ghorbani et al., 2020, Adamczewski et al., 2019).
  • Shapley-guided feature selection aligns closely with variational Bayes-based “importance-switch” approaches; both identify overlapping sets of critical filters (Adamczewski et al., 2019).

Targeted Model Repair

  • In facial gender classification (SqueezeNet on CelebA/PPB), removing 50–100 most negative-SNS \subseteq N7 filters—those harming fairness—raises Black Female accuracy from 54.7% to 81.9%, with negligible drop in overall test accuracy (Ghorbani et al., 2020).
  • For adversarial robustness (Inception-v3 under PGD SNS \subseteq N8 attacks), zeroing top 16 highly exploited filters reduces white-box attack success from ≈100% to 0.1%, with only moderate clean accuracy loss (Ghorbani et al., 2020).
  • SHARPEN applies Deep SHAP-based coarse-to-fine ranking and returns a list of “suspicious” neurons prioritized for repair by derivative-free algorithms such as CMA-ES, enabling targeted, architecture-agnostic repair of vulnerabilities or biases (Sun et al., 1 Apr 2026).

Compression and Pruning

  • Shapley-driven pruning drastically reduces parameter count with minimal accuracy loss. On MNIST, pruning by cumulative Shapley mass (SVbucket) achieves >90% accuracy with 4–8× fewer neurons compared to random or weight-norm heuristics (Stier et al., 2019).
  • On LeNet-5 and VGG-16, removing low-SV filters and retraining achieves error rates comparable to original models, confirming SV’s utility for structured pruning (Adamczewski et al., 2019).

5. Extensions, Variations, and Theoretical Properties

  • Analytical Approximations: For ReLU activations, closed-form Gaussian approximations of input Shapley value yield fast, layer-wise relevance propagation and a smooth, nonvanishing Shapley gradient. The “Shapley Activation” (SA) implements this differentiation for improved optimization stability and convergence (Li et al., 2019).
  • Layer/Region Attribution: The methodology generalizes to layers, spatial regions, or composite groups—relevant for hierarchical localization or structured pruning (Sun et al., 1 Apr 2026).
  • Efficient Repair Pipeline: SHARPEN demonstrates a hierarchical attribution approach integrating Deep SHAP for localization and (optionally) CMA-ES for parameter repair, applicable to backdoors, adversarial vulnerability, and unfairness across architectures (Sun et al., 1 Apr 2026).
  • Interpretability Guarantees: By construction, the Shapley value uniquely satisfies fairness axioms. While the characteristic function need not be monotonic (in which case “efficiency” is not strict), empirical rankings remain robust and unbiased under Monte Carlo approximation (Stier et al., 2019, Adamczewski et al., 2019).

Table: Key Neuron Shapley Method Variations

Variant Core Mechanism Application Focus
Standard Neuron Shapley Permutation Monte Carlo, TMAB Importance, pruning, fairness, adversarial robustness (Ghorbani et al., 2020)
Deep SHAP Approximation Linearization, SHAP multipliers Fault localization, scalable repair (Sun et al., 1 Apr 2026)
Analytical Shapley Gaussian approx., closed-form LRP ReLU training/interpretation (Li et al., 2019)

6. Limitations and Considerations

  • Shapley-based methods assume the performance metric can be efficiently evaluated on subnetworks, necessitating massive computation for large SNS \subseteq N9 if not for sampling-based or approximate methods (Ghorbani et al., 2020, Adamczewski et al., 2019).
  • Deep SHAP and Gaussian approximations may degrade when input correlations are strong, or in networks architectures not amenable to linearization (Li et al., 2019, Sun et al., 1 Apr 2026).
  • In practice, higher-order interactions beyond ϕi(V,N)\phi_i(V, N)0-sized coalitions are ignored in some approximations, though empirical results show robust performance for moderate ϕi(V,N)\phi_i(V, N)1 (Adamczewski et al., 2019).
  • Choice of performance metric ϕi(V,N)\phi_i(V, N)2 is critical; non-monotonic or ill-conditioned ϕi(V,N)\phi_i(V, N)3 can yield negative or otherwise pathological attributions, necessitating careful interpretation (Ghorbani et al., 2020).

7. Theoretical and Practical Significance

Neuron Shapley extends the formal machinery of cooperative game theory to network analysis, providing a principled, axiomatically grounded tool for neuron ranking, structured pruning, bias and vulnerability remediation, and interpretable neural diagnostics. Empirically, a small set of neurons or filters dominate performance in large networks; their identification and neutralization (via zeroing or repair) yield substantial gains in interpretability, fairness, robustness, and model size reduction—all without requiring retraining or model redesign. The core methods—permutation sampling, multi-armed bandit acceleration, Deep SHAP approximations, and activation-level analytical proxies—cover a range of practical requirements from full attribution to real-time repair (Ghorbani et al., 2020, Sun et al., 1 Apr 2026, Stier et al., 2019, Adamczewski et al., 2019, Li et al., 2019).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Multi-Valued Quantum Neurons (MVQN).