Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 154 tok/s
Gemini 2.5 Pro 37 tok/s Pro
GPT-5 Medium 21 tok/s Pro
GPT-5 High 23 tok/s Pro
GPT-4o 96 tok/s Pro
Kimi K2 169 tok/s Pro
GPT OSS 120B 347 tok/s Pro
Claude Sonnet 4.5 36 tok/s Pro
2000 character limit reached

Shapley Value Approximation Based on k-Additive Games (2502.04763v1)

Published 7 Feb 2025 in cs.GT and cs.LG

Abstract: The Shapley value is the prevalent solution for fair division problems in which a payout is to be divided among multiple agents. By adopting a game-theoretic view, the idea of fair division and the Shapley value can also be used in machine learning to quantify the individual contribution of features or data points to the performance of a predictive model. Despite its popularity and axiomatic justification, the Shapley value suffers from a computational complexity that scales exponentially with the number of entities involved, and hence requires approximation methods for its reliable estimation. We propose SVA$k_{\text{ADD}}$, a novel approximation method that fits a $k$-additive surrogate game. By taking advantage of $k$-additivity, we are able to elicit the exact Shapley values of the surrogate game and then use these values as estimates for the original fair division problem. The efficacy of our method is evaluated empirically and compared to competing methods.

Summary

  • The paper presents a novel algorithm (SVA‑k_ADD) that approximates Shapley values using k‑additive surrogate games to reduce computational complexity.
  • It employs a weighted least squares approach with optimal coalition sampling to guarantee exact recovery for k=1,2,3 under full sampling.
  • Empirical results demonstrate that SVA‑k_ADD outperforms or matches state‑of‑the‑art methods in feature attribution tasks across diverse datasets.

Shapley Value Approximation via k-Additive Games: Theory, Algorithms, and Empirical Analysis

Introduction and Motivation

The Shapley value is a canonical solution concept in cooperative game theory, uniquely satisfying axioms of fairness for distributing the value generated by a coalition among its members. Its adoption in machine learning, particularly for feature attribution and model explainability, has been extensive due to its axiomatic justification and interpretability. However, the exponential computational complexity of exact Shapley value calculation (O(2n)O(2^n) for nn players/features) renders it infeasible for practical use in high-dimensional settings, necessitating efficient approximation algorithms.

This paper introduces SVAkADDk_{\text{ADD}}, a novel Shapley value approximation method leveraging the structure of kk-additive games. By fitting a kk-additive surrogate to sampled coalition values, the method enables polynomial-time estimation of Shapley values, with theoretical guarantees for exact recovery under full sampling and empirical evidence of competitive performance against state-of-the-art baselines. Figure 1

Figure 1: The SVAkADDk_{\text{ADD}} pipeline: sampled coalition values ν(A1),,ν(AT)\nu(A_1),\ldots,\nu(A_T) are used to fit a kk-additive surrogate game (N,νk)(N,\nu_k), from which Shapley values ϕ1k,,ϕnk\phi_1^k,\ldots,\phi_n^k are computed directly and serve as estimates for the original game.

Theoretical Foundations: k-Additivity and Shapley Interactions

A cooperative game (N,ν)(N, \nu) is defined by a set of players NN and a value function ν:P(N)R\nu: \mathcal{P}(N) \to \mathbb{R}. The Shapley value for player ii is given by:

ϕi=AN{i}(nA1)!A!n![ν(A{i})ν(A)]\phi_i = \sum_{A \subseteq N \setminus \{i\}} \frac{(n - |A| - 1)! |A|!}{n!} [\nu(A \cup \{i\}) - \nu(A)]

The concept of kk-additivity restricts the game such that all interaction indices I(S)I(S) for S>k|S| > k vanish. This reduces the number of parameters from 2n2^n to O(nk)O(n^k), enabling tractable surrogate modeling. The surrogate game (N,νk)(N, \nu_k) is parameterized by interaction indices up to order kk, and the Shapley value for each player is directly given by the singleton interaction Ik({i})I^k(\{i\}).

The paper proves that, with a specific choice of coalition sampling weights wA=(n2A1)1w_A^* = \binom{n-2}{|A|-1}^{-1}, the solution to the kk-additive weighted least squares problem yields the exact Shapley values for k=1,2,3k=1,2,3 when all coalitions are observed. This result holds regardless of the true game's additivity, establishing the theoretical soundness of the approach.

SVAkADDk_{\text{ADD}}: Algorithmic Details

The SVAkADDk_{\text{ADD}} algorithm proceeds as follows:

  1. Sampling: Draw TT coalitions (including \emptyset and NN) according to the derived optimal weights.
  2. Surrogate Fitting: Solve a constrained weighted least squares problem to fit the kk-additive surrogate νk\nu_k to the sampled coalition values.
  3. Shapley Value Extraction: Compute the Shapley values ϕk\phi^k from the fitted surrogate, which serve as estimates for the original game.

The optimization problem is:

minIkAM{,N}wA(ν(A)νk(A))2\min_{I^k} \sum_{A \in \mathcal{M} \setminus \{\emptyset, N\}} w_A \left( \nu(A) - \nu_k(A) \right)^2

subject to the efficiency constraint ν(N)ν()=νk(N)νk()\nu(N) - \nu(\emptyset) = \nu_k(N) - \nu_k(\emptyset). The problem is solved analytically or via standard linear algebra routines, as detailed in the appendix.

Empirical Evaluation: Approximation Quality and Trade-offs

The empirical analysis spans global, local, and unsupervised feature importance tasks across diverse datasets (Diabetes, Titanic, Wine, Adult, ImageNet, IMDB, Breast Cancer, Big Five, FIFA). The main evaluation metric is mean squared error (MSE) between estimated and true Shapley values, computed exhaustively for tractable nn. Figure 2

Figure 2

Figure 2

Figure 2

Figure 2

Figure 2

Figure 2

Figure 2

Figure 2

Figure 2: MSE of SVAkADDk_{\text{ADD}} for varying kk and sample budget TT across datasets; higher kk yields faster convergence but requires more samples for identifiability.

Key findings:

  • Additivity Degree (kk) Trade-off: Low kk (e.g., k=1k=1) leads to slow convergence due to insufficient model flexibility. Increasing kk accelerates convergence but raises sample requirements. Empirically, k=3k=3 offers a favorable balance for most datasets.
  • Comparison to Baselines: SVAkADDk_{\text{ADD}} (with k=3k=3) matches or outperforms stratified SVARM and KernelSHAP in most settings, especially for local attribution tasks. For very small sample budgets, stratified SVARM may be superior, but SVAkADDk_{\text{ADD}} overtakes as TT increases. Figure 3

Figure 3

Figure 3

Figure 3

Figure 3

Figure 3

Figure 3

Figure 3

Figure 3

Figure 3: Comparative MSE of SVAkADDk_{\text{ADD}} and competing methods (Stratified SVARM, KernelSHAP, Permutation sampling) as a function of sample budget TT.

  • Domain Independence: The method is model-agnostic and applicable to any cooperative game, including unsupervised settings where the value function is based on total correlation.

Implementation Considerations

  • Computational Complexity: For fixed kk, the surrogate fitting scales polynomially in nn and linearly in the number of samples TT. The main bottleneck is coalition value evaluation, especially when retraining models for each coalition.
  • Sample Budget: The minimum required TT grows with kk and nn due to identifiability constraints. For k=3k=3, T=O(n3)T = O(n^3) suffices.
  • Deployment: SVAkADDk_{\text{ADD}} is suitable for settings where coalition evaluation is expensive and sample efficiency is critical. It is particularly advantageous when higher-order interactions are negligible or when interpretability of interaction effects is desired.

Limitations and Future Directions

  • Scalability: While polynomial in nn for fixed kk, the approach becomes impractical for large nn and high kk due to the combinatorial growth of interaction terms.
  • Approximation Guarantees: No theoretical bounds are provided for the approximation error under partial sampling; empirical results suggest strong performance, but formal analysis remains open.
  • Interaction Effects: The method estimates interaction indices up to order kk, enabling analysis of feature redundancy and complementarity, which is valuable for feature engineering and scientific discovery.

Conclusion

SVAkADDk_{\text{ADD}} provides a theoretically justified and empirically validated approach for Shapley value approximation via kk-additive surrogate games. It offers a flexible trade-off between model expressiveness and sample efficiency, competitive performance against leading baselines, and broad applicability across explanation tasks. Future work may extend theoretical analysis to arbitrary kk, explore adaptive sampling strategies, and leverage estimated interaction indices for deeper interpretability.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.