Approximating the Shapley Value without Marginal Contributions

Published 1 Feb 2023 in cs.LG and cs.GT | (2302.00736v5)

Abstract: The Shapley value, which is arguably the most popular approach for assigning a meaningful contribution value to players in a cooperative game, has recently been used intensively in explainable artificial intelligence. Its meaningfulness is due to axiomatic properties that only the Shapley value satisfies, which, however, comes at the expense of an exact computation growing exponentially with the number of agents. Accordingly, a number of works are devoted to the efficient approximation of the Shapley value, most of them revolve around the notion of an agent's marginal contribution. In this paper, we propose with SVARM and Stratified SVARM two parameter-free and domain-independent approximation algorithms based on a representation of the Shapley value detached from the notion of marginal contribution. We prove unmatched theoretical guarantees regarding their approximation quality and provide empirical results including synthetic games as well as common explainability use cases comparing ourselves with state-of-the-art methods.

Abstract PDF Upgrade to Chat

Citations (16)

View on Semantic Scholar

Summary

The paper introduces novel SVARM and Stratified SVARM algorithms to approximate Shapley values without costlier marginal contributions.
The methods achieve improved computational efficiency with theoretical variance bounds and fixed space complexity.
Empirical results show competitive performance in both synthetic games and real-world explainability tasks, outperforming traditional approaches.

Approximating the Shapley Value without Marginal Contributions

Introduction

The Shapley value provides a theoretically strong solution for determining the contribution of individual players to the overall benefit in cooperative games. However, the computational complexity of calculating the exact Shapley value renders it impractical for large-scale applications, especially in scenarios where function evaluations are costly, such as assessing feature importance in machine learning models. Traditional approximation methods rely heavily on an agent's marginal contributions, leading to inefficiencies when the number of players increases.

This paper introduces two novel algorithms, SVARM and Stratified SVARM, to efficiently approximate the Shapley value without relying on marginal contributions. These algorithms provide unmatched theoretical guarantees concerning approximation quality, featuring properties such as being unbiased, parameter-free, and domain-independent.

Methodology

SVARM

SVARM, or Shapley Value Approximation without Requesting Marginals, leverages a reformulated expression of the Shapley value that separates the positive and negative contributions, $\phi_i^+$ and $\phi_i^-$ . Using two separate sampling processes $P^+$ and $P^-$ , SVARM directly samples coalition values, enabling updates of multiple Shapley value estimates concurrently. This approach greatly enhances computational efficiency compared to traditional marginal contribution methods.

Figure 1: Illustration of SVARM's sampling process and update rule: Each player i has two urns for sampling.

The pseudocode for SVARM involves initializing estimates and sample counts for players, performing a warm-up phase, and iterating through updates by sampling coalitions and updating all affected Shapley values. The algorithm proves computationally efficient with theoretical bounds on variance being constrained by $\mathcal{O}(\frac{\log n}{T - n})$ , indicating improved precision with increased budget allocation.

Stratified SVARM

Stratified SVARM enhances SVARM by incorporating a stratified sampling approach. It divides the coalition population into homogeneous strata based on their size, allowing the variance within each stratum to be reduced. This significantly accelerates the convergence of estimates by leveraging the maximum sample reuse principle, where the estimates for all players can be updated with a single coalition sample.

Figure 2: Illustration of Stratified SVARM's sampling process and update rule.

Theoretical guarantees show Stratified SVARM's variance is bounded by $\mathcal{O}\big(\frac{\log n}{T - n\log n}\big)$ . This method is incremental and operates with a fixed computational space complexity of $\mathcal{O}(n^2)$ .

Figure 3: Averaged MSE and standard errors over 100 repetitions in dependence of fixed budget T: (1) Airport game, (2) Shoe game, (3) SOUG game, (4) NLP sentiment analysis, (5) Image classifier, (6) Adult classification.

Empirical Results

The empirical assessment of SVARM and Stratified SVARM algorithms has been conducted using both synthetic and real-world explainability scenarios. The evaluation emphasized the accuracy of the Shapley value approximations under fixed computational budget constraints.

Synthetic Games

For synthetic games such as the Airport, Shoe, and SOUG games, both SVARM and Stratified SVARM were compared against established methodologies. The evaluations demonstrated that Stratified SVARM consistently outperformed its competitors, offering superior theoretical guarantees without necessitating additional constraints or domain-specific knowledge. This was observed across a broad range of player numbers, with Stratified SVARM achieving a logarithmic dependence on $n$ , a notable improvement over traditional approaches like ApproShapley.

Explainability Games

In real-world explainability scenarios including NLP sentiment analysis, image classification, and adult income classification, the performance of Stratified SVARM was robust and comparable to the state-of-the-art KernelSHAP. Stratified SVARM $^+$ , which samples without replacement for better empirical results, demonstrated notable competitive accuracy, achieving similar if not superior results across various test scenarios.

Conclusion

This study introduces the SVARM and Stratified SVARM algorithms as novel methods for approximating the Shapley value without the need for marginal contributions. By directly sampling coalition values and applying a dual distribution approach, these algorithms significantly improve computational efficiency while providing theoretical performance guarantees. Stratified SVARM further enhances precision through a refined stratified approach, achieving remarkable empirical results, particularly in synthetic games. These algorithms offer practical and effective solutions for Shapley value approximation in complex cooperative scenarios. Future research may focus on extending these methods to larger player scenarios and exploring structural impacts on approximation quality.

Markdown