KernelSHAP-IQ: Weighted Least-Square Optimization for Shapley Interactions (2405.10852v2)
Abstract: The Shapley value (SV) is a prevalent approach of allocating credit to ML entities to understand black box ML models. Enriching such interpretations with higher-order interactions is inevitable for complex systems, where the Shapley Interaction Index (SII) is a direct axiomatic extension of the SV. While it is well-known that the SV yields an optimal approximation of any game via a weighted least square (WLS) objective, an extension of this result to SII has been a long-standing open problem, which even led to the proposal of an alternative index. In this work, we characterize higher-order SII as a solution to a WLS problem, which constructs an optimal approximation via SII and $k$-Shapley values ($k$-SII). We prove this representation for the SV and pairwise SII and give empirically validated conjectures for higher orders. As a result, we propose KernelSHAP-IQ, a direct extension of KernelSHAP for SII, and demonstrate state-of-the-art performance for feature interactions.
- Slic superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(11):2274–2282, 2012. doi: 10.1109/TPAMI.2012.120.
- Banzhaf III, J. F. Weighted voting doesn’t work: A mathematical analysis. Rutgers Law Review, 19:317, 1964.
- From Shapley Values to Generalized Additive Models and back. In International Conference on Artificial Intelligence and Statistics (AISTATS 2023), volume 206 of Proceedings of Machine Learning Research, pp. 709–745. PMLR, 2023.
- Polynomial calculation of the Shapley value based on sampling. Computers & Operations Research, 36(5):1726–1730, 2009. doi: 10.1016/j.cor.2008.04.004.
- Extremal Principle Solutions of Games in Characteristic Function Form: Core, Chebychev and Shapley Value Generalizations, volume 11, pp. 123–133. Springer Netherlands, 1988. doi: 10.1007/978-94-009-3677-5˙7.
- Algorithms to estimate Shapley value feature attributions. Nature Machine Intelligence, 5:590–601, 2023. doi: 10.1038/s42256-023-00657-x.
- Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2016), pp. 785–794. ACM, 2016. doi: 10.1145/2939672.2939785.
- Improving KernelSHAP: Practical Shapley Value Estimation Using Linear Regression. In The 24th International Conference on Artificial Intelligence and Statistics, (AISTATS 2021), volume 130 of Proceedings of Machine Learning Research, pp. 3457–3465. PMLR, 2021.
- Explaining by Removing: A Unified Framework for Model Explanation. Journal of Machine Learning Research, 22(209):1–90, 2021. doi: 10.5555/3546258.3546467.
- Unifying Fourteen Post-hoc Attribution Methods with Taylor Interactions. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1–17, 2024. doi: 10.1109/TPAMI.2024.3358410.
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (CVPR 2009), pp. 248–255. IEEE Computer Society, 2009. doi: 10.1109/CVPR.2009.5206848.
- On the complexity of cooperative solution concepts. Mathematics of Operations Research, 19(2):257–266, 1994. doi: 10.1287/moor.19.2.257.
- Interaction transform of set functions over a finite set. Information Sciences, 121(1–2):149–170, 1999. doi: 10.1016/S0020-0255(99)00099-7.
- Event Labeling Combining Ensemble Detectors and Background Knowledge. Progress in Artificial Intelligence, 2(2):113–127, 2014. doi: 10.1007/s13748-013-0040-3.
- OpenML-Python: an extensible Python API for OpenML. CoRR, abs/1911.02490, 2020.
- Axiomatic characterizations of probabilistic and cardinal-probabilistic interaction indices. Games and Economic Behavior, 55(1):72–99, 2006. doi: 10.1016/j.geb.2005.03.002.
- SHAP-IQ: Unified approximation of any-order shapley interactions. In Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023), 2023.
- Data shapley: Equitable valuation of data for machine learning. In Proceedings of the 36th International Conference on Machine Learning, (ICML 2019), volume 97 of Proceedings of Machine Learning Research, pp. 2242–2251. PMLR, 2019.
- Grabisch, M. k-order additive discrete fuzzy measures and their representation. Fuzzy Sets and Systems, 92(2):167–189, 1997. doi: 10.1016/S0165-0114(97)00168-1.
- Grabisch, M. Set Functions, Games and Capacities in Decision Making, volume 46. Springer International Publishing Switzerland, 2016. ISBN 978-3-319-30690-2. doi: 10.1007/978-3-319-30690-2.
- An axiomatic approach to the concept of interaction among players in cooperative games. International Journal of Game Theory, 28(4):547–565, 1999. doi: 10.1007/s001820050125.
- Equivalent representations of set functions. Mathematics of Operations Research, 25(2):157–178, 2000. doi: 10.1287/moor.25.2.157.12225.
- Approximations of pseudo-boolean functions; applications to game theory. ZOR Mathematical Methods of Operations Research, 36(1):3–21, 1992. doi: 10.1007/BF01541028.
- Joint shapley values: a measure of joint feature importance. In The Tenth International Conference on Learning Representations, (ICLR 2022). OpenReview.net, 2022.
- Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, (CVPR 2016), pp. 770–778. IEEE Computer Society, 2016. doi: 10.1109/CVPR.2016.90.
- Decomposing Global Feature Effects Based on Feature Interactions. CoRR, abs/2306.00541, 2023.
- Statistical aspects of shap: Functional anova for model interpretation. CoRR, abs/2208.09970, 2022.
- Unifying local and global model explanations by functional decomposition of low dimensional structures. In International Conference on Artificial Intelligence and Statistics (AISTATS 2023), volume 206 of Proceedings of Machine Learning Research, pp. 7040–7060. PMLR, 2023.
- Hooker, G. Discovering additive structure in black box functions. In Kim, W., Kohavi, R., Gehrke, J., and DuMouchel, W. (eds.), Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2004), pp. 575–580. ACM, 2004. doi: 10.1145/1014052.1014122.
- Hooker, G. Generalized Functional ANOVA Diagnostics for High-Dimensional Functions of Dependent Variables. Journal of Computational and Graphical Statistics, 16(3):709–732, 2007. doi: 10.1198/106186007X237892.
- Explaining explanations: Axiomatic feature interactions for deep networks. Journal of Machine Learning Research, 22(104):1–54, 2021.
- FastSHAP: Real-Time Shapley Value Estimation. In The Tenth International Conference on Learning Representations (ICLR 2022). OpenReview.net, 2022.
- Sparse spatial autoregressions. Statistics & Probability Letters, 33(3):291–297, 1997. doi: https://doi.org/10.1016/S0167-7152(96)00140-X.
- Kohavi, R. Scaling up the accuracy of naive-bayes classifiers: A decision-tree hybrid. In Proceedings of International Conference on Knowledge Discovery and Data Mining (KDD 1996), pp. 202–207, 1996.
- Approximating the Shapley Value without Marginal Contributions. CoRR, abs/2302.00736, 2023.
- SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification. CoRR, abs/2401.13371, 2024.
- Shapley Residuals: Quantifying the limits of the Shapley value for explanations. In Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021 NeurIPS 2021, pp. 26598–26608, 2021.
- Problems with Shapley-value-based explanations as feature importance measures. In Proceedings of the 37th International Conference on Machine Learning (ICML 2020), volume 119 of Proceedings of Machine Learning Research, pp. 5491–5500. PMLR, 2020.
- Purifying Interaction Effects with the Functional ANOVA: An Efficient Algorithm for Recovering Identifiable Additive Models. In The 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), volume 108 of Proceedings of Machine Learning Research, pp. 2402–2412. PMLR, 2020.
- Datasets: A community library for natural language processing. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, (EMNLP 2021), pp. 175–184. Association for Computational Linguistics, 2021.
- A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, NeurIPS 2017, pp. 4765–4774, 2017.
- From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1):56–67, 2020. doi: 10.1038/s42256-019-0138-9.
- Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, (HLT 2011), pp. 142–150. Association for Computational Linguistics, 2011.
- The Chaining Interaction Index among Players in Cooperative Games, pp. 69–85. Springer Netherlands, 1999. doi: 10.1007/978-94-017-0647-6˙5.
- Beyond word importance: Contextual decomposition to extract interactions from lstms. In 6th International Conference on Learning Representations, (ICLR 2018), 2018.
- Beyond TreeSHAP: Efficient Computation of Any-Order Shapley Interactions for Tree Ensembles. CoRR, abs/2401.12069, 2024.
- Pytorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, (NeurIPS 2019), pp. 8024–8035. Curran Associates, Inc., 2019.
- Scikit-learn: Machine learning in python. Journal of Machine Learning Research, 12:2825–2830, 2011. doi: 10.5555/1953048.2078195.
- A k-additive choquet integral-based approach to approximate the SHAP values for local interpretability in machine learning. Artificial Intelligence, 325:104014, 2023. doi: 10.1016/J.ARTINT.2023.104014.
- The family of least square values for transferable utility games. Games and Economic Behavior, 24(1):109–130, 1998. doi: https://doi.org/10.1006/game.1997.0622.
- Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. CoRR, abs/1910.01108, 2019.
- Shapley, L. S. A Value for n-Person Games. In Contributions to the Theory of Games (AM-28), Volume II, pp. 307–318. Princeton University Press, 1953.
- Fooling LIME and SHAP: adversarial attacks on post hoc explanation methods. In AAAI/ACM Conference on AI, Ethics, and Society (AIES 2020), pp. 180–186. ACM, 2020. doi: 10.1145/3375627.3375830.
- The Shapley Taylor Interaction Index. In Proceedings of the 37th International Conference on Machine Learning, (ICML 2020), volume 119 of Proceedings of Machine Learning Research, pp. 9259–9268. PMLR, 2020.
- Faith-Shap: The Faithful Shapley Interaction Index. Journal of Machine Learning Research, 24(94):1–42, 2023.
- Detecting statistical interactions from neural network weights. In 6th International Conference on Learning Representations, (ICLR 2018), 2018.
- Feature interaction interpretability: A case for explaining ad-recommendation systems via neural interaction detection. In 8th International Conference on Learning Representations, (ICLR 2020), 2020a.
- How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems (NeurIPS 2020), pp. 6147–6159, 2020b.
- SNP interaction detection with Random Forests in high-dimensional genetic data. BMC Bioinformatics, 13:164, 2012. doi: 10.1186/1471-2105-13-164.
- Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, (EMNLP 2020), pp. 38–45. Association for Computational Linguistics, 2020. doi: 10.18653/v1/2020.emnlp-demos.6.
- Do little interactions get lost in dark random forests? BMC Bioinform., 17:145, 2016. doi: 10.1186/s12859-016-0995-8.
- Linear tree shap. In Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, (NeurIPS 2022), 2022.
- Interventional SHAP values and interaction values for piecewise linear regression trees. In Thirty-Seventh AAAI Conference on Artificial Intelligence, (AAAI 2023), pp. 11164–11173. AAAI Press, 2023. doi: 10.1609/AAAI.V37I9.26322.
- Interpreting Multivariate Shapley Interactions in DNNs. In Thirty-Fifth AAAI Conference on Artificial Intelligence, (AAAI 2021), pp. 10877–10886. AAAI Press, 2021.