RankingSHAP -- Listwise Feature Attribution Explanations for Ranking Models (2403.16085v1)
Abstract: Feature attributions are a commonly used explanation type when we want to explain the prediction of a trained model post hoc. Yet they are not well explored in IR. Importantly, feature attribution has rarely been rigorously defined beyond attributing the highest value to the most important feature. What it means for a feature to be more important than others is often left vague. Consequently, most approaches focus on just selecting the most important features and underutilize or even ignore the relative importance among features. In this work, we rigorously define the notion of feature attribution for ranking models and list essential properties that a valid attribution should have. We then propose RankingSHAP as a concrete instantiation of a list-wise ranking attribution method. In contrast to current explanation evaluation schemes that focus on selections, we propose two novel evaluation paradigms for evaluating attributions over learning-to-rank models. We evaluate RankingSHAP on commonly used learning-to-rank datasets to showcase the more nuanced use of an attribution method while highlighting the limitations of selection-based explanations. In a simulated experiment, we design an interpretable model to demonstrate how list-wise ranking attributions can be used to investigate model decisions and evaluate the explanations qualitatively. Because of the contrastive nature of the ranking task, our understanding of ranking model decisions can substantially benefit from feature attribution explanations like RankingSHAP.
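To make the idea of a list-wise attribution concrete, here is a minimal sketch of exact Shapley values computed over a whole ranked list rather than over a single document score. The abstract does not specify RankingSHAP's value function, so everything below is an assumption for illustration: the value of a feature coalition is taken to be Kendall's tau between the full-model ranking and the ranking obtained when all other features are replaced by a baseline, the ranker is a toy linear scorer, and the names `kendall_tau`, `rank_documents`, and `listwise_shapley` are hypothetical helpers, not the authors' code.

```python
from itertools import combinations
from math import factorial


def kendall_tau(rank_a, rank_b):
    """Kendall's tau between two orderings of the same items (no ties)."""
    pos_a = {d: i for i, d in enumerate(rank_a)}
    pos_b = {d: i for i, d in enumerate(rank_b)}
    concordant = discordant = 0
    for x, y in combinations(rank_a, 2):
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) > 0:
            concordant += 1
        else:
            discordant += 1
    return (concordant - discordant) / (concordant + discordant)


def rank_documents(weights, docs, baseline, coalition):
    """Rank document ids by a toy linear score (assumed model).

    Features outside `coalition` are replaced by their baseline value.
    Score ties are broken by document id so the ranking is deterministic.
    """
    def score(x):
        return sum(w * (x[j] if j in coalition else baseline[j])
                   for j, w in enumerate(weights))
    return sorted(docs, key=lambda d: (-score(docs[d]), d))


def listwise_shapley(weights, docs, baseline):
    """Exact Shapley values with a list-wise value function (assumed: Kendall's tau)."""
    n = len(weights)
    target = rank_documents(weights, docs, baseline, frozenset(range(n)))

    def value(coalition):
        return kendall_tau(target, rank_documents(weights, docs, baseline, coalition))

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Standard Shapley weight for a coalition of this size.
            w = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = frozenset(subset)
                total += w * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


# Toy query with three documents and three features; feature 2 has zero weight.
weights = [2.0, 1.0, 0.0]
docs = {"a": [0, 0, 9], "b": [0, 1, 1], "c": [1, 0, 5]}
baseline = [0, 0, 0]
phi = listwise_shapley(weights, docs, baseline)
print(phi)
```

In this sketch each feature receives one value for the whole ranked list, so the relative sizes of the values can be compared, not only which feature ranks first. Exact enumeration is exponential in the number of features; a practical implementation would rely on a sampling-based approximation.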
- Maria Heuss
- Maarten de Rijke
- Avishek Anand