Dual feature-based and example-based explanation methods
Abstract: A new approach to the local and global explanation is proposed. It is based on selecting a convex hull constructed for the finite number of points around an explained instance. The convex hull allows us to consider a dual representation of instances in the form of convex combinations of extreme points of a produced polytope. Instead of perturbing new instances in the Euclidean feature space, vectors of convex combination coefficients are uniformly generated from the unit simplex, and they form a new dual dataset. A dual linear surrogate model is trained on the dual dataset. The explanation feature importance values are computed by means of simple matrix calculations. The approach can be regarded as a modification of the well-known model LIME. The dual representation inherently allows us to get the example-based explanation. The neural additive model is also considered as a tool for implementing the example-based explanation approach. Many numerical experiments with real datasets are performed for studying the approach. The code of proposed algorithms is available.
- One explanation does not fit all: A toolkit and taxonomy of AI explainability techniques. arXiv:1909.03012, Sep 2019.
- V. Belle and I. Papantonis. Principles and practice of explainable machine learning. Frontiers in Big Data, page 39, 2021.
- A survey of methods for explaining black box models. ACM computing surveys, 51(5):93, 2019.
- Explaining the black-box model: A survey of local interpretation methods for deep neural networks. Neurocomputing, 419:168–182, 2021.
- C. Molnar. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. Published online, https://christophm.github.io/interpretable-ml-book/, 2019.
- Interpretable machine learning: definitions, methods, and applications. arXiv:1901.04592, Jan 2019.
- Explainable deep learning: A field guide for the uninitiated. Journal of Artificial Intelligence Research, 73:329–396, 2022.
- Explainability of vision-based autonomous driving systems: Review and challenges. arXiv:2101.05307, January 2021.
- A survey on neural network interpretability. IEEE Transactions on Emerging Topics in Computational Intelligence, 5(5):726–742, 2021.
- “Why should I trust You?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pages 1135–1144, 2016.
- Neural additive models: Interpretable machine learning with neural nets. Advances in neural information processing systems, 34:4699–4711, 2021.
- T. Hastie and R. Tibshirani. Generalized additive models, volume 43. CRC press, 1990.
- A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems, pages 4765–4774, 2017.
- E. Strumbelj and I. Kononenko. An efficient explanation of individual classifications using game theory. Journal of Machine Learning Research, 11:1–18, 2010.
- L.S. Shapley. A value for n-person games. In Contributions to the Theory of Games, volume II of Annals of Mathematics Studies 28, pages 307–317. Princeton University Press, Princeton, 1953.
- E. Strumbelj and I. Kononenko. A general method for visualizing and explaining black-box regression models. In Adaptive and Natural Computing Algorithms. ICANNGA 2011, volume 6594 of Lecture Notes in Computer Science, pages 21–30, Berlin, 2011. Springer.
- E. Strumbelj and I. Kononenko. Explaining prediction models and individual predictions with feature contributions. Knowledge and Information Systems, 41:647–665, 2014.
- Ensembles of random SHAPs. Algorithms, 15(11, 431):1–27, 2022.
- R. Yousefzadeh. Deep learning generalization and the convex hull of training sets. In NeurIPS 2020 Workshop: Deep Learning through Information Geometry, pages 1–10, 2020.
- Simulation and the Monte Carlo method, 2nd Edition. Wiley, New Jersey, 2008.
- Sampling uniformly from the unit simplex. Technical Report 29, Johns Hopkins University, 2004.
- Visualizing the effects of predictor variables in black box supervised learning models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 82(4):1059–1086, 2020.
- Alime: Autoencoder based approach for local interpretability. In Intelligent Data Engineering and Automated Learning–IDEAL 2019: 20th International Conference, Manchester, UK, November 14–16, 2019, Proceedings, Part I 20, pages 454–463. Springer, 2019.
- Anchors: High-precision model-agnostic explanations. In AAAI Conference on Artificial Intelligence, pages 1527–1535, 2018.
- Enriching visual with verbal explanations for relational concepts: Combining LIME with Aleph. In Machine Learning and Knowledge Discovery in Databases: International Workshops of ECML PKDD 2019, pages 180–192. Springer, 2020.
- SurvLIME: A method for explaining machine learning survival models. Knowledge-Based Systems, 203:106164, 2020.
- D. Garreau and U. von Luxburg. Explaining the explainer: A first theoretical analysis of LIME. In International conference on artificial intelligence and statistics, pages 1287–1296. PMLR, 2020.
- D. Garreau and U. von Luxburg. Looking deeper into tabular LIME. arXiv:2008.11092, August 2020.
- GraphLIME: Local interpretable model explanations for graph neural networks. IEEE Transactions on Knowledge and Data Engineering, 35(7):6968–6972, 2022.
- Gami-net: An explainable neural networkbased on generalized additive models with structured interactions. Pattern Recognition, 120:108192, 2021.
- Adaptive explainable neural networks (AxNNs). arXiv:2004.02353v2, April 2020.
- InterpretML: A unified framework for machine learning interpretability. arXiv:1909.09223, September 2019.
- Interpretable machine learning with an ensemble of gradient boosting machines. Knowledge-Based Systems, 222(106993):1–16, 2021.
- FastSHAP: Real-time shapley value estimation. In The Tenth International Conference on Learning Representations, ICLR 2022, pages 1–23, 2022.
- On locality of local explanation models. In Advances in neural information processing systems, volume 34, pages 18395–18407, 2021.
- SHAFF: Fast and consistent SHApley eFfect estimates via random Forests. In International Conference on Artificial Intelligence and Statistics, pages 5563–5582. PMLR, 2022.
- TimeSHAP: Explaining recurrent models through sequence perturbations. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 2565–2573, 2021.
- X-SHAP: towards multiplicative explainability of machine learning. arXiv:2006.04574, June 2020.
- Shapley explanation networks. arXiv:2104.02297, Apr 2021.
- R. Fong and A. Vedaldi. Explanations for attributing deep neural network predictions. In Explainable AI, volume 11700 of LNCS, pages 149–167. Springer, Cham, 2019.
- R.C. Fong and A. Vedaldi. Interpretable explanations of black boxes by meaningful perturbation. In Proceedings of the IEEE International Conference on Computer Vision, pages 3429–3437. IEEE, 2017.
- RISE: Randomized input sampling for explanation of black-box models. arXiv:1806.07421, June 2018.
- Evaluating explainers via perturbation. arXiv:1906.02032v1, Jun 2019.
- Techniques for interpretable machine learning. Communications of the ACM, 63(1):68–77, 2019.
- LEAFAGE: Example-based and feature importance-based explanations for black-box ML models. In 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pages 1–7. IEEE, 2019.
- The effects of example-based explanations in a machine learning interface. In Proceedings of the 24th International Conference on Intelligent User Interfaces, pages 258–262, 2019.
- Toward scalable and unified example-based explanation and outlier detection. IEEE Trans Image Process, 31:525–540, 2022.
- Explaining latent representations with a corpus of examples. In Advances in Neural Information Processing Systems, volume 34, pages 12154–12166, 2021.
- Interactive label cleaning with example-based explanations. In Advances in Neural Information Processing Systems, volume 34, pages 12966–12977, 2021.
- Axiomatic attribution for deep networks. In 34th International Conference on Machine Learning, ICML, volume 7, pages 5109–5118, 2017.
- Explanations based on the missing: Towards contrastive explanations with pertinent negatives. arXiv:1802.07623v2, Oct 2018.
- A. Adadi and M. Berrada. Peeking inside the black-box: A survey on explainable artificial intelligence (XAI). IEEE Access, 6:52138–52160, 2018.
- Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58:82–115, 2020.
- Benchmarking and survey of explanation methods for black box models. Data Mining and Knowledge Discovery, 2023.
- N. Burkart and M.F. Huber. A survey on the explainability of supervised machine learning. Journal of Artificial Intelligence Research, 70:245–317, 2021.
- Machine learning interpretability: A survey on methods and metrics. Electronics, 8(832):1–34, 2019.
- A systematic review of explainable artificial intelligence in terms of different application domains and tasks. Applied Sciences, 12(3:1353):1–38, 2022.
- Interpretable deep learning: Interpretations, interpretability, trustworthiness, and beyond. Knowledge and Information Systems, 64(12):3197–3234, 2022.
- C. Rudin. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1:206–215, 2019.
- Interpretable machine learning: Fundamental principles and 10 grand challenges. arXiv:2103.11251, March 2021.
- Large data sets classification using convex-concave hull and support vector machine. Soft Computing, 17:793–804, 2013.
- Extreme vector machine for training on large data. International Journal of Machine Learning and Cybernetics, 11:33–53, 2020.
- Machine learning algorithm based on convex hull analysis. Procedia Computer Science, 186:381–386, 2021.
- Nearest convex hull classification based on linear programming. Pattern Recognition and Image Analysis, 31:205–211, 2021.
- Online support vector machine based on convex hull vertices selection. IEEE Transactions on Neural Networks and Learning Systems, 24(4):593–609, 2013.
- Learning in high dimension always amounts to extrapolation. arXiv:2110.09485v2, Oct 2021.
- A convex hull-based data selection method for data driven models. Applied Soft Computing, 47:515–533, 2016.
- Duality and geometry in svm classifiers. In ICML’00: Proceedings of the Seventeenth International Conference on Machine Learning, pages 57–64, 2000.
- T. Zhang. On the dual formulation of regularized linear systems with convex risks. Machine Learning, 46(1):91–129, 2002.
- T. Ergen and M. Pilanci. Convex duality of deep neural networks. In Proceedings of the 37 th International Conference on Machine Learning, volume PMLR 108, 2020.
- T. Ergen and M. Pilanci. Convex geometry and duality of over-parameterized neural networks. Journal of Machine Learning Research, 22(212):1–63, 2021.
- F. Farnia and D. Tse. A convex duality framework for gans. In 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), pages 1–11, 2018.
- High-dimensional similarity learning via dual-sparse random projection. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18), pages 3005–3011, 2018.
- R.T. Rockafellar. Convex Analysis, volume 28 of Princeton Mathematical Series. Princeton University Press, Princeton, N.J., 1970.
- Mixup: Beyond empirical risk minimization. In Proceedings of ICLR, pages 1–13, 2018.
- J.H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29:1189–1232, 2001.
- A simple and effective model-based variable importance measure. arXiv:1805.04755, May 2018.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.