
Visualizing the Feature Importance for Black Box Models (1804.06620v3)

Published 18 Apr 2018 in stat.ML, cs.AI, and cs.LG

Abstract: In recent years, a large amount of model-agnostic methods to improve the transparency, trustability and interpretability of machine learning models have been developed. We introduce local feature importance as a local version of a recent model-agnostic global feature importance method. Based on local feature importance, we propose two visual tools: partial importance (PI) and individual conditional importance (ICI) plots which visualize how changes in a feature affect the model performance on average, as well as for individual observations. Our proposed methods are related to partial dependence (PD) and individual conditional expectation (ICE) plots, but visualize the expected (conditional) feature importance instead of the expected (conditional) prediction. Furthermore, we show that averaging ICI curves across observations yields a PI curve, and integrating the PI curve with respect to the distribution of the considered feature results in the global feature importance. Another contribution of our paper is the Shapley feature importance, which fairly distributes the overall performance of a model among the features according to the marginal contributions and which can be used to compare the feature importance across different models.

Authors (3)
  1. Giuseppe Casalicchio (34 papers)
  2. Christoph Molnar (11 papers)
  3. Bernd Bischl (136 papers)
Citations (168)

Summary

Analysis of Feature Importance Visualization for Machine Learning Models

The paper under review presents techniques for visualizing feature importance in black box machine learning models. Authored by Casalicchio, Molnar, and Bischl, it introduces visualization strategies aimed at improving the transparency and interpretability of models often criticized for their opacity, such as neural networks and support vector machines.

Core Contributions and Methods

The authors describe two central visualization tools: Partial Importance (PI) plots and Individual Conditional Importance (ICI) plots. These build on established techniques, Partial Dependence (PD) and Individual Conditional Expectation (ICE) plots, but visualize how changes in a feature affect model performance rather than model predictions. ICI plots show this effect for individual observations, while PI plots show the expected effect obtained by averaging the ICI curves. Additionally, the authors introduce a Shapley feature importance measure, which draws on cooperative game theory to fairly distribute a model's predictive performance among its features according to their marginal contributions.
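To make the mechanics concrete, the following minimal sketch (my own illustration, not the authors' reference implementation) computes ICI and PI curves for one numeric feature by replacing its value with grid points and measuring the change in per-observation loss. It assumes a fitted scikit-learn-style regressor `model`, a pandas DataFrame `X`, a target array `y`, and squared-error loss; the helper names `ici_curves` and `pi_curve` are hypothetical.

```python
# Minimal sketch of ICI and PI curves for a single feature (illustration only).
# Assumes: `model` has a .predict() method, `X` is a pandas DataFrame of
# features, `y` is a NumPy array of targets; squared-error loss throughout.
import numpy as np

def ici_curves(model, X, y, feature, grid):
    """Change in per-observation loss when `feature` is replaced by each grid value."""
    base_loss = (model.predict(X) - y) ** 2              # loss of each observation as-is
    curves = np.empty((len(X), len(grid)))
    for k, value in enumerate(grid):
        X_mod = X.copy()
        X_mod[feature] = value                           # overwrite the feature everywhere
        curves[:, k] = (model.predict(X_mod) - y) ** 2 - base_loss
    return curves                                        # shape (n_observations, n_grid)

def pi_curve(ici):
    """Partial Importance curve: pointwise average of the ICI curves."""
    return ici.mean(axis=0)
```

Averaging the PI curve over the observed distribution of the feature (for example, by using the observed feature values as the grid) then approximates a global, permutation-style importance for that feature.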

Quantitative Assertions and Theoretical Implications

A key result of the paper is the relationship among these quantities: averaging the ICI curves across observations yields the PI curve, and integrating the PI curve with respect to the distribution of the considered feature recovers the global feature importance. This provides a coherent quantitative framework for understanding how individual observations contribute to overall model behavior. Moreover, the Shapley feature importance measure allows feature importance to be compared across different models, which is particularly useful for cross-model analyses.
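As a rough illustration of the game-theoretic idea (not the paper's exact value function), the sketch below approximates Shapley feature importance by Monte Carlo: features enter in random orders, and each feature is credited with its average marginal change in model performance. The helper `perf`, and its treatment of features outside the current coalition by permuting their columns, are assumptions of this sketch.

```python
# Monte Carlo sketch of Shapley feature importance (illustration only).
import numpy as np

rng = np.random.default_rng(0)

def perf(model, X, y, coalition):
    """Performance (negative MSE) when only features in `coalition` keep their
    information; all other columns are permuted to break their link to the target."""
    X_mod = X.copy()
    for col in X.columns:
        if col not in coalition:
            X_mod[col] = rng.permutation(X_mod[col].values)
    return -np.mean((model.predict(X_mod) - y) ** 2)

def shapley_feature_importance(model, X, y, n_orders=50):
    """Average marginal contribution of each feature over random feature orderings."""
    phi = {f: 0.0 for f in X.columns}
    for _ in range(n_orders):
        coalition = set()
        previous = perf(model, X, y, coalition)
        for f in rng.permutation(list(X.columns)):
            coalition.add(f)
            current = perf(model, X, y, coalition)
            phi[f] += (current - previous) / n_orders    # marginal contribution of f
            previous = current
    return phi
```

Because the contributions of all features sum, in expectation, to the performance gap between the full model and the uninformative baseline, the resulting scores live on a common scale and can be compared across models.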

Simulation results in the paper demonstrate the usefulness of the PI and ICI plots. They show that these techniques can reveal interactions among features that remain hidden when relying solely on a single global feature importance value; for instance, in the simulation study, examining the conditional feature importance uncovers such interactions.
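A small toy simulation (my own data-generating process, not the one used in the paper) illustrates the point: a feature involved in an interaction produces ICI curves that fan out across observations, whereas a purely additive feature yields nearly parallel curves.

```python
# Toy simulation: heterogeneous ICI curves hint at a feature interaction (illustration only).
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(1)
n = 500
X = pd.DataFrame({"x1": rng.uniform(-1, 1, n),
                  "x2": rng.choice([-1.0, 1.0], n),
                  "x3": rng.uniform(-1, 1, n)})
y = X["x1"] * X["x2"] + 0.5 * X["x3"] + rng.normal(0, 0.1, n)   # x1 interacts with x2

model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)

# ICI curves for x1: change in per-observation squared loss as x1 is set to grid values.
base_loss = (model.predict(X) - y) ** 2
curves = []
for value in np.quantile(X["x1"], np.linspace(0, 1, 20)):
    X_mod = X.copy()
    X_mod["x1"] = value
    curves.append((model.predict(X_mod) - y) ** 2 - base_loss)
curves = np.column_stack(curves)

# Spread across observations: large values suggest x1's importance depends on other features.
print("mean spread of ICI curves for x1:", curves.std(axis=0).mean())
```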

Practical Implications and Future Directions

From a practical standpoint, the paper's contributions enable data scientists and AI researchers to identify key features driving model decisions and develop more interpretable AI systems. This supports improved accountability and transparency in machine learning applications. The methodology extends beyond simple visualization, suggesting comprehensive examination tools that assist in evaluating and refining models based on nuanced feature interactions.

Looking ahead, possibilities for further research include aggregating local feature importance to assess the importance of individual observations within datasets, potentially linking this work with clustering techniques or anomaly detection strategies. Additionally, integrating Shapley values into the broader landscape of model evaluation and extending these concepts to more complex models and larger datasets promise fruitful avenues for exploration.

Conclusion

This paper contributes to machine learning interpretability through PI and ICI plots and the application of Shapley value principles, offering significant insights and tools for model evaluation and increasing the accessibility and transparency of traditionally opaque models. The implications are both theoretical and practical, pointing to directions for future research and applications in fields that require a deep understanding of, and transparency in, predictive modeling.