Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution (2405.09800v1)

Published 16 May 2024 in cs.LG, cs.HC, and math.DG

Abstract: In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models. We particularly address two predominant challenges associated with IG: the generation of noisy feature visualizations for vision models and the vulnerability to adversarial attributional attacks. Our approach involves an adaptation of path-based feature attribution, aligning the path of attribution more closely to the intrinsic geometry of the data manifold. Our experiments utilise deep generative models applied to several real-world image datasets. They demonstrate that IG along the geodesics conforms to the curved geometry of the Riemannian data manifold, generating more perceptually intuitive explanations and, consequently, substantially increasing robustness to targeted attributional attacks.
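For context on the method being adapted: Integrated Gradients attributes a prediction F(x) by accumulating gradients along a path from a baseline x' to the input x, with the i-th attribution given by (x_i − x'_i) ∫₀¹ ∂F(x' + α(x − x'))/∂x_i dα, conventionally evaluated along the straight line between the two points. Below is a minimal Riemann-sum sketch of this vanilla straight-line IG for a PyTorch image classifier; the `model`, `baseline`, `target`, and `steps` names are illustrative assumptions, and the paper's actual contribution, replacing the straight-line path with a geodesic on the learned data manifold, is only indicated in the comments, not implemented.

```python
import torch

def integrated_gradients(model, x, baseline, target, steps=64):
    """Riemann-sum approximation of vanilla Integrated Gradients
    (Sundararajan et al., 2017) along the straight line from `baseline`
    to `x`. The paper's method would instead follow a geodesic of the
    Riemannian data manifold (e.g. computed via a deep generative
    model's latent space); that step is not implemented here."""
    # Interpolation coefficients alpha in [0, 1], one per path point.
    alphas = torch.linspace(0.0, 1.0, steps).view(-1, *([1] * x.dim()))
    # Straight-line path x' + alpha * (x - x'); a geodesic variant
    # would generate these intermediate points differently.
    path = baseline.unsqueeze(0) + alphas * (x - baseline).unsqueeze(0)
    path.requires_grad_(True)
    # Target-class score at every point on the path.
    scores = model(path)[:, target]
    grads = torch.autograd.grad(scores.sum(), path)[0]
    # Average path gradients, scaled by the input-baseline difference
    # (completeness: attributions sum to roughly F(x) - F(baseline)).
    return (x - baseline) * grads.mean(dim=0)
```

With a zero baseline, `integrated_gradients(model, x, torch.zeros_like(x), target)` reproduces the standard IG saliency map whose noisiness and attack vulnerability the paper addresses.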

