
Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks (2403.08652v1)

Published 13 Mar 2024 in cs.LG and stat.ML

Abstract: Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically justified task confidence. In mission-critical applications, it is important to understand both the associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient in both memory and computation, and can be applied to any black-box DNN without retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset and show that it can significantly improve the interpretability and reliability of DNNs.
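The abstract describes wrapping a fixed black-box DNN with a Bayesian model to obtain uncertainty estimates without retraining. The paper's reference list centers on sparse Gaussian processes, so one plausible reading is that a GP is fit to features extracted from the frozen network, with predictive variance serving as the uncertainty signal. The sketch below is an illustrative, simplified version of that idea (exact GP regression on stand-in features, not the paper's actual method); the feature data, targets, and hyperparameters are all assumptions for demonstration.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0):
    """Squared-exponential kernel between rows of A and rows of B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

def gp_predict(X_train, y_train, X_test, noise=1e-2, lengthscale=1.0):
    """Exact GP regression: predictive mean and variance at X_test."""
    K = rbf_kernel(X_train, X_train, lengthscale) + noise * np.eye(len(X_train))
    Ks = rbf_kernel(X_train, X_test, lengthscale)
    Kss = rbf_kernel(X_test, X_test, lengthscale)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = Ks.T @ alpha
    v = np.linalg.solve(L, Ks)
    var = np.diag(Kss) - (v * v).sum(0)
    return mean, var

# Toy "features": stand-ins for a frozen DNN's penultimate-layer activations.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(50, 2))
y_train = np.sin(X_train[:, 0])                # illustrative target
X_test = np.array([[0.0, 0.0], [10.0, 10.0]])  # near vs. far from training data
mean, var = gp_predict(X_train, y_train, X_test)
# Predictive variance grows for inputs far from the training features,
# which is how a GP wrapper can flag out-of-distribution inputs.
```

The key property exploited here is that GP predictive variance reverts to the prior away from the training data, so anomalous or out-of-distribution inputs receive high uncertainty even though the underlying DNN is never retrained.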
