Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks (2410.18705v2)
Abstract: Concept-based machine learning methods have increasingly gained importance due to the growing interest in making neural networks interpretable. However, concept annotations are generally challenging to obtain, making it crucial to leverage all their prior knowledge. By creating concept-enriched models that incorporate concept information into existing architectures, we exploit their interpretable capabilities to the fullest extent. In particular, we propose Concept-Guided Conditional Diffusion, which can generate visual representations of concepts, and Concept-Guided Prototype Networks, which can create a concept prototype dataset and leverage it to perform interpretable concept prediction. These results open up new lines of research by exploiting pre-existing information in the quest for rendering machine learning more human-understandable.
- \APACrefYearMonthDay2019. \BBOQ\APACrefatitleThis Looks Like That: Deep Learning for Interpretable Image Recognition This looks like that: Deep learning for interpretable image recognition.\BBCQ \BIn \APACrefbtitle33rd Conference on Neural Information Processing Systems. 33rd conference on neural information processing systems. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2020. \BBOQ\APACrefatitleConcept whitening for interpretable image recognition Concept whitening for interpretable image recognition.\BBCQ \APACjournalVolNumPagesNature Machine Intelligence212772–782. {APACrefURL} https://doi.org/10.1038/s42256-020-00265-z \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \BBOQ\APACrefatitleDiffusion Models Beat GANs on Image Synthesis Diffusion models beat gans on image synthesis.\BBCQ \BIn \APACrefbtitleAdvances in Neural Information Processing Systems Advances in neural information processing systems (\BVOL 34, \BPGS 8780–8794). \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2017\APACmonth03. \APACrefbtitleTowards A Rigorous Science of Interpretable Machine Learning Towards A Rigorous Science of Interpretable Machine Learning (\BNUM arXiv:1702.08608). \APACaddressPublisherarXiv. {APACrefDOI} \doi10.48550/arXiv.1702.08608 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \BBOQ\APACrefatitleConcept embedding models: Beyond the accuracy-explainability trade-off Concept embedding models: Beyond the accuracy-explainability trade-off.\BBCQ \BIn \APACrefbtitleAdvances in Neural Information Processing Systems Advances in neural information processing systems (\BVOL 35, \BPGS 21400–21413). \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2019. \BBOQ\APACrefatitleTowards Automatic Concept-based Explanations Towards automatic concept-based explanations.\BBCQ \BIn H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox\BCBL \BBA R. Garnett (\BEDS), \APACrefbtitleAdvances in Neural Information Processing Systems Advances in neural information processing systems (\BVOL 32, \BPG 9277–-9286). \APACaddressPublisherCurran Associates, Inc. {APACrefURL} https://proceedings.neurips.cc/paper_files/paper/2019/file/77d2afcb31f6493e350fca61764efb9a-Paper.pdf \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2024. \BBOQ\APACrefatitleMake a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation Make a cheap scaling: A self-cascade diffusion model for higher-resolution adaptation.\BBCQ \APACjournalVolNumPagesarXiv preprint arxiv:2402.10491. {APACrefURL} https://arxiv.org/abs/2402.10491 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \BBOQ\APACrefatitleAddressing Leakage in Concept Bottleneck Models Addressing leakage in concept bottleneck models.\BBCQ \BIn A\BPBIH. Oh, A. Agarwal, D. Belgrave\BCBL \BBA K. Cho (\BEDS), \APACrefbtitleAdvances in Neural Information Processing Systems. Advances in neural information processing systems. {APACrefURL} https://openreview.net/forum?id=tglniD_fn9 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2020. \BBOQ\APACrefatitleDenoising Diffusion Probabilistic Models Denoising diffusion probabilistic models.\BBCQ \APACjournalVolNumPagesarXiv preprint arxiv:2006.11239. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \BBOQ\APACrefatitleClassifier-Free Diffusion Guidance Classifier-free diffusion guidance.\BBCQ \BIn \APACrefbtitleNeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications. Neurips 2021 workshop on deep generative models and downstream applications. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2024. \BBOQ\APACrefatitleConcept Bottleneck Generative Models Concept bottleneck generative models.\BBCQ \BIn \APACrefbtitleThe Twelfth International Conference on Learning Representations. The twelfth international conference on learning representations. {APACrefURL} https://openreview.net/forum?id=L9U5MJJleF \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2017. \APACrefbtitleCategorical Reparameterization with Gumbel-Softmax. Categorical reparameterization with gumbel-softmax. {APACrefURL} https://arxiv.org/abs/1611.01144 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2018. \BBOQ\APACrefatitleInterpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV) Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav).\BBCQ \BIn J. Dy \BBA A. Krause (\BEDS), \APACrefbtitleProceedings of the 35th International Conference on Machine Learning Proceedings of the 35th international conference on machine learning (\BVOL 80, \BPG 2668-2677). \APACaddressPublisherPMLR. {APACrefURL} http://proceedings.mlr.press/v80/kim18d.html \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2020. \BBOQ\APACrefatitleConcept Bottleneck Models Concept bottleneck models.\BBCQ \BIn H\BPBID. III \BBA A. Singh (\BEDS), \APACrefbtitleProceedings of the 37th International Conference on Machine Learning Proceedings of the 37th international conference on machine learning (\BVOL 119, \BPGS 5338–5348). \APACaddressPublisherVirtualPMLR. {APACrefURL} https://proceedings.mlr.press/v119/koh20a.html \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2009. \BBOQ\APACrefatitleAttribute and simile classifiers for face verification Attribute and simile classifiers for face verification.\BBCQ \BIn \APACrefbtitle2009 IEEE 12th International Conference on Computer Vision 2009 ieee 12th international conference on computer vision (\BPGS 365–372). \APACaddressPublisherKyoto, JapanIEEE. {APACrefURL} https://doi.org/10.1109/ICCV.2009.5459250 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2024. \BBOQ\APACrefatitleBeyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? Beyond concept bottleneck models: How to make black boxes intervenable?\BBCQ \APACjournalVolNumPagesarXiv preprint arXiv:2401.13544. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2009. \BBOQ\APACrefatitleLearning to detect unseen object classes by between-class attribute transfer Learning to detect unseen object classes by between-class attribute transfer.\BBCQ \BIn \APACrefbtitle2009 IEEE Conference on Computer Vision and Pattern Recognition. 2009 IEEE conference on computer vision and pattern recognition. \APACaddressPublisherMiami, FL, USAIEEE. {APACrefURL} https://doi.org/10.1109/CVPR.2009.5206594 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2018. \BBOQ\APACrefatitleDeep learning for case-based reasoning through prototypes: a neural network that explains its predictions Deep learning for case-based reasoning through prototypes: a neural network that explains its predictions.\BBCQ \BIn \APACrefbtitleProceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence. Proceedings of the thirty-second aaai conference on artificial intelligence and thirtieth innovative applications of artificial intelligence conference and eighth aaai symposium on educational advances in artificial intelligence. \APACaddressPublisherAAAI Press. \PrintBackRefs\CurrentBib
- \APACinsertmetastarliptonMythosModelInterpretability2016{APACrefauthors}Lipton, Z\BPBIC. \APACrefYearMonthDay2016\APACmonth06. \BBOQ\APACrefatitleThe Mythos of Model Interpretability The Mythos of Model Interpretability.\BBCQ \APACjournalVolNumPagesCommunications of the ACM611035–43. {APACrefDOI} \doi10.48550/arxiv.1606.03490 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2023. \BBOQ\APACrefatitleThis Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations This looks like those: Illuminating prototypical concepts using multiple visualizations.\BBCQ \BIn \APACrefbtitleThirty-seventh Conference on Neural Information Processing Systems. Thirty-seventh conference on neural information processing systems. {APACrefURL} https://openreview.net/forum?id=dCAk9VlegR \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2017. \APACrefbtitleThe Concrete Distribution: A Continuous Relaxation of Discrete Random Variables. The concrete distribution: A continuous relaxation of discrete random variables. {APACrefURL} https://arxiv.org/abs/1611.00712 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \APACrefbtitlePromises and Pitfalls of Black-Box Concept Learning Models. Promises and pitfalls of black-box concept learning models. {APACrefURL} https://doi.org/10.48550/arXiv.2106.13314 \APACrefnotearXiv:2106.13314 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \APACrefbtitleGlanceNets: Interpretable, Leak-proof Concept-based Models. GlanceNets: Interpretable, leak-proof concept-based models. \APACrefnotearXiv:2205.15612 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \APACrefbtitleDo Concept Bottleneck Models Learn as Intended? Do concept bottleneck models learn as intended? {APACrefURL} https://doi.org/10.48550/arXiv.2105.04289 \APACrefnotearXiv:2105.04289 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \APACrefbtitleNeural Prototype Trees for Interpretable Fine-grained Image Recognition. Neural prototype trees for interpretable fine-grained image recognition. {APACrefURL} https://arxiv.org/abs/2012.02046 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2023. \BBOQ\APACrefatitleLabel-free Concept Bottleneck Models Label-free concept bottleneck models.\BBCQ \BIn \APACrefbtitleThe Eleventh International Conference on Learning Representations. The eleventh international conference on learning representations. {APACrefURL} https://openreview.net/forum?id=FlCg47MNvBA \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021. \APACrefbtitleHigh-Resolution Image Synthesis with Latent Diffusion Models. High-resolution image synthesis with latent diffusion models. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \BBOQ\APACrefatitleInterpretable image classification with differentiable prototypes assignment Interpretable image classification with differentiable prototypes assignment.\BBCQ \BIn \APACrefbtitleEuropean Conference on Computer Vision European conference on computer vision (\BPGS 351–368). \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021\APACmonth08. \BBOQ\APACrefatitleProtoPShare: Prototypical Parts Sharing for Similarity Discovery in Interpretable Image Classification Protopshare: Prototypical parts sharing for similarity discovery in interpretable image classification.\BBCQ \BIn \APACrefbtitleProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Proceedings of the 27th acm sigkdd conference on knowledge discovery and data mining. \APACaddressPublisherACM. {APACrefURL} http://dx.doi.org/10.1145/3447548.3467245 {APACrefDOI} \doi10.1145/3447548.3467245 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \BBOQ\APACrefatitleConcept Bottleneck Model With Additional Unsupervised Concepts Concept bottleneck model with additional unsupervised concepts.\BBCQ \APACjournalVolNumPagesIEEE Access1041758–41765. {APACrefURL} https://doi.org/10.1109/ACCESS.2022.3167702 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay201507–09 Jul. \BBOQ\APACrefatitleDeep Unsupervised Learning using Nonequilibrium Thermodynamics Deep unsupervised learning using nonequilibrium thermodynamics.\BBCQ \BIn \APACrefbtitleProceedings of the 32nd International Conference on Machine Learning Proceedings of the 32nd international conference on machine learning (\BVOL 37, \BPGS 2256–2265). \APACaddressPublisherPMLR. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2022. \BBOQ\APACrefatitleDenoising Diffusion Implicit Models Denoising diffusion implicit models.\BBCQ \APACjournalVolNumPagesarXiv preprint arxiv:2010.02502. \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2019. \BBOQ\APACrefatitleGenerative Modeling by Estimating Gradients of the Data Distribution Generative modeling by estimating gradients of the data distribution.\BBCQ \BIn \APACrefbtitleAdvances in Neural Information Processing Systems Advances in neural information processing systems (\BPGS 11895–11907). \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2024. \BBOQ\APACrefatitleStochastic Concept Bottleneck Models Stochastic concept bottleneck models.\BBCQ \BIn \APACrefbtitleICML 2024 Workshop on Structured Probabilistic Inference & Generative Modeling. Icml 2024 workshop on structured probabilistic inference & generative modeling. {APACrefURL} https://openreview.net/forum?id=8jG3Y0xX7b \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2011. \BBOQ\APACrefatitleThe caltech-ucsd birds-200-2011 dataset The caltech-ucsd birds-200-2011 dataset.\BBCQ \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2019sep. \BBOQ\APACrefatitleZero-Shot Learning—A Comprehensive Evaluation of the Good, the Bad and the Ugly Zero-shot learning—a comprehensive evaluation of the good, the bad and the ugly.\BBCQ \APACjournalVolNumPagesIEEE Transactions on Pattern Analysis & Machine Intelligence41092251-2265. {APACrefDOI} \doi10.1109/TPAMI.2018.2857768 \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2023. \BBOQ\APACrefatitlePost-hoc Concept Bottleneck Models Post-hoc concept bottleneck models.\BBCQ \BIn \APACrefbtitleThe 11th International Conference on Learning Representations. The 11th international conference on learning representations. {APACrefURL} https://openreview.net/forum?id=nA5AZ8CEyow \PrintBackRefs\CurrentBib
- \APACrefYearMonthDay2021May. \BBOQ\APACrefatitleInvertible Concept-based Explanations for CNN Models with Non-negative Concept Activation Vectors Invertible concept-based explanations for cnn models with non-negative concept activation vectors.\BBCQ \APACjournalVolNumPagesProceedings of the AAAI Conference on Artificial Intelligence351311682-11690. {APACrefURL} https://ojs.aaai.org/index.php/AAAI/article/view/17389 {APACrefDOI} \doi10.1609/aaai.v35i13.17389 \PrintBackRefs\CurrentBib