Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models (2403.18996v1)

Published 27 Mar 2024 in cs.CV

Abstract: Explaining deep learning models is becoming increasingly important as new multimodal models emerge daily, particularly in safety-critical domains like medical imaging. However, the lack of detailed investigation into how explainability methods perform on these models is widening the gap between their development and safe deployment. In this work, we analyze the performance of various explainable AI methods on a vision-language model, MedCLIP, to demystify its inner workings. We also provide a simple methodology for overcoming the shortcomings of these methods. Our work offers a new perspective on the explainability of a recent, well-known VLM in the medical domain, and our assessment method generalizes to other current and future VLMs.
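The kind of analysis the abstract describes, attributing an image-text similarity score back to the pixels that drive it, can be sketched with a toy stand-in model. The snippet below is a minimal illustration, not MedCLIP's actual architecture: it uses a hypothetical linear "image encoder" and central finite differences in NumPy to produce a per-pixel saliency map for a similarity score (in practice one would use a gradient-based attribution library on the real encoders).

```python
import numpy as np

def similarity(image, text_emb, W):
    # Toy image-text similarity: linear image encoder W,
    # then a dot product with the text embedding.
    return text_emb @ (W @ image.ravel())

def saliency_map(image, text_emb, W, eps=1e-4):
    # Central finite differences: how much each pixel
    # perturbation moves the similarity score.
    flat = image.ravel().copy()
    grads = np.empty_like(flat)
    for i in range(flat.size):
        orig = flat[i]
        flat[i] = orig + eps
        hi = text_emb @ (W @ flat)
        flat[i] = orig - eps
        lo = text_emb @ (W @ flat)
        flat[i] = orig  # restore the pixel
        grads[i] = (hi - lo) / (2 * eps)
    return grads.reshape(image.shape)

rng = np.random.default_rng(0)
image = rng.random((8, 8))         # stand-in for a chest X-ray
text_emb = rng.random(4)           # stand-in for an encoded report
W = rng.random((4, image.size))    # hypothetical linear "image encoder"

sal = saliency_map(image, text_emb, W)
```

Because the toy model is linear, the finite-difference map coincides with the exact gradient `(text_emb @ W)` reshaped to the image; for real VLMs the same idea is applied via autograd rather than perturbation.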

Authors (3)
  1. Anees Ur Rehman Hashmi
  2. Dwarikanath Mahapatra
  3. Mohammad Yaqub
Citations (2)