Learning Generalized Medical Image Representations through Image-Graph Contrastive Pretraining (2405.09594v1)

Published 15 May 2024 in eess.IV, cs.CV, and cs.LG

Abstract: Medical image interpretation using deep learning has shown promise but often requires extensive expert-annotated datasets. To reduce this annotation burden, we develop an Image-Graph Contrastive Learning framework that pairs chest X-rays with structured report knowledge graphs automatically extracted from radiology notes. Our approach uniquely encodes the disconnected graph components via a relational graph convolution network and transformer attention. In experiments on the CheXpert dataset, this novel graph encoding strategy enabled the framework to outperform existing methods that use image-text contrastive learning in 1% linear evaluation and few-shot settings, while achieving comparable performance to radiologists. By exploiting unlabeled paired images and text, our framework demonstrates the potential of structured clinical insights to enhance contrastive learning for medical images. This work points toward reducing demands on medical experts for annotations, improving diagnostic precision, and advancing patient care through robust medical image understanding.

View on arXiv

References (39)

Authors (4)

Sameer Khanna (4 papers)
Daniel Michael (1 paper)
Pranav Rajpurkar (69 papers)
Marinka Zitnik (79 papers)

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/CIGX/status/1791385900607246688

Learning Generalized Medical Image Representations through Image-Graph Contrastive Pretraining (2405.09594v1)

Summary

Related Papers

Tweets