Mining Gaze for Contrastive Learning toward Computer-Assisted Diagnosis (2312.06069v2)

Published 11 Dec 2023 in cs.CV

Abstract: Obtaining large-scale radiology reports can be difficult for medical images due to various reasons, limiting the effectiveness of contrastive pre-training in the medical image domain and underscoring the need for alternative methods. In this paper, we propose eye-tracking as an alternative to text reports, as it allows for the passive collection of gaze signals without disturbing radiologists' routine diagnostic process. By tracking the gaze of radiologists as they read and diagnose medical images, we can understand their visual attention and clinical reasoning. When a radiologist exhibits similar gaze patterns on two medical images, this may indicate semantic similarity for diagnosis, and these images should be treated as positive pairs when pre-training a computer-assisted diagnosis (CAD) network through contrastive learning. Accordingly, we introduce Medical contrastive Gaze Image Pre-training (McGIP) as a plug-and-play module for contrastive learning frameworks. McGIP uses radiologists' gaze to guide contrastive pre-training. We evaluate our method using two representative types of medical images and two common types of gaze data. The experimental results demonstrate the practicality of McGIP, indicating its high potential for various clinical scenarios and applications.
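The core idea, per the abstract, is to mine positive pairs for contrastive learning from gaze similarity rather than from text reports. A minimal sketch of that pairing step is below, assuming gaze data has already been summarized as per-image fixation heatmaps; the function names, cosine-similarity measure, and threshold are illustrative choices, not the paper's exact implementation (the paper evaluates multiple gaze-similarity measures):

```python
import numpy as np

def heatmap_similarity(h1, h2):
    """Cosine similarity between two flattened gaze heatmaps.

    A stand-in for any gaze-similarity measure (e.g., heatmap moments
    or scanpath comparison could be substituted here).
    """
    a, b = h1.ravel(), h2.ravel()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def positive_pair_mask(heatmaps, threshold=0.95):
    """Boolean matrix: entry (i, j) is True when images i and j have
    similar enough gaze patterns to count as a positive pair during
    contrastive pre-training."""
    n = len(heatmaps)
    mask = np.eye(n, dtype=bool)  # each image is always its own positive
    for i in range(n):
        for j in range(i + 1, n):
            if heatmap_similarity(heatmaps[i], heatmaps[j]) >= threshold:
                mask[i, j] = mask[j, i] = True
    return mask
```

In a framework such as SimCLR or MoCo, this mask would replace the default "only the augmented view is positive" assumption: entries marked True are pulled together in the contrastive loss, which is what makes the module plug-and-play.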

Authors (4)
  1. Zihao Zhao
  2. Sheng Wang
  3. Qian Wang
  4. Dinggang Shen
Citations (3)