Improving Zero-Shot Detection of Low Prevalence Chest Pathologies using Domain Pre-trained Language Models (2306.08000v1)

Published 13 Jun 2023 in physics.med-ph, cs.CL, cs.CV, cs.LG, and eess.IV

Abstract: Recent advances in zero-shot learning have made it possible to train on paired image-text data instead of structured labels, removing the need for expert-annotated datasets. Models such as the CLIP-based CheXzero apply these advances to chest X-ray interpretation. We hypothesize that domain pre-trained models such as CXR-BERT, BlueBERT, and ClinicalBERT can improve the performance of CLIP-like models by contributing domain-specific knowledge when their weights replace the BERT weights, at the cost of breaking the original model's alignment. We evaluate zero-shot classification models with domain-specific pre-training on the detection of low-prevalence pathologies. Although replacing the weights of the original CLIP-BERT degrades performance on common pathologies, we show that pre-trained text towers perform substantially better on low-prevalence diseases. This motivates future ensemble models that combine differently trained LLMs for maximal performance.
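
To make the zero-shot setup concrete, here is a minimal sketch of how a CLIP-like model can score one pathology from embeddings alone. The abstract does not spell out the scoring rule, so the positive/negative prompt softmax below is an assumption in the style of CLIP-type zero-shot classifiers, and the averaging ensemble is only one possible reading of the "ensemble models" the abstract motivates. All function names and the toy vectors are hypothetical; in practice the embeddings would come from the image tower and from a text tower such as the original CLIP-BERT or a domain pre-trained replacement.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def zero_shot_probability(image_emb, pos_text_emb, neg_text_emb, temperature=1.0):
    """Softmax over similarities to a positive and a negative prompt.

    Assumption: the positive prompt names the pathology and the negative
    prompt negates it; the source does not specify the prompt wording.
    """
    pos = cosine(image_emb, pos_text_emb) / temperature
    neg = cosine(image_emb, neg_text_emb) / temperature
    m = max(pos, neg)  # subtract the max for numerical stability
    e_pos = math.exp(pos - m)
    e_neg = math.exp(neg - m)
    return e_pos / (e_pos + e_neg)


def ensemble_probability(probabilities):
    """Hypothetical ensemble: plain average of per-model probabilities."""
    return sum(probabilities) / len(probabilities)


# Toy embeddings standing in for real tower outputs (illustrative only).
image = [0.9, 0.1, 0.0]
pos_prompt = [1.0, 0.0, 0.0]
neg_prompt = [0.0, 1.0, 0.0]

p = zero_shot_probability(image, pos_prompt, neg_prompt)
print(p > 0.5)  # the image sits closer to the positive prompt
```

Swapping in a different text tower changes only how the prompt embeddings are produced. The scoring step stays the same, which is why an image tower aligned to the original text encoder can lose accuracy when the text weights are replaced.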

Authors (5)

Aakash Mishra
Rajat Mittal
Christy Jestin
Kostas Tingos
Pranav Rajpurkar
