Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge (2310.16112v2)
Abstract: Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
- Gregory Holste (10 papers)
- Yiliang Zhou (11 papers)
- Song Wang (313 papers)
- Ajay Jaiswal (35 papers)
- Mingquan Lin (19 papers)
- Sherry Zhuge (1 paper)
- Yuzhe Yang (43 papers)
- Dongkyun Kim (5 papers)
- Trong-Hieu Nguyen-Mau (3 papers)
- Minh-Triet Tran (69 papers)
- Jaehyup Jeong (1 paper)
- Wongi Park (5 papers)
- Jongbin Ryu (12 papers)
- Feng Hong (18 papers)
- Arsh Verma (5 papers)
- Yosuke Yamagishi (9 papers)
- Changhyun Kim (4 papers)
- Hyeryeong Seo (1 paper)
- Myungjoo Kang (45 papers)
- Leo Anthony Celi (49 papers)