Large-scale Long-tailed Disease Diagnosis on Radiology Images (2312.16151v3)
Abstract: Developing a generalist radiology diagnosis system can greatly enhance clinical diagnostics. In this paper, we introduce RadDiag, a foundational model supporting 2D and 3D inputs across various modalities and anatomies, using a transformer-based fusion module for comprehensive disease diagnosis. Due to patient privacy concerns and the lack of large-scale radiology diagnosis datasets, we utilize high-quality, clinician-reviewed radiological images available online with diagnosis labels. Our dataset, RP3D-DiagDS, contains 40,936 cases with 195,010 scans covering 5,568 disorders (930 unique ICD-10-CM codes). Experimentally, our RadDiag achieves 95.14% AUC on internal evaluation with the knowledge-enhancement strategy. Additionally, RadDiag can be zero-shot applied or fine-tuned to external diagnosis datasets sourced from various hospitals, demonstrating state-of-the-art results. In conclusion, we show that publicly shared medical data on the Internet is a tremendous and valuable resource that can potentially support building a generalist AI for healthcare.
- ICD10. https://www.icd10data.com/ICD10CM/Codes.
- Kaggle: Brain mri scans for brain tumor classification. https://www.kaggle.com/datasets/shreyag1103/brain-mri-scans-for-brain-tumor-classification.
- Kaggle: Brain tumor mri dataset. https://www.kaggle.com/dsv/2645886.
- Kaggle: Brain tumor mri images 17 classes. https://www.kaggle.com/datasets/fernando2rad/brain-tumor-mri-images-17-classes.
- Radiopaedia. https://radiopaedia.org.
- Dataset of breast ultrasound images. Data In Brief, 28:104863, 2020.
- The lung image database consortium (lidc) and image database resource initiative (idri): a completed reference database of lung nodules on ct scans. Medical Physics, 38(2):915–931, 2011.
- Aucreshaping: improved sensitivity at high-specificity. Scientific Reports, 13(1):21097, 2023.
- Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of mrnet. PLoS Medicine, 15(11), 2018.
- POCOVID-Net: automatic detection of COVID-19 from a new lung ultrasound imaging dataset (POCUS). arXiv preprint arXiv:2004.12084, 2020.
- Andrew P Bradley. The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7):1145–1159, 1997.
- Padchest: A large chest x-ray image dataset with multi-label annotated reports. Medical Image Analysis, 66:101797, 2020.
- Deep learning system to screen coronavirus disease 2019 pneumonia. Applied Intelligence (Dordrecht, Netherlands), page 1, 2020.
- Breast microcalcification diagnosis using deep convolutional neural network from digital mammograms. Computational and Mathematical Methods in Medicine, 2019, 2019.
- Kai Cao et al. Large-scale pancreatic cancer detection via non-contrast ct and deep learning. Nature Medicine, 2023.
- Monai: An open-source framework for deep learning in healthcare. arXiv preprint arXiv:2211.02701, 2022.
- Enhanced performance of brain tumor classification via tumor region augmentation and partition. PloS One, 10(10):e0140381, 2015.
- The advantages of the matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation. BMC Genomics, 21(1):1–13, 2020.
- Can ai help in screening viral and covid-19 pneumonia? IEEE Access, 8:132665–132676, 2020.
- Transmed: Transformers advance multi-modal medical image classification. Diagnostics, 11(8):1384, 2021.
- Brain tumor classification using deep cnn features via transfer learning. Computers in Biology and Medicine, 111:103345, 2019.
- Preparing a collection of radiology examinations for distribution and retrieval. Journal of the American Medical Informatics Association, 23(2):304–310, 2016.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- A multiple resampling method for learning from imbalanced data sets. Computational Intelligence, 20(1):18–36, 2004.
- Crowdsourcing pneumothorax annotations using machine learning annotations on the nih chest x-ray dataset. Journal of Digital Imaging, 33:490–496, 2020.
- COVID-VIT: Classification of COVID-19 from CT chest images based on vision transformer models. arXiv preprint arXiv:2107.01682, 2021.
- Vision transformers for classification of breast ultrasound images. In 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), pages 480–483. IEEE, 2022.
- Lvis: A dataset for large vocabulary instance segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
- A hybrid cnn-glcm classifier for detection and grade classification of brain tumor. Brain Imaging and Behavior, 16(3):1410–1427, 2022.
- Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, 2015.
- JF Healthcare. Object-CXR - Automatic detection of foreign objects on chest X-rays. 2020.
- The digital database for screening mammography. In Proceedings of the Fifth International Workshop on Digital Mammography, pages 212–218. Medical Physics Publishing, 2001.
- Intracranial hemorrhage segmentation using a deep convolutional model. Data, 5(1), 2020.
- CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 590–597, 2019.
- The alzheimer’s disease neuroimaging initiative (adni): Mri methods. Journal of Magnetic Resonance Imaging: An Official Journal of the International Society for Magnetic Resonance in Medicine, 27(4):685–691, 2008.
- Two public chest x-ray datasets for computer-aided screening of pulmonary diseases. Quantitative Imaging in Medicine and Surgery, 4(6):475, 2014.
- Medcpt: Contrastive pre-trained transformers with large-scale pubmed search logs for zero-shot biomedical information retrieval. Bioinformatics, 39(11):btad651, 2023.
- Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Scientific Data, 6(1):317, 2019.
- Select atrophied regions in alzheimer disease (sara): An improved volumetric model for identifying alzheimer disease dementia. NeuroImage: Clinical, 26:102248, 2020.
- Mia-cov19d: Covid-19 detection through 3-d chest ct image analysis. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 537–544, 2021.
- Residual and plain convolutional neural networks for 3D brain MRI classification. In Proceedings-International Symposium on Biomedical Imaging, pages 835–838, 2017.
- Michał Koziarski. Radial-based undersampling for imbalanced data classification. Pattern Recognition, 102:107262, 2020.
- Oasis-3: Longitudinal neuroimaging, clinical, and cognitive dataset for normal aging and alzheimer disease. medRxiv, 2019.
- A curated mammography data set for use in computer-aided detection and diagnosis research. Scientific Data, 4(1):1–9, 2017.
- Equalized focal loss for dense long-tailed object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6990–6999, 2022.
- Artificial intelligence distinguishes COVID-19 from community acquired pneumonia on chest CT. Radiology, 2020.
- Focal loss for dense object detection. IEEE International Conference on Computer Vision (ICCV), pages 2999–3007, 2017.
- Automatic diagnosis of covid-19 using a tailored transformer-like network. In Journal of Physics: Conference Series, volume 2010, page 012175. IOP Publishing, 2021.
- Exploratory undersampling for class-imbalance learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 39(2):539–550, 2008.
- Anna Majkowska et al. Chest radiograph interpretation with deep learning models: assessment with radiologist-adjudicated reference standards and population-adjusted evaluation. Radiology, 294(2):421–431, 2020.
- Open Access Series of Imaging Studies (OASIS): Cross-sectional MRI Data in Young, Middle Aged, Nondemented, and Demented Older Adults. Journal of Cognitive Neuroscience, 19(9):1498–1507, 09 2007.
- Is it time to replace cnns with transformers for medical images? arXiv preprint arXiv:2108.09038, 2021.
- Radimagenet: an open radiologic deep learning research dataset for effective transfer learning. Radiology: Artificial Intelligence, 4(5):e210315, 2022.
- Low-dose ct image and projection dataset. Medical Physics, 48(2):902–911, 2021.
- xViTCOS: explainable vision transformer based COVID-19 screening using radiography. IEEE Journal of Translational Engineering in Health and Medicine, 10:1–10, 2021.
- Med-Flamingo: A Multimodal Medical Few-shot Learner. arXiv preprint arXiv:2307.15189, July 2023.
- Mosmeddata: Chest ct scans with covid-19 related findings dataset. arXiv preprint arXiv:2005.06465, 2020.
- Sayyed Mostafa Mostafavi. COVID19-CT-Dataset: An Open-Access Chest CT Image Repository of 1000+ Patients with Confirmed COVID-19 Diagnosis. 2021.
- VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations. Scientific Data, 9(1):429, 2022.
- VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography. Scientific Data, 10(1):277, 2023.
- VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography. PhysioNet, 2022.
- VinDr-SpineXR: A deep learning framework for spinal lesions detection and classification from radiographs. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part V 24, pages 291–301. Springer, 2021.
- VinDr-PCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children. PhysioNet, 2022.
- OpenAI. GPT-4 Technical Report. arXiv preprint arXiv:2303.08774, 2023.
- Vision transformer for covid-19 cxr diagnosis using chest x-ray feature corpus. arXiv preprint arXiv:2103.07055, 2021.
- Maya Pavlova et al. Covid-net cxr-2: An enhanced deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. Frontiers in Medicine, 9, 2022.
- Pocformer: A lightweight transformer architecture for detection of covid-19 using point of care ultrasound. In 2021 IEEE international conference on image processing (ICIP), pages 195–199. IEEE, 2021.
- Asymmetric loss for multi-label classification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 82–91, 2021.
- Multi-grade brain tumor classification using deep cnn with extensive data augmentation. Journal of Computational Science, 30:174–182, 2019.
- TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification. Advances in Neural Information Processing Systems, 34:2136–2147, 2021.
- Augmenting the national institutes of health chest radiograph dataset with expert annotations of possible pneumonia. Radiology: Artificial Intelligence, 1(1):e180041, 2019.
- Covid-transformer: Interpretable covid-19 detection using vision transformer for healthcare. International Journal of Environmental Research and Public Health, 18(21):11086, 2021.
- Brain tumor classification for mr images using transfer learning and fine-tuning. Computerized Medical Imaging and Graphics, 75:34–46, 2019.
- One click lesion recist measurement and segmentation on ct scans. In Medical Image Computing and Computer Assisted Intervention (MICCAI), 2020.
- Towards generalist biomedical ai. arXiv preprint arXiv:2307.14334, 2023.
- Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2097–2106, 2017.
- Self-consistency improves chain of thought reasoning in language models. arXiv preprint arXiv:2203.11171, 2022.
- Integrating features from lymph node stations for metastatic lymph node detection. Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society, 101:102108, 2022.
- K-diag: Knowledge-enhanced disease diagnosis in radiographic imaging. Medical Image Computing and Computer Assisted Intervention – MICCAI Workshop, 2023.
- MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training. arXiv preprint arXiv:2301.02228, 2023.
- Towards generalist foundation model for radiology by leveraging web-scale 2d&3d medical data. IEEE International Conference on Computer Vision (ICCV), 2023.
- Deeplesion: Automated deep mining, categorization and detection of significant radiology image findings using large-scale clinical lesion annotations. arXiv preprint arXiv:1710.01766, 2017.
- Mil-vt: Multiple instance learning enhanced vision transformer for fundus image classification. In Medical Image Computing and Computer Assisted Intervention(MICCAI), 2021.
- Knowledge-enhanced pre-training for auto-diagnosis of chest radiology images. Nature Communications, 2023.
- Pmc-vqa: Visual instruction tuning for medical visual question answering. arXiv preprint arXiv:2305.10415, 2023.
- mmformer: Multimodal medical transformer for incomplete multimodal learning of brain tumor segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 107–117. Springer, 2022.
- A graph-transformer for whole slide image classification. IEEE Transactions on Medical Imaging, 41(11):3003–3015, 2022.
- Qiaoyu Zheng (4 papers)
- Weike Zhao (6 papers)
- Chaoyi Wu (24 papers)
- Xiaoman Zhang (31 papers)
- Ya Zhang (222 papers)
- Yanfeng Wang (211 papers)
- Weidi Xie (132 papers)
- Lisong Dai (7 papers)
- Hengyu Guan (1 paper)
- Yuehua Li (6 papers)