PadChest: A large chest x-ray image dataset with multi-label annotated reports (1901.07441v2)

Published 22 Jan 2019 in eess.IV and cs.CV

Abstract: We present a labeled large-scale, high resolution chest x-ray dataset for the automated exploration of medical images along with their associated reports. This dataset includes more than 160,000 images obtained from 67,000 patients that were interpreted and reported by radiologists at Hospital San Juan Hospital (Spain) from 2009 to 2017, covering six different position views and additional information on image acquisition and patient demography. The reports were labeled with 174 different radiographic findings, 19 differential diagnoses and 104 anatomic locations organized as a hierarchical taxonomy and mapped onto standard Unified Medical Language System (UMLS) terminology. Of these reports, 27% were manually annotated by trained physicians and the remaining set was labeled using a supervised method based on a recurrent neural network with attention mechanisms. The labels generated were then validated in an independent test set achieving a 0.93 Micro-F1 score. To the best of our knowledge, this is one of the largest public chest x-ray database suitable for training supervised models concerning radiographs, and the first to contain radiographic reports in Spanish. The PadChest dataset can be downloaded from http://bimcv.cipf.es/bimcv-projects/padchest/.

Citations (541)

View on Semantic Scholar

Summary

The paper introduces PadChest, a vast chest x-ray dataset featuring manual and RNN-based multi-label annotations, achieving a Micro-F1 score of 0.93.
The methodology integrates 27% expert-verified labels with automated recurrent neural network techniques to ensure precise and reliable annotations.
This dataset empowers AI diagnostic systems by providing detailed anatomical, radiographic, and clinical context for enhanced model training.

Overview of PadChest: A Comprehensive Chest X-Ray Dataset

The paper introduces PadChest, a substantial dataset comprising high-resolution chest x-ray images paired with multi-label annotated reports, targeting automated exploration of medical imaging. The dataset is compiled from 160,000+ images of 67,000 patients, spanning 2009-2017 at Hospital San Juan. The annotations encompass 174 radiographic findings, 19 differential diagnoses, and 104 anatomical locations, systematically mapped to the UMLS terminology. A significant 27% of these annotations are manually conducted by trained physicians, while the rest are automatically labeled using a recurrent neural network with attention mechanisms, achieving a Micro-F1 score of 0.93.

Dataset Structure and Methodology

PadChest stands out due to its expansive coverage and Spanish-language reports, a first in the publicly shared chest x-ray domain. The dataset is structured to include metadata such as patient demographics, image acquisition contexts, and projection types, enabling refined training environments for AI models in medical imaging. The automated annotations were achieved through a recurrent neural network paradigm augmented with attention mechanisms, validated rigorously to ensure reliability.

Relevance in Current AI Research

This dataset confronts critical challenges in the medical AI field, notably the need for large-scale, high-quality annotated data. While other datasets like ChestX-Ray8 and ChestX-Ray14 exist, PadChest offers more comprehensive labeling, translating nuanced clinical language into machine-readable formats. The implementation of hierarchical taxonomies to organize findings, diagnoses, and anatomical details further distinguishes PadChest from predecessors by offering more granular training and retrieval options.

Strong Results and Implications

The paper presents a robust methodology for dataset curation and validation. The impressive Micro-F1 score signifies not only the precision of the labeling process but also the high potential for PadChest to serve as a foundational resource in training AI models for diagnostic tasks. This precision is vital in the medical field where diagnostic accuracy equates to patient safety and care quality.

Potential Impact and Future Directions

Practically, PadChest could revolutionize the deployment of diagnostic decision support systems, reducing radiologist workload and error rates, and potentially enhancing early detection of thoracic diseases. Theoretically, it opens avenues for future research in AI that blends multimodal data more effectively. The balanced approach of combining manual and automated report annotations sets a precedent for future datasets aiming for accuracy at scale.

Challenges and Considerations

Despite its potential, the dataset does come with limitations, such as potential biases inherent in a single-institution paper and the reliance on reports for ground truth, which may omit some radiographic findings due to routine clinical practice nuances. Researchers utilizing PadChest should consider these factors in their experiments and develop strategies to manage such biases.

Conclusions

PadChest is a pivotal contribution to medical imaging research, offering an extensive, well-annotated dataset that extends beyond previous works with its size, labeling depth, and language inclusion. By bridging gaps in existing resources, PadChest stands to catalyze significant advancements in AI-driven medical diagnostics, offering avenues for more personalized and precise healthcare solutions. Researchers are encouraged to leverage this dataset alongside others to improve model generalization and explore sophisticated neural network architectures tailored to medical imaging challenges.

PDF Markdown