- The paper presents a deep convolutional neural network approach for classifying medical CT images into five specific anatomical regions (neck, lungs, liver, pelvis, legs).
- The ConvNet achieved a 5.9% classification error and a 0.998 mean AUC, with data augmentation substantially improving performance over a non-augmented baseline.
- Anatomy-specific classification using deep learning offers a robust initial step for computer-aided diagnosis and detection systems, potentially enhancing clinical workflows.
Anatomy-Specific Classification of Medical Images Using Deep Convolutional Nets
In the paper titled "Anatomy-Specific Classification of Medical Images Using Deep Convolutional Nets," the authors present a convolutional neural network (ConvNet) approach for classifying medical images by anatomical region. The work highlights the potential of deep learning to support computer-aided diagnosis by automating the sorting of computed tomography (CT) images into specific anatomical categories.
Methodology and Implementation
The authors employ ConvNets, leveraging their ability to learn hierarchical features directly from data and thereby bypassing the traditional reliance on hand-crafted features. The ConvNet classifies CT images into five anatomical classes: neck, lungs, liver, pelvis, and legs. The model is trained on 4,298 axial 2D key-images drawn from 1,675 patients, with ground-truth labels derived from radiological reports and DICOM tags.
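The paper does not include code, but the idea can be illustrated with a minimal PyTorch sketch of a small ConvNet that maps a single-channel axial CT slice to one of the five anatomical classes. The layer widths and the 256×256 input size are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

# Illustrative five-class anatomy classifier for single-channel axial CT slices.
# Layer widths and the 256x256 input size are assumptions, not the paper's exact net.
CLASSES = ["neck", "lungs", "liver", "pelvis", "legs"]

class AnatomyNet(nn.Module):
    def __init__(self, num_classes: int = len(CLASSES)):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),   # 256 -> 128
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 128 -> 64
            nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2), # 64 -> 32
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(128 * 32 * 32, 256), nn.ReLU(), nn.Dropout(0.5),
            nn.Linear(256, num_classes),  # logits; softmax is applied in the loss
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))

# Example: one grayscale 256x256 slice -> five class logits.
logits = AnatomyNet()(torch.randn(1, 1, 256, 256))
print(logits.shape)  # torch.Size([1, 5])
```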
A significant aspect of this paper is the application of data augmentation. The augmentation applies spatial transformations such as random translations, rotations, and non-rigid deformations (thin-plate-spline interpolations), which increase the variety of the training data and help mitigate overfitting, yielding a more robust classifier.
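As an illustration of this kind of augmentation pipeline (not the authors' exact parameters), the sketch below uses torchvision's RandomAffine for random translations and rotations, and ElasticTransform as a stand-in for the thin-plate-spline deformations described in the paper.

```python
import torch
from torchvision import transforms

# Illustrative augmentation pipeline; the ranges are assumptions, not the paper's settings.
augment = transforms.Compose([
    transforms.RandomAffine(
        degrees=10,            # random rotation of up to +/-10 degrees
        translate=(0.1, 0.1),  # random shift of up to 10% of width/height
    ),
    # ElasticTransform (torchvision >= 0.13) is used here as a stand-in for the
    # paper's thin-plate-spline (non-rigid) deformations.
    transforms.ElasticTransform(alpha=30.0, sigma=5.0),
])

slice_tensor = torch.rand(1, 256, 256)  # one grayscale CT key-image
augmented = augment(slice_tensor)       # a new, randomly deformed variant
print(augmented.shape)                  # torch.Size([1, 256, 256])
```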
Results
The ConvNet achieved a classification error rate of 5.9% and an average area-under-the-curve (AUC) of 0.998 on the test set. Data augmentation notably improved performance, reducing the classification error from a baseline of 9.6% and raising the mean AUC from 0.994. The confusion matrices reported in the paper show per-class performance and confirm a marked reduction in misclassifications after augmentation.
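The two reported metrics, classification error and mean per-class AUC, can be computed from model outputs along the lines of the scikit-learn sketch below; the labels and probabilities here are dummy placeholders, not the paper's data or evaluation code.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# y_true: integer class labels; y_prob: softmax probabilities with shape (n_samples, 5).
# Dummy values stand in for a real test set.
y_true = np.array([0, 1, 2, 3, 4, 1, 2])
y_prob = np.eye(5)[y_true] * 0.9 + 0.02  # mostly-correct dummy probabilities (rows sum to 1)

# Classification error: fraction of samples whose argmax prediction is wrong.
error = np.mean(y_prob.argmax(axis=1) != y_true)

# Mean one-vs-rest AUC over the five classes, analogous to the paper's 0.998 figure.
mean_auc = roc_auc_score(y_true, y_prob, multi_class="ovr", average="macro")

print(f"classification error: {error:.3f}, mean AUC: {mean_auc:.3f}")
```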
The t-distributed stochastic neighbor embedding (t-SNE) visualization revealed distinct separation among most classes, indicating that the ConvNet learned class-discriminative features. Nevertheless, lung and liver images overlapped in regions near the diaphragm, highlighting room for improvement.
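A t-SNE projection of this kind can be produced along the lines of the scikit-learn sketch below; random vectors stand in for the ConvNet's penultimate-layer activations, and the perplexity value is an illustrative choice rather than the paper's setting.

```python
import numpy as np
from sklearn.manifold import TSNE

# Features would normally come from the ConvNet's penultimate layer;
# random vectors are used here only to keep the sketch self-contained.
rng = np.random.default_rng(0)
features = rng.normal(size=(500, 256))   # 500 images x 256-dim feature vectors
labels = rng.integers(0, 5, size=500)    # anatomical class of each image

# Project to 2-D for visualization.
embedding = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(features)
print(embedding.shape)  # (500, 2) -- plot these points colored by `labels`
```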
Implications and Future Directions
The paper demonstrates that ConvNets can classify medical images into anatomical classes with high accuracy, which directly benefits computer-aided detection (CADe) and diagnosis (CADx) systems. Anatomy classification could serve as an initial step ahead of more specialized imaging analyses, potentially improving diagnostic workflows in clinical settings, as illustrated by the sketch below.
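One way to realize this first-stage role, sketched here with hypothetical placeholder functions not taken from the paper, is to route each image to an organ-specific analysis module based on the predicted anatomy.

```python
# Hypothetical routing layer: the anatomy classifier's prediction selects which
# organ-specific CADe/CADx module (placeholder functions here) processes the image.
def analyze_lungs(image): ...   # e.g., lung-nodule detection (placeholder)
def analyze_liver(image): ...   # e.g., liver-lesion detection (placeholder)

ROUTES = {"lungs": analyze_lungs, "liver": analyze_liver}

def process(image, predicted_class: str):
    handler = ROUTES.get(predicted_class)
    return handler(image) if handler else None  # classes without a module are skipped
```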
Further work could introduce additional anatomical classes, refine the augmentation to more accurately reflect real-world anatomical variation, and extend the methodology to other imaging modalities. Using ConvNet classifiers as a foundational stage in computer-aided diagnostic systems is a promising avenue for future research, potentially expanding toward broader disease detection and complication assessment.
Overall, the framework and results of this work help establish deep learning architectures as viable, powerful tools for medical imaging, setting a precedent for continued exploration and application across the field.