Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration (1612.04891v1)

Published 15 Dec 2016 in stat.ML, cs.CV, and cs.LG

Abstract: Objective: The advent of Electronic Medical Records (EMR) with large electronic imaging databases along with advances in deep neural networks with machine learning has provided a unique opportunity to achieve milestones in automated image analysis. Optical coherence tomography (OCT) is the most commonly obtained imaging modality in ophthalmology and represents a dense and rich dataset when combined with labels derived from the EMR. We sought to determine if deep learning could be utilized to distinguish normal OCT images from images from patients with Age-related Macular Degeneration (AMD). Methods: Automated extraction of an OCT imaging database was performed and linked to clinical endpoints from the EMR. OCT macula scans were obtained by Heidelberg Spectralis, and each OCT scan was linked to EMR clinical endpoints extracted from EPIC. The central 11 images were selected from each OCT scan of two cohorts of patients: normal and AMD. Cross-validation was performed using a random subset of patients. Area under receiver operator curves (auROC) were constructed at an independent image level, macular OCT level, and patient level. Results: Of an extraction of 2.6 million OCT images linked to clinical datapoints from the EMR, 52,690 normal and 48,312 AMD macular OCT images were selected. A deep neural network was trained to categorize images as either normal or AMD. At the image level, we achieved an auROC of 92.78% with an accuracy of 87.63%. At the macula level, we achieved an auROC of 93.83% with an accuracy of 88.98%. At a patient level, we achieved an auROC of 97.45% with an accuracy of 93.45%. Peak sensitivity and specificity with optimal cutoffs were 92.64% and 93.69% respectively. Conclusions: Deep learning techniques are effective for classifying OCT images. These findings have important implications in utilizing OCT in automated screening and computer aided diagnosis tools.

Authors (3)

Cecilia S. Lee (5 papers)
Doug M. Baughman (1 paper)
Aaron Y. Lee (5 papers)

Citations (302)

View on Semantic Scholar

Summary

Evaluation of Deep Learning for Automated Classification of OCT Images in AMD Detection

The research paper under discussion investigates the use of deep learning methodologies to differentiate between normal optical coherence tomography (OCT) images and those indicative of age-related macular degeneration (AMD). Given the burgeoning volume of digital imaging, particularly OCT in ophthalmology, the paper leverages expansive datasets to train a convolutional neural network (CNN) for automated image classification. The findings articulate significant implications for the integration of deep learning models in computer-aided diagnosis (CAD) systems, particularly in the management and assessment of AMD.

Research Methodology and Model Architecture

The authors conducted an EMR and OCT database paper using data from the University of Washington's ophthalmology department. Specifically, macular OCT scans were automatically extracted from Heidelberg Spectralis systems and linked to clinical endpoints in a structured EMR. A cohort of OCT images from patients diagnosed with AMD and normal controls was defined based on clinical criteria, resulting in a dataset composed of over 100,000 individual images.

A modified VGG16 architecture was employed, initialized using the Xavier algorithm, trained using stochastic gradient descent with validation to monitor performance and prevent overfitting. The training dataset comprised a random selection of patients with the validation dataset ensuring exclusive non-overlapping patient images. The primary evaluation metric was the area under the ROC (AUROC), analyzed at the image, macula, and patient levels.

Quantitative Results

The neural network achieved considerable performance across various classification levels:

Image Level: The model attained an AUROC of 92.78% and an accuracy of 87.63%. Sensitivity and specificity were reported at 84.63% and 91.54%, respectively, with optimal cut-offs increasing both metrics to roughly 87%.
Macular Level: By averaging results across images from a single macular scan, the AUROC increased to 93.83%, with an improved accuracy of 88.98%. Sensitivity and specificity also registered heightened values, indicating enhanced diagnostic reliability.
Patient Level: The model achieved its highest performance at the patient aggregation level, with an AUROC of 97.45% and an exceptional accuracy of 93.45%. This level of analysis revealed peak sensitivity and specificity of 92.64% and 93.69%, respectively.

Implications and Future Directions

The paper suggests that the incorporation of deep learning algorithms into clinical ophthalmological practices could significantly streamline the screening process, potentially alleviating clinician workload through automated diagnosis. Given its high sensitivity and specificity, such systems could be especially useful in identifying potential cases of AMD, allowing for earlier intervention and optimized patient care strategies.

The application of a large labelled dataset for training underscores the critical role of comprehensive, high-quality data in the performance of deep learning models, a point that remains a bottleneck in the broader application of these techniques. The research highlights potential expansions, including training and validating models across diverse populations and using different OCT devices to enhance generalizability.

Moreover, this work lays a foundation for extending deep learning applications across other ocular diseases detectable via OCT, such as diabetic retinopathy or retinal vein occlusions. By showcasing an occlusion test to ascertain which OCT regions the model prioritized, the paper suggests potential for further exploration into visual explainability of AI models, thereby fostering trust and facilitating integration into software governing clinical decision-making.

Conclusion

This paper illustrates the efficacy of deep learning models in differentiating OCT images of normal retinas from those affected by AMD, with promising accuracy and reliability. The approach underscores a growing convergence of AI and medical imaging, pointing towards a future where robust computational models can elevate diagnostic precision in clinical ophthalmic practice. Future research should aim to refine these models across varied clinical settings and diseases, ultimately enriching the efficacy and utility of CAD systems.

PDF Markdown

Related Papers

Find Related Papers