Deep Learning based Computer-Aided Diagnosis Systems for Diabetic Retinopathy: A Survey (1811.01238v2)

Published 3 Nov 2018 in cs.CV

Abstract: Diabetic retinopathy (DR) results in vision loss if not treated early. A computer-aided diagnosis (CAD) system based on retinal fundus images is an efficient and effective method for early DR diagnosis and assisting experts. A computer-aided diagnosis (CAD) system involves various stages like detection, segmentation and classification of lesions in fundus images. Many traditional machine-learning (ML) techniques based on hand-engineered features have been introduced. The recent emergence of deep learning (DL) and its decisive victory over traditional ML methods for various applications motivated the researchers to employ it for DR diagnosis, and many deep-learning-based methods have been introduced. In this paper, we review these methods, highlighting their pros and cons. In addition, we point out the challenges to be addressed in designing and learning about efficient, effective and robust deep-learning algorithms for various problems in DR diagnosis and draw attention to directions for future research.

Authors (4)

Norah Asiri (4 papers)
Muhammad Hussain (20 papers)
Fadwa Al Adel (3 papers)
Nazih Alzaidi (1 paper)

Citations (215)

View on Semantic Scholar

Summary

The paper reviews deep learning architectures for DR diagnosis, emphasizing CNN-based classification and segmentation tasks.
It highlights the use of public datasets and performance metrics such as accuracy, AUC, and sensitivity for evaluating models.
The survey outlines challenges in model generalizability and interpretability, advocating advanced training strategies and augmentation techniques.

Deep Learning-based Computer-Aided Diagnosis Systems for Diabetic Retinopathy: A Survey

The paper "Deep Learning based Computer-Aided Diagnosis Systems for Diabetic Retinopathy: A Survey" presents a comprehensive review of the current landscape in utilizing deep learning (DL) for the diagnosis of diabetic retinopathy (DR). The work is crucial as DR is a leading cause of blindness, especially among the working-age population, and early detection is paramount for preventing severe vision loss. With the obstacles presented by an increasing number of potential DR patients and limited specialized human resources, the development of automated diagnosis methods is becoming increasingly significant.

The authors categorize the existing research into several tasks essential to DR diagnosis: retinal blood vessel segmentation, optic disc (OD) localization and segmentation, lesion detection and classification (primarily focusing on lesions like microaneurysms, hemorrhages, and exudates), and image-level DR diagnosis for referral. This segmentation is logical since each task addresses a critical component in the DR diagnostic pipeline.

Convolutional Neural Networks (CNNs)

CNNs are the most frequently utilized deep learning architectures for these tasks. They prove powerful in analyzing retinal images by automatically learning feature representations at different abstraction levels. Specific models, such as those leveraging AlexNet, VGGNet, Inception, and ResNet, have been adopted due to their successful application in various computer vision tasks.

Research has shown CNNs' effectiveness in not only handling image classification challenges—like determining the severity of DR—but also in intricate segmentation tasks crucial for delineating the retinal vasculature and diagnosing DR-related lesions. Some notable works employ CNNs in innovative architectures incorporating fully connected layers or adversarial learning to improve boundary detection in vessel segmentation or have utilized CNNs to fuse multi-level contextual information effectively.

Other Deep Learning Architectures

Beyond CNNs, other architectures like autoencoders and deep belief networks (DBNs) have also been explored. These architectures, albeit less popular than CNNs, demonstrate promise in tasks like feature extraction and dimensionality reduction, contributing to DR diagnosis systems' depth and breadth in a complementary manner.

Performance Metrics and Public Datasets

To evaluate these DL models, a variety of datasets and performance metrics are employed. Datasets such as MESSIDOR, e-ophtha, and Kaggle's DR dataset provide a wide range of annotated images that facilitate the training and testing of models developed using DL. Metrics commonly used include accuracy, AUC, sensitivity, specificity, F-scores, among others, which help assess models' efficacy across different DR diagnostic tasks.

Challenges and Future Directions

Despite the promising results, challenges remain, particularly concerning the generalizability and robustness of these DL-based systems across diverse populations and imaging conditions. The paper highlights the challenge of acquiring large, labeled datasets vital for training deep models effectively and the potential of exploring novel augmentation strategies or leveraging GANs to synthesize realistic images for training enhancements. Another critical area is addressing the interpretability of DL models, crucial for gaining trust in clinical settings.

Conclusion

In conclusion, this survey offers an in-depth exploration of the advancements in deep learning for computer-aided diagnosis of diabetic retinopathy. The review identifies both the progress made and existing gaps in current research, fostering further advancements. It is evident that while significant progress has been made in developing automated techniques for DR diagnosis using deep learning, multi-faceted research endeavors are required to tackle the existing challenges, ensure robustness and scalability, and seamlessly integrate these tools into clinical practice to achieve equitable patient outcomes globally.

PDF Markdown