ReLayNet: Retinal Layer and Fluid Segmentation of Macular Optical Coherence Tomography using Fully Convolutional Network (1704.02161v2)

Published 7 Apr 2017 in cs.CV

Abstract: Optical coherence tomography (OCT) is used for non-invasive diagnosis of diabetic macular edema assessing the retinal layers. In this paper, we propose a new fully convolutional deep architecture, termed ReLayNet, for end-to-end segmentation of retinal layers and fluid masses in eye OCT scans. ReLayNet uses a contracting path of convolutional blocks (encoders) to learn a hierarchy of contextual features, followed by an expansive path of convolutional blocks (decoders) for semantic segmentation. ReLayNet is trained to optimize a joint loss function comprising of weighted logistic regression and Dice overlap loss. The framework is validated on a publicly available benchmark dataset with comparisons against five state-of-the-art segmentation methods including two deep learning based approaches to substantiate its effectiveness.

Authors (7)

Abhijit Guha Roy (28 papers)
Sailesh Conjeti (19 papers)
Sri Phani Krishna Karri (2 papers)
Debdoot Sheet (32 papers)
Amin Katouzian (7 papers)
Christian Wachinger (64 papers)
Nassir Navab (459 papers)

Citations (510)

View on Semantic Scholar

Summary

The paper introduces a fully convolutional network with a U-Net architecture that enhances retinal layer and fluid segmentation, achieving a Dice score of 0.77.
It integrates weighted logistic regression with Dice loss to address class imbalance and improve boundary delineation in OCT image analysis.
Experimental results on DME patient scans show ReLayNet outperforming 9 out of 10 segmentation tasks, advancing diagnostic accuracy in ophthalmology.

ReLayNet: Semantic Segmentation for Retinal OCT Images

The paper "ReLayNet: Retinal Layer and Fluid Segmentation of Macular Optical Coherence Tomography using Fully Convolutional Networks" proposes an advanced fully convolutional network architecture for the segmentation of retinal layers and fluid masses in Optical Coherence Tomography (OCT) scans. This work stands out due to its utilization of a fully convolutional approach, specifically tailored to address the challenges in the domain of retinal OCT image analysis.

Methodology and Architecture

ReLayNet employs a U-Net like encoder-decoder architecture that is designed to operate end-to-end for image segmentation. The encoder path consists of multiple convolutional blocks that progressively capture hierarchical and contextual information, whereas the decoder path reconstructs the image at the pixel level to produce segmentation outputs. Notably, this architecture incorporates skip connections to enhance spatial detail retention and gradient propagation during training. Further advancements are realized through the use of unpooling layers, as opposed to interpolation techniques, thereby maintaining higher spatial consistency particularly crucial for accurately delineating retinal layers.

The network is trained with a composite loss function, integrating weighted logistic regression with a Dice loss. This formulation is purposeful; it mitigates class imbalance issues and specifically penalizes errors along layer boundaries, facilitating robust model convergence.

Experimental Validation

The efficacy of ReLayNet is demonstrated through comprehensive validation against a publicly accessible benchmark dataset of OCT scans from diabetic macular edema (DME) patients. Comparative evaluations include several state-of-the-art methodologies both from traditional graph-based approaches and other deep learning architectures. Quantitative metrics such as Dice score, mean absolute distance of layer thickness (MAD-LT), and contour estimation (CE) errors indicate the superior performance of ReLayNet, particularly in handling the complex stratification of retinal layers and fluid segmentation—a notable contrast against methods that do not account for fluid interaction.

Particularly highlighted is ReLayNet's capability to improve fluid segmentation over previous deep learning methods, marked by a Dice score of 0.77, and achieving state-of-the-art performance for 9 out of 10 segmentation tasks.

Practical and Theoretical Implications

The implications of this research span both technical and practical dimensions. Technically, the integration of Dice loss alongside logistic regression in the network’s loss function framework represents a significant stride in model training over imbalanced datasets. Practically, ReLayNet’s performance in real-time segmentation is noteworthy, implicating its potential to assist clinical diagnostics and treatment monitoring for conditions like diabetic retinopathy.

The architecture also opens pathways for future exploration into three-dimensional extensions and adaptation for intraoperative scenarios where OCT imaging is gaining traction.

Speculations and Future Directions

Considering the rapid evolution of AI applications in medical imaging, future adaptations of ReLayNet could focus on enhancing 3D segmentation capabilities to accommodate volumetric data, aiming to preserve consistency between consecutive slices. The integration of additional data-driven techniques, including self-supervised or few-shot learning paradigms, could tackle the data scarcity issues often encountered in medical datasets.

Overall, this paper offers meaningful contributions towards automated OCT image analysis, facilitating the development of intelligent systems geared towards improving diagnostic accuracy and operational efficiency in ophthalmology. ReLayNet’s architecture paves the way for more sophisticated, precise, and scalable deep learning approaches in medical imaging.

PDF Markdown