- The paper presents PraNet, a novel network that integrates a parallel partial decoder with reverse attention to enhance polyp segmentation accuracy.
- It refines segmentation by aggregating high-level features via a Res2Net backbone and iteratively enhancing boundaries through reverse attention modules.
- PraNet outperforms state-of-the-art models on five colonoscopy datasets, achieving higher scores on standard segmentation metrics and real-time inference (~50fps) for improved CRC screening.
An In-Depth Analysis of PraNet: Parallel Reverse Attention Network for Polyp Segmentation
Overview
The paper introduces PraNet, a deep neural network designed specifically for segmenting polyps in colonoscopy images. Polyp segmentation is a critical task in preventing colorectal cancer (CRC), one of the most common cancers worldwide. Accurate segmentation aids in the early detection and treatment of polyps, potentially preventing them from developing into cancerous growths. The central contribution of this research is a novel Parallel Reverse Attention Network (PraNet) that addresses the two inherent challenges in polyp segmentation: variability in polyp appearance and indistinct boundaries between polyps and the surrounding mucosa.
Methodology
PraNet employs a unique approach by integrating a Parallel Partial Decoder (PPD) with Reverse Attention (RA) modules, resulting in a network capable of precise polyp detection and segmentation. The PPD aggregates high-level features from a Res2Net backbone, generating a global map that serves as initial guidance for subsequent processing. The RA modules then refine this map by focusing on boundary cues, using a novel erasing mechanism that progressively enhances segmentation accuracy.
Key steps in the PraNet framework include:
- Feature Aggregation via Parallel Partial Decoder: This component captures critical high-level features by leveraging a parallel partial decoder mechanism. By operating on high-level features only, it reduces computational overhead without sacrificing detail.
- Reverse Attention Modules: These modules iteratively refine the segmentation by erasing the currently predicted foreground from the attention map, forcing the network to attend to the remaining regions, particularly boundaries, and thereby recover polyp structures that the coarse prediction missed.
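The erasing mechanism behind the RA modules can be illustrated with a minimal pure-Python sketch (the paper's implementation operates on PyTorch tensors; the function name and list-based data layout here are illustrative only):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def reverse_attention(side_feature, coarse_map):
    """Apply reverse attention to one side-output feature map.

    side_feature: list of channels, each an H x W nested list of floats.
    coarse_map:   H x W nested list of logits from the coarser prediction.

    The reversed weight 1 - sigmoid(logit) erases regions the network is
    already confident about, shifting focus to missed areas and boundaries.
    """
    weight = [[1.0 - sigmoid(v) for v in row] for row in coarse_map]
    return [
        [[f * w for f, w in zip(frow, wrow)]
         for frow, wrow in zip(channel, weight)]
        for channel in side_feature
    ]
```

Where the coarse map is confidently foreground (large positive logits), the reversed weight approaches zero, so the side features there are suppressed and the next prediction stage concentrates on the uncertain boundary regions instead.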
The learning process of PraNet is driven by a weighted combination of IoU and binary cross-entropy losses, applying deep supervision across multiple side-outputs to ensure robust training.
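The combined objective can be sketched as follows. This is a simplified, unweighted pure-Python version: PraNet's actual loss uses boundary-aware *weighted* BCE and weighted IoU terms and is computed on tensors, and the function name here is illustrative:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def structure_loss(logits, mask, eps=1e-8):
    """BCE + IoU loss for one flattened prediction map.

    logits, mask: flat lists of equal length; mask entries are 0 or 1.
    The boundary-aware pixel weights used in the paper are dropped here
    to keep the sketch minimal.
    """
    probs = [sigmoid(z) for z in logits]
    # Binary cross-entropy, averaged over pixels.
    bce = -sum(m * math.log(p + eps) + (1 - m) * math.log(1 - p + eps)
               for p, m in zip(probs, mask)) / len(mask)
    # Soft IoU loss: 1 - intersection / union on the probability map.
    inter = sum(p * m for p, m in zip(probs, mask))
    union = sum(p + m - p * m for p, m in zip(probs, mask))
    iou = 1.0 - (inter + eps) / (union + eps)
    return bce + iou
```

Under deep supervision, this loss is evaluated on each side-output (upsampled to the ground-truth resolution) and the per-output losses are summed into the total training objective.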
Experiments and Results
The efficacy of PraNet was validated through extensive experiments on five datasets: Kvasir, CVC-ClinicDB, CVC-ColonDB, ETIS, and EndoScene. Numerical results showcase PraNet's superior performance compared to state-of-the-art models, with substantial improvements across six metrics: mean Dice, mean IoU, weighted F-measure (Fβw), structure measure (Sα), enhanced-alignment measure (Eϕmax), and mean absolute error (MAE).
For example, on the Kvasir dataset, PraNet achieved a mean Dice of 0.898 and an IoU of 0.840, significantly outperforming existing methods such as U-Net, U-Net++, and SFA. Similar trends were observed on CVC-ClinicDB (also known as CVC-612), where PraNet recorded a mean Dice of 0.899 and IoU of 0.849. The robustness of PraNet was further affirmed by its generalizability to unseen datasets, where it maintained high performance and low error rates (e.g., 0.709 mean Dice on CVC-ColonDB and 0.628 on ETIS).
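For reference, the two headline metrics are straightforward to compute per image. A minimal pure-Python sketch (evaluation pipelines typically binarize the prediction first and then average these scores over the test set):

```python
def dice_and_iou(pred, mask, eps=1e-8):
    """Dice and IoU for one flattened binary prediction/mask pair.

    pred, mask: flat lists with entries in {0, 1}.
    Dice = 2|P∩G| / (|P| + |G|); IoU = |P∩G| / |P∪G|.
    """
    inter = sum(p * m for p, m in zip(pred, mask))
    dice = (2.0 * inter + eps) / (sum(pred) + sum(mask) + eps)
    iou = (inter + eps) / (sum(pred) + sum(mask) - inter + eps)
    return dice, iou
```

Note that Dice is always at least as large as IoU for the same prediction, which is consistent with the reported pairs (e.g., 0.898 Dice vs. 0.840 IoU on Kvasir).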
Implications and Future Prospects
The practical implications of PraNet are profound. By providing real-time segmentation capabilities with high accuracy (∼50fps), PraNet presents an efficient solution that can be integrated into clinical workflows, potentially enhancing CRC screening and polyp removal procedures. The reduced training time and efficient inference capability make it a viable option for deployment in diverse clinical settings.
Theoretically, the innovative use of PPD and RA in PraNet sets a new standard in medical image segmentation. The parallel processing of features and the progressive refinement strategy pioneered in this research could inspire further advancements in related tasks, such as lung infection segmentation and classification.
Future developments might explore augmenting PraNet with more sophisticated modules or extend its application to other medical imaging challenges. The flexibility and universality of the PraNet architecture suggest a potential for broader impacts, contributing to the advancement of automated medical image analysis.
Conclusion
The paper successfully demonstrates that PraNet is a highly effective, efficient, and robust solution for polyp segmentation in colonoscopy images. By addressing the critical challenges in this domain with innovative architectural components, PraNet not only surpasses current state-of-the-art methods but also lays the groundwork for future research and applications in medical imaging. The detailed quantitative and qualitative analyses fortify its claims, marking a significant step forward in the computational approach to cancer prevention and treatment.