Overview of Self-Ensembled, Deeply-Supervised 3D U-Net Neural Networks for Brain Tumor Segmentation
The research paper addresses a pertinent problem in medical imaging: the segmentation of brain tumors from MRI scans, posed as part of the BraTS 2020 challenge. The authors propose a solution based on 3D U-Net convolutional neural networks (CNNs) in a self-ensembled, deeply-supervised configuration, an architecture family well established for semantic segmentation in medical imaging.
The methodology involves training multiple U-Net models on the BraTS 2020 dataset in two independent pipelines. Key techniques include deep supervision, stochastic weight averaging, and test-time augmentation. Each pipeline's ensemble produced its own label maps, which were then merged into refined brain tumor segmentations covering three tumor subregions: the enhancing tumor (ET), the whole tumor (WT), and the tumor core (TC). The approach achieved Dice scores of 0.79 (ET), 0.89 (WT), and 0.84 (TC) on the test dataset, placing the authors among the top-performing teams in the challenge.
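As a concrete illustration of the deep-supervision component, the sketch below attaches auxiliary Dice losses to decoder outputs at several resolutions, so gradients reach intermediate layers directly. This is a minimal PyTorch sketch under assumed tensor shapes; the function names and loss weights are illustrative, not the authors' actual code.

```python
import torch
import torch.nn.functional as F

def soft_dice_loss(logits, target, eps=1.0):
    """Soft Dice loss over the spatial axes of [B, C, D, H, W] tensors.
    Sigmoid activations suit BraTS's overlapping subregions (ET/WT/TC)."""
    probs = torch.sigmoid(logits)
    dims = (2, 3, 4)  # spatial dimensions
    inter = (probs * target).sum(dims)
    denom = probs.sum(dims) + target.sum(dims)
    return (1.0 - (2.0 * inter + eps) / (denom + eps)).mean()

def deep_supervision_loss(aux_logits, target, weights=(1.0, 0.5, 0.25)):
    """Weighted sum of losses over decoder outputs, finest resolution first.
    `aux_logits` is a list of [B, C, D_i, H_i, W_i] tensors; `target` is a
    one-hot label map at full resolution."""
    total = 0.0
    for logits, w in zip(aux_logits, weights):
        # Downsample the target to each auxiliary output's resolution.
        t = F.interpolate(target, size=logits.shape[2:], mode="nearest")
        total = total + w * soft_dice_loss(logits, t)
    return total
```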
Methodological Insights
The authors experimented with network architectures while maintaining a foundational 3D U-Net structure, adding targeted modifications to improve performance. Notable alterations included group and instance normalization, dilated convolutions, and attention modules. According to their experimental evaluations, they eschewed more complex elements such as dense blocks and inverted residual bottlenecks, which brought negligible performance gains at increased computational cost.
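For intuition, a basic 3D U-Net building block using group normalization, one of the normalization choices mentioned above, might look like the following sketch. The channel counts and group size are illustrative assumptions, not the paper's exact configuration.

```python
import torch.nn as nn

class ConvBlock3D(nn.Module):
    """Two 3x3x3 convolutions, each followed by group normalization and
    ReLU: the kind of plain block favored over heavier alternatives such
    as dense blocks."""
    def __init__(self, in_ch, out_ch, groups=8):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv3d(in_ch, out_ch, kernel_size=3, padding=1, bias=False),
            nn.GroupNorm(groups, out_ch),
            nn.ReLU(inplace=True),
            nn.Conv3d(out_ch, out_ch, kernel_size=3, padding=1, bias=False),
            nn.GroupNorm(groups, out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.block(x)
```

Group normalization is a natural fit here because 3D volumes force small batch sizes, under which batch normalization statistics become unreliable.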
A critical aspect of the methodology was extensive on-the-fly data augmentation, which mitigated overfitting, a common challenge when training neural networks on medical datasets. Techniques included channel-wise intensity rescaling, additive Gaussian noise, and random flips, improving the models' robustness.
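A minimal on-the-fly version of the three augmentations named above could look like the sketch below; the flip probability, rescaling range, and noise level are assumed defaults, not the authors' reported settings.

```python
import torch

def augment(volume, flip_p=0.5, scale_range=(0.9, 1.1), noise_std=0.1):
    """Randomly augment one multi-channel MRI volume of shape [C, D, H, W].
    Parameter values are illustrative, not the paper's exact settings."""
    # Random flips along each spatial axis.
    for axis in (1, 2, 3):
        if torch.rand(1) < flip_p:
            volume = torch.flip(volume, dims=(axis,))
    # Per-channel intensity rescaling.
    scale = torch.empty(volume.shape[0], 1, 1, 1,
                        device=volume.device).uniform_(*scale_range)
    volume = volume * scale
    # Additive Gaussian noise.
    return volume + noise_std * torch.randn_like(volume)
```

Because these transforms are sampled anew at every iteration, the network rarely sees the exact same input twice, which is what makes on-the-fly augmentation an effective regularizer.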
Furthermore, self-ensembling via stochastic weight averaging, which averages a model's weights across checkpoints from late in training, stabilized predictions and enhanced generalization. Ensembling the resulting models and merging the two pipelines' label maps reflect a strategic approach to maximizing segmentation performance by leveraging the strengths of each pipeline.
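PyTorch ships utilities that make the stochastic-weight-averaging step easy to sketch. In the snippet below, `model`, `optimizer`, `train_loader`, `train_one_epoch`, `num_epochs`, and `swa_start` are assumed placeholders; this illustrates the general technique, not the authors' training loop.

```python
from torch.optim.swa_utils import AveragedModel, SWALR, update_bn

swa_model = AveragedModel(model)            # running average of weights
swa_scheduler = SWALR(optimizer, swa_lr=1e-4)

for epoch in range(num_epochs):
    train_one_epoch(model, optimizer, train_loader)
    if epoch >= swa_start:                  # start averaging late in training
        swa_model.update_parameters(model)
        swa_scheduler.step()

# Recompute batch-norm running statistics for the averaged weights
# (a no-op for group/instance normalization, which keep no such stats).
update_bn(train_loader, swa_model)
```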
Implications and Speculation
The work demonstrates substantial advances in automated brain tumor segmentation, bringing machine learning predictions closer to human expert annotations and thereby supporting clinical decision-making. Through effective preprocessing and tailored training schemes, the presented models could potentially be integrated into clinical workflows, particularly to aid radiation oncologists in treatment planning.
Future developments could explore semi-supervised techniques, since data scarcity and the cost of expert annotation are significant bottlenecks in training medical AI models. Moreover, ensembling diverse neural network architectures might further improve generalization and robustness.
In conclusion, while the U-Net architecture remains a staple for segmentation tasks, this paper exemplifies the refinement possible through tailored preprocessing, strategic architecture modifications, and a robust training regimen. The robustness and applicability of such models could extend beyond brain tumor segmentation to other complex biomedical imaging tasks, assuming similar domain-specific adaptations are made. The open-sourcing of the methodology allows broader adoption and iterative improvements, fuelling advancements in medical imaging AI systems.