nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation (2404.09556v2)
Abstract: The release of nnU-Net marked a paradigm shift in 3D medical image segmentation, demonstrating that a properly configured U-Net architecture could still achieve state-of-the-art results. Despite this, the pursuit of novel architectures, and the respective claims of superior performance over the U-Net baseline, continued. In this study, we demonstrate that many of these recent claims fail to hold up when scrutinized for common validation shortcomings, such as the use of inadequate baselines, insufficient datasets, and neglected computational resources. By meticulously avoiding these pitfalls, we conduct a thorough and comprehensive benchmarking of current segmentation methods including CNN-based, Transformer-based, and Mamba-based approaches. In contrast to current beliefs, we find that the recipe for state-of-the-art performance is 1) employing CNN-based U-Net models, including ResNet and ConvNeXt variants, 2) using the nnU-Net framework, and 3) scaling models to modern hardware resources. These results indicate an ongoing innovation bias towards novel architectures in the field and underscore the need for more stringent validation standards in the quest for scientific progress.
- Auto3dseg. LINK. Accessed: 2024-01-25.
- Auto3dseg kits23 tutorial. LINK. Accessed: 2024-03-05.
- Swinunetr comment on additional training data. https://github.com/Project-MONAI/research-contributions/issues/68. Accessed: 2024-01-25.
- The medical segmentation decathlon. Nature communications, 2022.
- The rsna-asnr-miccai brats 2021 benchmark on brain tumor segmentation and radiogenomic classification. arXiv preprint arXiv:2107.02314, 2021.
- Advancing the cancer genome atlas glioma mri collections with expert segmentation labels and radiomic features. Scientific data, 2017.
- Deep learning techniques for automatic mri cardiac multi-structures segmentation and diagnosis: is the problem solved? IEEE TMI, 2018.
- The liver tumor segmentation benchmark (lits). Medical Image Analysis, 2023.
- Swin-unet: Unet-like pure transformer for medical image segmentation. In ECCV, 2022.
- Monai: An open-source framework for deep learning in healthcare. arXiv preprint arXiv:2211.02701, 2022.
- Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306, 2021.
- Utnet: a hybrid transformer architecture for medical image segmentation. In MICCAI 2021, 2021.
- A. Gu and T. Dao. Mamba: Linear-time sequence modeling with selective state spaces. arXiv preprint arXiv:2312.00752, 2023.
- Swin unetr: Swin transformers for semantic segmentation of brain tumors in mri images. In International MICCAI Brainlesion Workshop, 2021.
- Unetr: Transformers for 3d medical image segmentation. In Proceedings of the WACV, 2022.
- Unetr: Transformers for 3d medical image segmentation. In WACV, 2022.
- Swinunetr-v2: Stronger swin transformers with stagewise convolutions for 3d medical image segmentation. In MICCAI, 2023.
- Dints: Differentiable neural network topology search for 3d medical image segmentation. In Proceedings of WACV, 2021.
- The kits21 challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase ct, 2023.
- Stu-net: Scalable and transferable medical image segmentation models empowered by large-scale supervised pre-training. arXiv preprint arXiv:2304.06716, 2023.
- nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2):203–211, 2021.
- F. Isensee and K. H. Maier-Hein. An attempt at beating the 3d u-net. arXiv preprint arXiv:1908.02182, 2019.
- nnu-net: Self-adapting framework for u-net-based medical image segmentation. arXiv preprint arXiv:1809.10486, 2018.
- Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation. Advances in Neural Information Processing Systems, 2022.
- 2015 miccai multi-atlas labeling beyond the cranial vault workshop and challenge. In Proc. MICCAI Multi-Atlas Labeling Beyond Cranial Vault—Workshop Challenge, 2015.
- U-mamba: Enhancing long-range dependency for biomedical image segmentation. arXiv preprint arXiv:2401.04722, 2024.
- The multimodal brain tumor image segmentation benchmark (brats). IEEE TMI, 2014.
- A. Myronenko. 3d mri brain tumor segmentation using autoencoder regularization. In BrainLes 2018, Held in Conjunction with MICCAI, 2019.
- U-net: Convolutional networks for biomedical image segmentation. In MICCAI, 2015.
- Transformer utilization in medical image segmentation networks. arXiv preprint arXiv:2304.04225, 2023.
- Mednext: transformer-driven scaling of convnets for medical image segmentation. In MICCAI, 2023.
- Self-supervised pre-training of swin transformers for 3d medical image analysis. In CVPR, 2022.
- Attention is all you need. NeurIPS, 2017.
- Transbts: Multimodal brain tumor segmentation using transformer. In MICCAI, 2021.
- Totalsegmentator: Robust segmentation of 104 anatomic structures in ct images. Radiol Artif Intell., 2023.
- D-former: A u-shaped dilated transformer for 3d medical image segmentation. Neural Computing and Applications, 2023.
- Cotr: Efficiently bridging cnn and transformer for 3d medical image segmentation. 2021.
- Segmamba: Long-range sequential modeling mamba for 3d medical image segmentation. arXiv preprint arXiv:2401.13560, 2024.
- Levit-unet: Make faster encoders with transformer for medical image segmentation. In PRCV, 2023.
- Transfuse: Fusing transformers and cnns for medical image segmentation. In MICCAI, 2021.
- nnformer: Interleaved transformer for volumetric segmentation. arXiv preprint arXiv:2109.03201, 2021.