FNOSeg3D: Resolution-Robust 3D Image Segmentation with Fourier Neural Operator (2310.03872v1)
Abstract: Due to the computational complexity of 3D medical image segmentation, training with downsampled images is a common remedy for out-of-memory errors in deep learning. Nevertheless, as standard spatial convolution is sensitive to variations in image resolution, the accuracy of a convolutional neural network trained with downsampled images can be suboptimal when applied on the original resolution. To address this limitation, we introduce FNOSeg3D, a 3D segmentation model robust to training image resolution based on the Fourier neural operator (FNO). The FNO is a deep learning framework for learning mappings between functions in partial differential equations, which has the appealing properties of zero-shot super-resolution and global receptive field. We improve the FNO by reducing its parameter requirement and enhancing its learning capability through residual connections and deep supervision, and these result in our FNOSeg3D model which is parameter efficient and resolution robust. When tested on the BraTS'19 dataset, it achieved superior robustness to training image resolution than other tested models with less than 1% of their model parameters.
- “Deep learning techniques for medical image segmentation: achievements and challenges,” Journal of Digital Imaging, vol. 32, no. 4, pp. 582–596, 2019.
- “A review of deep-learning-based medical image segmentation methods,” Sustainability, vol. 13, no. 3, pp. 1224, 2021.
- “UTNet: a hybrid transformer architecture for medical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2021, pp. 61–71.
- “CoTr: Efficiently bridging CNN and transformer for 3D medical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2021, pp. 171–180.
- “UNETR: Transformers for 3D medical image segmentation,” in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022, pp. 574–584.
- “Fourier neural operator for parametric partial differential equations,” in International Conference on Learning Representations, 2021.
- “Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge,” arXiv:1811.02629 [cs.CV], 2018.
- “Deep residual learning for image recognition,” in IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 770–778.
- “Deeply-supervised nets,” in International Conference on Artificial Intelligence and Statistics, 2015, pp. 562–570.
- “Layer normalization,” arXiv:1607.06450 [stat.ML], 2016.
- “Self-normalizing neural networks,” in Advances in Neural Information Processing Systems, 2017, pp. 972–981.
- Ken C. L. Wong and Mehdi Moradi, “3D segmentation with fully trainable Gabor kernels and Pearson’s correlation coefficient,” in Machine Learning in Medical Imaging, 2022, pp. 53–61.
- “Adam: A method for stochastic optimization,” arXiv:1412.6980 [cs.LG], 2014.
- “SGDR: Stochastic gradient descent with warm restarts,” in International Conference on Learning Representations, 2017.
- “3D segmentation with exponential logarithmic loss for highly unbalanced object sizes,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2018, pp. 612–619.