CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography Angiography via Context-Aware Shifted Window Self-Attention (2401.13049v1)
Abstract: Advancements in medical imaging and endovascular grafting have facilitated minimally invasive treatments for aortic diseases. Accurate 3D segmentation of the aorta and its branches is crucial for interventions, as inaccurate segmentation can lead to erroneous surgical planning and endograft construction. Previous methods simplified aortic segmentation as a binary image segmentation problem, overlooking the necessity of distinguishing between individual aortic branches. In this paper, we introduce Context Infused Swin-UNet (CIS-UNet), a deep learning model designed for multi-class segmentation of the aorta and thirteen aortic branches. Combining the strengths of Convolutional Neural Networks (CNNs) and Swin transformers, CIS-UNet adopts a hierarchical encoder-decoder structure comprising a CNN encoder, symmetric decoder, skip connections, and a novel Context-aware Shifted Window Self-Attention (CSW-SA) as the bottleneck block. Notably, CSW-SA introduces a unique utilization of the patch merging layer, distinct from conventional Swin transformers. It efficiently condenses the feature map, providing a global spatial context and enhancing performance when applied at the bottleneck layer, offering superior computational efficiency and segmentation accuracy compared to the Swin transformers. We trained our model on computed tomography (CT) scans from 44 patients and tested it on 15 patients. CIS-UNet outperformed the state-of-the-art SwinUNetR segmentation model, which is solely based on Swin transformers, by achieving a superior mean Dice coefficient of 0.713 compared to 0.697, and a mean surface distance of 2.78 mm compared to 3.39 mm. CIS-UNet's superior 3D aortic segmentation offers improved precision and optimization for planning endovascular treatments. Our dataset and code will be publicly available.
- Segmentation of aorta 3d ct images based on 2d convolutional neural networks. Electronics 10, 2559.
- The society for vascular surgery practice guidelines on the care of patients with an abdominal aortic aneurysm. Journal of vascular surgery 67, 2–77.
- Multi-stage learning for segmentation of aortic dissections using a prior aortic anatomy simplification. Medical image analysis 69, 101931.
- Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 .
- Deep learning-based medical image segmentation of the aorta using xr-msf-u-net. Computer Methods and Programs in Biomedicine 225, 107073.
- Visformer: The vision-friendly transformer, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 589–598.
- Monai: Medical open network for ai. Online at https://doi. org/10.5281/zenodo 5525502.
- Graph cut based automatic aorta segmentation with an adaptive smoothness constraint in 3d abdominal ct images. Neurocomputing 310, 46–58.
- An image is worth 16x16 words: Transformers for image recognition at scale, in: ICLR.
- 3d automatic segmentation of aortic computed tomography angiography combining multi-view 2d convolutional neural networks. Cardiovascular engineering and technology 11, 576–586.
- 3d slicer as an image computing platform for the quantitative imaging network. Magnetic resonance imaging 30, 1323–1341.
- Neocognitron: A hierarchical neural network capable of visual pattern recognition. Neural networks 1, 119–130.
- Fusing 2d and 3d convolutional neural networks for the segmentation of aorta and coronary arteries from ct images. Artificial Intelligence in Medicine 121, 102189.
- Cmt: Convolutional neural networks meet vision transformers, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12175–12185.
- Swin unetr: Swin transformers for semantic segmentation of brain tumors in mri images, in: International MICCAI Brainlesion Workshop, Springer. pp. 272–284.
- Unetr: Transformers for 3d medical image segmentation, in: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp. 574–584.
- Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778.
- Left-ventricle quantification using residual u-net, in: Statistical Atlases and Computational Models of the Heart. Atrial Segmentation and LV Quantification Challenges: 9th International Workshop, STACOM 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 16, 2018, Revised Selected Papers 9, Springer. pp. 371–380.
- Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25.
- Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge, in: Proc. MICCAI Multi-Atlas Labeling Beyond Cranial Vault—Workshop Challenge, p. 12.
- Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 2278–2324.
- Swin transformer: Hierarchical vision transformer using shifted windows, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 10012–10022.
- Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 .
- Loss odyssey in medical image segmentation. Medical Image Analysis 71, 102035.
- 2022 acc/aha guideline for the diagnosis and management of aortic disease: a report of the american heart association/american college of cardiology joint committee on clinical practice guidelines. Journal of the American College of Cardiology 80, e223–e393.
- Modified fenestrated stent grafts: device design, modifications, implantation, and current applications. Perspectives in vascular surgery and endovascular therapy 21, 157–167.
- Transfemoral intraluminal graft implantation for abdominal aortic aneurysms. Annals of vascular surgery 5, 491–499.
- dresu-net: 3d deep residual u-net based brain tumor segmentation from multimodal mri. Biomedical Signal Processing and Control 79, 103861.
- U-net: Convolutional networks for biomedical image segmentation, in: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, Springer. pp. 234–241.
- A deep learning-based and fully automated pipeline for thoracic aorta geometric analysis and planning for endovascular repair from computed tomography. Journal of Digital Imaging 35, 226–239.
- Automated 3d segmentation and diameter measurement of the thoracic aorta on non-contrast enhanced ct. European radiology 29, 4613–4623.
- Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 .
- Self-supervised pre-training of swin transformers for 3d medical image analysis, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 20730–20740.
- High-resolution swin transformer for automatic medical image segmentation. Sensors 23, 3420.
- Transclaw u-net: claw u-net with transformers for medical image segmentation, in: 2022 5th International Conference on Information Communication and Signal Processing (ICICSP), IEEE. pp. 280–284.
Collections
Sign up for free to add this paper to one or more collections.