T-Mamba: A unified framework with Long-Range Dependency in dual-domain for 2D & 3D Tooth Segmentation (2404.01065v2)
Abstract: Tooth segmentation is a pivotal step in modern digital dentistry, essential for orthodontic diagnosis and treatment planning. Despite its importance, the task remains challenging because 2D and 3D tooth data are noisy and low in contrast. Both Convolutional Neural Networks (CNNs) and Transformers have shown promise in medical image segmentation, yet each has limitations: CNNs struggle to capture long-range dependencies, and Transformers incur high computational complexity. To address these limitations, this paper introduces T-Mamba, which integrates frequency-based features and shared bi-positional encoding into Vision Mamba for efficient global feature modeling. In addition, we design a gate selection unit that adaptively integrates two features from the spatial domain and one feature from the frequency domain. T-Mamba is the first work to introduce frequency-based features into Vision Mamba, and its flexibility allows it to process both 2D and 3D tooth data without separate modules. We also present TED3, a large-scale public 2D dental X-ray dataset. Extensive experiments demonstrate that T-Mamba achieves new SOTA results on a public tooth CBCT dataset and outperforms previous SOTA methods on the TED3 dataset. The code and models are publicly available at: https://github.com/isbrycee/T-Mamba.
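The abstract describes a gate selection unit that adaptively combines two spatial-domain features with one frequency-domain feature, and a design that handles 2D and 3D inputs alike. The following NumPy sketch illustrates that general idea only; it is not the authors' implementation. The function names `frequency_branch` and `gated_fusion`, the softmax gating over globally pooled features, and the `gate_weight` matrix are all assumptions made for illustration.

```python
import numpy as np

def frequency_branch(x, freq_filter=None):
    """Hypothetical frequency-domain branch (an assumption, not from the paper).

    x has shape (C, *spatial), where spatial is (H, W) for 2D or (D, H, W) for 3D.
    The feature is transformed with an N-dimensional FFT over the spatial axes,
    optionally multiplied by a filter, and transformed back.
    """
    axes = tuple(range(1, x.ndim))
    spectrum = np.fft.fftn(x, axes=axes)
    if freq_filter is not None:
        spectrum = spectrum * freq_filter
    return np.fft.ifftn(spectrum, axes=axes).real

def gated_fusion(feats, gate_weight):
    """Fuse three same-shaped features with input-dependent gates.

    feats: list of three arrays, each of shape (C, *spatial).
    gate_weight: array of shape (3 * C, 3); a stand-in for learned parameters.
    Returns the fused feature and the three gate values (which sum to 1).
    """
    axes = tuple(range(1, feats[0].ndim))
    # Global average pooling per feature, then concatenate to a (3 * C,) vector.
    pooled = np.concatenate([f.mean(axis=axes) for f in feats])
    logits = pooled @ gate_weight
    logits = logits - logits.max()  # numerical stability for the softmax
    gates = np.exp(logits) / np.exp(logits).sum()
    fused = sum(g * f for g, f in zip(gates, feats))
    return fused, gates

rng = np.random.default_rng(0)
for shape in [(4, 8, 8), (4, 6, 6, 6)]:  # a 2D and a 3D feature map
    forward = rng.standard_normal(shape)   # stand-in for one spatial feature
    backward = rng.standard_normal(shape)  # stand-in for the other spatial feature
    freq = frequency_branch(forward)       # stand-in for the frequency feature
    weight = rng.standard_normal((3 * shape[0], 3))
    fused, gates = gated_fusion([forward, backward, freq], weight)
    print(shape, fused.shape, gates.round(3))
```

Because the spatial axes are derived from the array rank, the same two functions run unchanged on 2D and 3D inputs, which mirrors the unified 2D/3D claim in the abstract at a toy scale.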