
T-Mamba: A unified framework with Long-Range Dependency in dual-domain for 2D & 3D Tooth Segmentation (2404.01065v2)

Published 1 Apr 2024 in cs.CV

Abstract: Tooth segmentation is a pivotal step in modern digital dentistry, essential for applications across orthodontic diagnosis and treatment planning. Despite its importance, the task is challenging because of the high noise and low contrast inherent in 2D and 3D tooth data. Both Convolutional Neural Networks (CNNs) and Transformers have shown promise in medical image segmentation, yet each has limitations in handling long-range dependencies and computational complexity. To address these limitations, this paper introduces T-Mamba, which integrates frequency-based features and shared bi-positional encoding into Vision Mamba for efficient global feature modeling. In addition, we design a gate selection unit that adaptively integrates two features in the spatial domain and one feature in the frequency domain. T-Mamba is the first work to introduce frequency-based features into Vision Mamba, and its flexibility allows it to process both 2D and 3D tooth data without separate modules. This paper also presents TED3, a large-scale public 2D dental X-ray dataset of teeth. Extensive experiments demonstrate that T-Mamba achieves new state-of-the-art (SOTA) results on a public tooth CBCT dataset and outperforms previous SOTA methods on the TED3 dataset. The code and models are publicly available at: https://github.com/isbrycee/T-Mamba.
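The gate selection idea in the abstract (adaptively combining two spatial-domain features and one frequency-domain feature) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify how the gate or the frequency branch is computed, so the per-channel softmax gate, the log-magnitude FFT feature, and the names `frequency_feature`, `gated_fusion`, and `gate_logits` are all assumptions made for illustration. The released repository is the reference for the actual design.

```python
import numpy as np


def softmax(x, axis=0):
    """Numerically stable softmax along one axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def frequency_feature(x):
    """Hypothetical frequency branch (an assumption, not the paper's method).

    x has shape (C, *spatial), with 2 or 3 spatial axes. Returns the
    log-magnitude of the FFT taken over the spatial axes, same shape as x.
    """
    spatial_axes = tuple(range(1, x.ndim))
    spectrum = np.fft.fftn(x, axes=spatial_axes)
    return np.log1p(np.abs(spectrum))


def gated_fusion(spatial_a, spatial_b, freq, gate_logits):
    """Fuse three same-shaped feature maps with per-channel softmax gates.

    gate_logits has shape (3, C); in a trained model these would be learned
    or predicted from the input. Here they are plain inputs (an assumption).
    """
    feats = np.stack([spatial_a, spatial_b, freq])       # (3, C, *spatial)
    gates = softmax(gate_logits, axis=0)                 # (3, C), sums to 1 per channel
    gates = gates.reshape(gates.shape + (1,) * (feats.ndim - 2))
    return (gates * feats).sum(axis=0)                   # (C, *spatial)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # The same functions handle a 2D map (C, H, W) and a 3D volume (C, D, H, W).
    for shape in [(4, 8, 8), (4, 6, 8, 8)]:
        a = rng.standard_normal(shape)
        b = rng.standard_normal(shape)
        fused = gated_fusion(a, b, frequency_feature(a), np.zeros((3, shape[0])))
        print(shape, "->", fused.shape)
```

Because the functions operate over whatever spatial axes follow the channel axis, one code path covers both 2D and 3D inputs, which mirrors the flexibility the abstract claims without implying that T-Mamba achieves it this way.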

