Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers (2403.14465v2)

Published 21 Mar 2024 in eess.IV and cs.CV

Abstract: In minimally invasive endovascular procedures, contrast-enhanced angiography remains the most robust imaging technique. However, it is at the expense of the patient and clinician's health due to prolonged radiation exposure. As an alternative, interventional ultrasound has notable benefits such as being radiation-free, fast to deploy, and having a small footprint in the operating room. Yet, ultrasound is hard to interpret, and highly prone to artifacts and noise. Additionally, interventional radiologists must undergo extensive training before they become qualified to diagnose and treat patients effectively, leading to a shortage of staff, and a lack of open-source datasets. In this work, we seek to address both problems by introducing a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images, without demanding any labeled data. The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism, and is capable of learning feature changes across time and space. To facilitate training, we used synthetic ultrasound data based on physics-driven catheter insertion simulations, and translated the data into a unique CT-Ultrasound common domain, CACTUSS, to improve the segmentation performance. We generated ground truth segmentation masks by computing the optical flow between adjacent frames using FlowNet2, and performed thresholding to obtain a binary map estimate. Finally, we validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms, thus demonstrating its potential for applications to clinical data in the future.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. Joel A Kaplan. Kaplan’s cardiac anesthesia: In cardiac and noncardiac surgery. Elsevier Health Sciences, 2016.
  2. Epidemiology and contemporary management of abdominal aortic aneurysms. Abdominal Radiology, 43:1032–1043, 2018.
  3. Radiocontrast-induced acute renal failure. Journal of Intensive Care Medicine, 20(2):63–75, 2005.
  4. First experience using intraoperative contrast-enhanced ultrasound during endovascular aneurysm repair for infrarenal aortic aneurysms. Journal of vascular surgery, 51(5):1103–1110, 2010.
  5. Ultrasound-guided endovascular treatment for vascular access malfunction: results in 4896 cases. The Journal of Vascular Access, 14(3):225–230, 2013.
  6. Ultrasound physics and technology: how, why and when. Elsevier Health Sciences, 2011.
  7. Intelligent robotic sonographer: Mutual information-based disentangled reward learning from few demonstrations. The International Journal of Robotics Research, page 02783649231223547, 2023.
  8. Motion-aware robotic 3d ultrasound. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 12494–12500. IEEE, 2021.
  9. Deformation-aware robotic 3d ultrasound. IEEE Robotics and Automation Letters, 6(4):7675–7682, 2021.
  10. Aiareseg: Catheter detection and segmentation in interventional ultrasound using transformers. arXiv preprint arXiv:2309.14492, 2023.
  11. Fully automatic catheter segmentation in mri with 3d convolutional neural networks: application to mri-guided gynecologic brachytherapy. Physics in Medicine & Biology, 64(16):165008, 2019.
  12. End-to-end real-time catheter segmentation with optical flow-guided warping during endovascular intervention. In 2020 IEEE International Conference on Robotics and Automation (ICRA), pages 9967–9973. IEEE, 2020.
  13. A survey on deep learning in medical image analysis. Medical image analysis, 42:60–88, 2017.
  14. Autonomous robotic screening of tubular structures based only on real-time ultrasound imaging feedback. IEEE Transactions on Industrial Electronics, 69(7):7064–7075, 2021.
  15. Ultrasound image segmentation: a deeply supervised network with attention to boundaries. IEEE Transactions on Biomedical Engineering, 66(6):1637–1648, 2018.
  16. Deep learning robotic guidance for autonomous vascular access. Nature Machine Intelligence, 2(2):104–115, 2020.
  17. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015.
  18. nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2):203–211, 2021.
  19. Attention-based models for speech recognition. Advances in neural information processing systems, 28, 2015.
  20. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  21. An image is worth 16x16 words: Transformers for image recognition at scale. ICLR, 2021.
  22. Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6881–6890, 2021.
  23. End-to-end object detection with adaptive clustering transformer. arXiv preprint arXiv:2011.09315, 2020.
  24. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  25. Ross Girshick. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 1440–1448, 2015.
  26. Cathsim: An open-source simulator for autonomous cannulation. arXiv preprint arXiv:2208.01455, 2022.
  27. Cactuss: Common anatomical ct-us space for us examinations. International Journal of Computer Assisted Radiology and Surgery, pages 1–9, 2024.
  28. Determining optical flow. Artificial intelligence, 17(1-3):185–203, 1981.
  29. Unsupervised space-time network for temporally-consistent segmentation of multiple motions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22139–22148, 2023.
  30. Guess what moves: Unsupervised video and image segmentation by anticipating motion. arXiv preprint arXiv:2205.07844, 2022.
  31. Raft: Recurrent all-pairs field transforms for optical flow. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16, pages 402–419. Springer, 2020.
  32. Multi-atlas labeling beyond the cranial vault - workshop and challenge. https://www.synapse.org/#!Synapse:syn3193805/wiki/89480, 2015.
  33. Gunnar Farnebäck. Two-frame motion estimation based on polynomial expansion. In Image Analysis: 13th Scandinavian Conference, SCIA 2003 Halmstad, Sweden, June 29–July 2, 2003 Proceedings 13, pages 363–370. Springer, 2003.
  34. Flownet 2.0: Evolution of optical flow estimation with deep networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2462–2470, 2017.
  35. Pwc-net: Cnns for optical flow using pyramid, warping, and cost volume. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 8934–8943, 2018.
  36. A fusion approach for multi-frame optical flow estimation.
  37. Per-pixel classification is not all you need for semantic segmentation.
  38. Monai: An open-source framework for deep learning in healthcare. arXiv preprint arXiv:2211.02701, 2022.

Summary

We haven't generated a summary for this paper yet.