Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Weakly Supervised Caveline Detection For AUV Navigation Inside Underwater Caves (2303.03670v2)

Published 7 Mar 2023 in eess.IV and cs.RO

Abstract: Underwater caves are challenging environments that are crucial for water resource management, and for our understanding of hydro-geology and history. Mapping underwater caves is a time-consuming, labor-intensive, and hazardous operation. For autonomous cave mapping by underwater robots, the major challenge lies in vision-based estimation in the complete absence of ambient light, which results in constantly moving shadows due to the motion of the camera-light setup. Thus, detecting and following the caveline as navigation guidance is paramount for robots in autonomous cave mapping missions. In this paper, we present a computationally light caveline detection model based on a novel Vision Transformer (ViT)-based learning pipeline. We address the problem of scarce annotated training data by a weakly supervised formulation where the learning is reinforced through a series of noisy predictions from intermediate sub-optimal models. We validate the utility and effectiveness of such weak supervision for caveline detection and tracking in three different cave locations: USA, Mexico, and Spain. Experimental results demonstrate that our proposed model, CL-ViT, balances the robustness-efficiency trade-off, ensuring good generalization performance while offering 10+ FPS on single-board (Jetson TX2) devices.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (53)
  1. CoralSeg: Learning Coral Segmentation from Sparse Annotations. Journal of Field Robotics (JFR), 36(8):1456–1477, 2019.
  2. J. Burge. Underwater Cave Surveying. Cave Diving Section of the National Speleological Society, 1988.
  3. American cave diving fatalities 1969-2007. International Journal of Aquatic Research and Education, 3(2):7, 2009.
  4. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(4):834–848, 2017.
  5. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European conference on computer vision (ECCV), pages 801–818, 2018.
  6. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
  7. S. Exley. Basic cave diving: A blueprint for survival. Cave Diving Section of the National Speleological Society, 1986.
  8. D. Ford and P. Williams. Introduction to Karst, chapter 1, pages 1–8. John Wiley & Sons, Ltd, 2007.
  9. Environmental challenges, technical solutions and standard operating procedures for data collection in photogrammetric studies toward a unified database of objects and features in underwater caves in mexico. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, 43:659–666, 2021.
  10. Y. Girdhar and G. Dudek. Modeling Curiosity in a Mobile Robot for Long-term Autonomous Exploration and Monitoring. Autonomous Robots, 40(7):1267–1278, 2016.
  11. Autonomous Adaptive Exploration using Realtime Online Spatiotemporal Topic Modeling. International Journal of Robotics Research (IJRR), 33(4):645–657, 2014.
  12. The arrival of humans on the Yucatan Peninsula: Evidence from submerged caves in the state of Quintana Roo, Mexico. Current Research in the Pleistocene, 25:1–24, 2008.
  13. A survey on instance segmentation: state of the art. International journal of multimedia information retrieval, 9(3):171–189, 2020.
  14. Searching for mobilenetv3. In IEEE/CVF International Conference on Computer Vision (ICCV), pages 1314–1324, 2019.
  15. nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2):203–211, 2021.
  16. Semantic Segmentation of Underwater Imagery: Dataset and Benchmark. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE/RSJ, 2020.
  17. Simultaneous Enhancement and Super-Resolution of Underwater Imagery for Improved Visual Perception. In Robotics: Science and Systems (RSS), Corvalis, Oregon, USA, July 2020.
  18. SVAM: Saliency-guided Visual Attention Modeling by Autonomous Underwater Robots. In Robotics: Science and Systems (RSS), NY, USA, 2022.
  19. Fast Underwater Image Enhancement for Improved Visual Perception. IEEE Robotics and Automation Letters (RA-L), 5(2):3227–3234, 2020.
  20. Experimental Comparison of Open Source Visual-Inertial-Based State Estimation Algorithms in the Underwater Domain. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 7221–7227, 2019.
  21. High definition, inexpensive, underwater mapping. In IEEE International Conference on Robotics and Automation (ICRA), pages 1113–1121, Philadelphia, PA, USA, 2022.
  22. A probabilistic hough transform. Pattern recognition, 24(4):303–316, 1991.
  23. One-shot informed robotic visual search in the wild. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5800–5807. IEEE, 2020.
  24. Scattnet: Semantic segmentation network with spatial and channel attention mechanism for high-resolution remote sensing images. IEEE Geoscience and Remote Sensing Letters, 18(5):905–909, 2020.
  25. Pyramid attention network for semantic segmentation. arXiv preprint arXiv:1805.10180, 2018.
  26. Expectation-maximization attention networks for semantic segmentation. In IEEE/CVF Int. Conference on Computer Vision, pages 9167–9176, 2019.
  27. Toward autonomous exploration in confined underwater environments. Journal of Field Robotics, 33(7):994–1012, 2016.
  28. Vision-based Autonomous Underwater Swimming in Dense Coral for Combined Collision Avoidance and Target Selection. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 1885–1891. IEEE, 2018.
  29. Contour-based approach for 3D mapping of underwater galleries. In Global Oceans 2020: Singapore–US Gulf Coast, pages 1–6. IEEE, 2020.
  30. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In International Conference on 3D vision (3DV), pages 565–571. Ieee, 2016.
  31. Deep neural networks: a comparison on different computing platforms. In Canadian Conference on Computer and Robot Vision (CRV), pages 383–389, Toronto, ON, Canada, May 2018.
  32. Autonomous 3D Semantic Mapping of Coral Reefs. In 12th Conference on Field and Service Robotics (FSR), pages 365–379, Tokyo, Japan, Aug. 2019.
  33. Coral Identification and Counting with an Autonomous Underwater Vehicle. In IEEE International Conference on Robotics and Biomimetics (ROBIO), pages 524–529, Kuala Lumpur, Malaysia, (Finalist of T. J. Tarn Best Paper in Robotics), Dec. 2018.
  34. M. Modasshir and I. Rekleitis. Augmenting coral reef monitoring with an enhanced detection system. In IEEE International Conference on Robotics and Automation, pages 1874–1880, Paris, France, 2020.
  35. Sonar Visual Inertial SLAM of Underwater Structures. In IEEE International Conference on Robotics and Automation, pages 5190–5196, 2018.
  36. An Underwater SLAM System using Sonar, Visual, Inertial, and Depth Sensor. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 1861–1868, Macau, (IROS ICROS Best Application Paper Award. Finalist), 2019.
  37. Contour based reconstruction of underwater structures using sonar, visual, inertial, and depth sensor. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 8048–8053, Macau, Nov. 2019.
  38. SVIn2: A Multi-sensor Fusion-based Underwater SLAM System. International Journal of Robotics Research, 41(11-12):1022–1042, July 2022.
  39. Vision transformers for dense prediction. In IEEE/CVF Int. Conference on Computer Vision, pages 12179–12188, 2021.
  40. Automated Fish Detection in Underwater Images Using Shape-Based Level Sets. Photogrammetric Record, 30(149):46–62, 2015.
  41. Autonomous exploration and 3-D mapping of underwater caves with the human-portable SUNFISH® AUV. In Global Oceans 2020: Singapore–US Gulf Coast, pages 1–10. IEEE, 2020.
  42. Novel application of 3D documentation techniques at a submerged Late Pleistocene cave site in Quintana Roo, Mexico. In Digital Heritage, volume 1, pages 181–182, 2015.
  43. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention (MICCAI), pages 234–241. Springer, 2015.
  44. M. Tan and Q. Le. Efficientnet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning, pages 6105–6114. PMLR, 2019.
  45. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning, 4(2):26–31, 2012.
  46. Real-Time Dense 3D Mapping of Underwater Environments. In IEEE International Conference on Robotics and Automation (ICRA), London, UK, 2023.
  47. Semantic segmentation in underwater ship inspections: Benchmark and data set. IEEE Journal of Oceanic Engineering, 2022.
  48. N. Weidner. Underwater Cave Mapping and Reconstruction Using Stereo Vision. Master’s thesis, Computer Science and Engineering Department, University of South Carolina, Columbia, SC, 2017.
  49. Underwater cave mapping using stereo vision. In IEEE International Conference on Robotics and Automation (ICRA), pages 5709 – 5715, 2017.
  50. Road extraction by deep residual u-net. IEEE Geoscience and Remote Sensing Letters, 15(5):749–753, 2018.
  51. Z. Zhang and M. Sabuncu. Generalized cross entropy loss for training deep neural networks with noisy labels. Advances in Neural Information Processing Systems, 31, 2018.
  52. Unet++: Redesigning skip connections to exploit multiscale features in image segmentation. IEEE transactions on medical imaging, 39(6):1856–1867, 2019.
  53. Saliency-Based Diver Target Detection and Localization Method. Mathematical Problems in Engineering, 2020, 2020.
Citations (8)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com