Machine Learning for Shipwreck Segmentation from Side Scan Sonar Imagery: Dataset and Benchmark (2401.14546v2)
Abstract: Open-source benchmark datasets have been a critical component for advancing machine learning for robot perception in terrestrial applications. Benchmark datasets enable the widespread development of state-of-the-art machine learning methods, which require large datasets for training, validation, and thorough comparison to competing approaches. Underwater environments impose several operational challenges that hinder efforts to collect large benchmark datasets for marine robot perception. Furthermore, a low abundance of targets of interest relative to the size of the search space leads to increased time and cost required to collect useful datasets for a specific task. As a result, there is limited availability of labeled benchmark datasets for underwater applications. We present the AI4Shipwrecks dataset, which consists of 28 distinct shipwrecks totaling 286 high-resolution labeled side scan sonar images to advance the state-of-the-art in autonomous sonar image understanding. We leverage the unique abundance of targets in Thunder Bay National Marine Sanctuary in Lake Huron, MI, to collect and compile a sonar imagery benchmark dataset through surveys with an autonomous underwater vehicle (AUV). We consulted with expert marine archaeologists for the labeling of robotically gathered data. We then leverage this dataset to perform benchmark experiments for comparison of state-of-the-art supervised segmentation methods, and we present insights on opportunities and open challenges for the field. The dataset and benchmarking tools will be released as an open-source benchmark dataset to spur innovation in machine learning for Great Lakes and ocean exploration. The dataset and accompanying software are available at https://umfieldrobotics.github.io/ai4shipwrecks/.
- Aurora a multi sensor dataset for robotic ocean exploration. DOI: 10.21227/nnms-te61. URL https://dx.doi.org/10.21227/nnms-te61.
- On-line multi-class segmentation of side-scan sonar imagery using an autonomous underwater vehicle. Journal of Marine Science and Engineering, 8(8): 557. DOI: 10.3390/jmse8080557.
- A rasterized ray-tracer pipeline for real-time multi-device sonar simulation. Graphical Models, 111: 101086. DOI: https://doi.org/10.1016/j.gmod.2020.101086. URL https://www.sciencedirect.com/science/article/pii /S1524070320300278.
- Rethinking Atrous Convolution for Semantic Image Segmentation. ArXiv:1706.05587 [cs].
- Vision Transformer Adapter for Dense Predictions. ArXiv:2205.08534 [cs].
- Physics-based modelling and simulation of multibeam echosounder perception for autonomous underwater manipulation. Frontiers in Robotics and AI, 8. DOI: 10.3389/frobt.2021.706646. URL https://www.frontiersin.org/articles/10.3389/frobt.2021.706646.
- A deep learning approach to target recognition in side-scan sonar imagery. In: OCEANS 2018 MTS/IEEE Charleston, pp. 1–4. DOI: 10.1109/OCEANS.2018.8604879.
- The UNESCO convention on the protection of the underwater cultural heritage: a future for our past? Conservation and management of archaeological sites, 11(1): 54–69.
- Revisiting Image Pyramid Structure for High Resolution Salient Object Detection. ArXiv:2209.09475 [cs].
- Deep learning from shallow dives: Sonar image generation and training for underwater object detection. ArXiv:1810.07990 [cs].
- Conditional gans for sonar image filtering with applications to underwater occupancy mapping. IEEE. DOI: 10.1109/icra48891.2023.10160646. URL https://doi.org/10.1109
- Cyclegan-based realistic image dataset generation for forward-looking sonar. Advanced Robotics, 35(3-4): 242–254. DOI: 10.1080/01691864.2021.1873845. URL https://doi.org/10.1080/01691864.2021.1873845.
- MacLennan, D. (1986). Time varied gain functions for pulsed sonars. Journal of Sound and Vibration, 110(3): 511–522.
- Making a completely blind image quality analyzer. IEEE Signal Processing Letters, 20(3): 209–212. DOI: 10.1109/LSP.2012.2227726.
- A comparison of few-shot learning methods for underwater optical and sonar image classification. In: Global Oceans 2020: Singapore–US Gulf Coast, IEEE, pp. 1–10.
- U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings Part III, 18. Springer, pp. 234–241.
- Cross-view and cross-domain underwater localization based on optical aerial and acoustic underwater images. IEEE Robotics and Automation Letters, 7(2): 4969–4974. DOI: 10.1109/LRA.2022.3154482.
- On the stratification of multi-label data. In: Machine Learning and Knowledge Discovery in Databases, Berlin Heidelberg: Springer Berlin Heidelberg, pp. 145–158. ISBN: 978-3-642-23808-6.
- Towards sim2real for shipwreck detection in side scan sonar imagery. In: 3rd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics.
- Stars: Zero-shot sim-to-real transfer for segmentation of shipwrecks in sonar imagery. In: 34th British Machine Vision Conference 2023 BMVC 2023, Aberdeen, UK, November 20-24
- Thunder Bay National Marine Sanctuary (Accessed online: 2024). URL: https://thunderbay.noaa.gov/.