STARS: Zero-shot Sim-to-Real Transfer for Segmentation of Shipwrecks in Sonar Imagery (2310.01667v1)
Abstract: In this paper, we address the problem of sim-to-real transfer for object segmentation when there is no access to real examples of an object of interest during training, i.e. zero-shot sim-to-real transfer for segmentation. We focus on the application of shipwreck segmentation in side scan sonar imagery. Our novel segmentation network, STARS, addresses this challenge by fusing a predicted deformation field and anomaly volume, allowing it to generalize better to real sonar images and achieve more effective zero-shot sim-to-real transfer for image segmentation. We evaluate the sim-to-real transfer capabilities of our method on a real, expert-labeled side scan sonar dataset of shipwrecks collected from field work surveys with an autonomous underwater vehicle (AUV). STARS is trained entirely in simulation and performs zero-shot shipwreck segmentation with no additional fine-tuning on real data. Our method provides a significant 20% increase in segmentation performance for the targeted shipwreck class compared to the best baseline.
- Zero-shot semantic segmentation. In Advances in Neural Information Processing Systems, volume 32, 2019.
- On-line multi-class segmentation of side-scan sonar imagery using an autonomous underwater vehicle. Journal of Marine Science and Engineering, 8(8):557, 2020.
- Blender Online Community. Blender - a 3d modelling and rendering package, 2018. URL http://www.blender.org.
- The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3213–3223, June 2016.
- Deformable convolutional networks. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 764–773, 2017.
- Cut, paste and learn: Surprisingly easy synthesis for instance detection. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 1310–1319, 2017.
- A deep learning approach to target recognition in side-scan sonar imagery. In OCEANS 2018 MTS/IEEE Charleston, pages 1–4, 2018.
- Pøda: Prompt-driven zero-shot domain adaptation. In ICCV, 2023.
- Detection of boulders in side scan sonar mosaics by a neural network. Geosciences, 9(4):159, 2019.
- Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17(1):2096–2030, Jan 2016.
- Side-scan sonar image classification based on style transfer and pre-trained convolutional neural networks. Electronics, 10(15):1823, 2021.
- From pixel to patch: Synthesize context-aware features for zero-shot semantic segmentation. IEEE Transactions on Neural Networks and Learning Systems, pages 1–15, 2022.
- CyCADA: Cycle-consistent adversarial domain adaptation. In Proceedings of the 35th International Conference on Machine Learning, volume 80, pages 1989–1998. PMLR, 10–15 Jul 2018.
- Hrda: Context-aware high-resolution domain-adaptive semantic segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), page 372–391, Berlin, Heidelberg, 2022.
- Side-scan sonar image synthesis based on generative adversarial network for images in multiple frequencies. IEEE Geoscience and Remote Sensing Letters, 18(9):1505–1509, 2021.
- L.M. Linnett J.M. Bell. Simulation and analysis of synthetic sidescan sonar images. IEE Proc. - Radar, Sonar and Navigation, 144:219–226(7), August 1997.
- Revisiting image pyramid structure for high resolution salient object detection. In Proceedings of the Asian Conference on Computer Vision, pages 108–124, 2022.
- Segment anything. arXiv:2304.02643, 2023.
- Data augmentation using image translation for underwater sonar image segmentation. PLOS ONE, 17(8):1–15, 08 2022.
- Deep learning from shallow dives: Sonar image generation and training for underwater object detection. CoRR, abs/1810.07990, 2018.
- Deep learning based object detection via style-transferred underwater sonar images. IFAC-PapersOnLine, 52(21):152–155, 2019.
- Machine learning techniques for auv side-scan sonar data feature extraction as applied to intelligent search for underwater archaeological sites. In Field and Service Robotics, pages 219–233, Singapore, 2021.
- Panda: Adapting pretrained features for anomaly detection and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2806–2814, 2021.
- Blainder—a blender ai add-on for generation of semantically labeled depth-sensing data. Sensors, 21(6), 2021.
- Towards total recall in industrial anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 14318–14328, June 2022.
- An introduction to the sonar equations with applications. 1976.
- Towards sim2real for shipwreck detection in side scan sonar imagery. 3rd Workshop on Closing the Reality Gap in Sim2Real Transfer, Robotics: Science and Systems, 2022.
- Synthetic sonar image simulation with various seabed conditions for automatic target recognition. In OCEANS 2022, Hampton Roads, pages 1–8, 2022.
- Deep high-resolution representation learning for human pose estimation. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5686–5696, 2019.
- Raft: Recurrent all-pairs field transforms for optical flow. In Proceedings of the European Conference on Computer Vision (ECCV), pages 402–419, 2020.
- Thunder Bay National Marine Sanctuary. Thunder Bay National Marine Sanctuary. https://thunderbay.noaa.gov/, Accessed online: 2023.
- Minet: Efficient deep learning automatic target recognition for small autonomous vehicles. IEEE Geoscience and Remote Sensing Letters, 18(6):1014–1018, 2021.
- TURBOSQUID. 3D Models for Professionals. https://www.turbosquid.com, Accessed online: 2022.
- Adversarial discriminative domain adaptation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2962–2971, Los Alamitos, CA, USA, Jul 2017.
- Sepico: Semantic-guided pixel contrast for domain adaptive semantic segmentation. IEEE Transactions on Pattern Analysis &; Machine Intelligence, (01):1–17, Jan 2023.
- Side-scan sonar image segmentation based on multi-channel cnn for auv navigation. Frontiers in Neurorobotics, 16:928206, 2022a.
- Semantic segmentation of side-scan sonar images with few samples. Electronics, 11(19), 2022b.
- Object-contextual representations for semantic segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), page 173–190, Berlin, Heidelberg, 2020.
- Destseg: Segmentation guided denoising student-teacher for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3914–3923, 2023.
- Scene parsing through ade20k dataset. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5122–5130, 2017.