Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
120 tokens/sec
GPT-4o
10 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
3 tokens/sec
DeepSeek R1 via Azure Pro
51 tokens/sec
2000 character limit reached

STARS: Zero-shot Sim-to-Real Transfer for Segmentation of Shipwrecks in Sonar Imagery (2310.01667v1)

Published 2 Oct 2023 in cs.CV

Abstract: In this paper, we address the problem of sim-to-real transfer for object segmentation when there is no access to real examples of an object of interest during training, i.e. zero-shot sim-to-real transfer for segmentation. We focus on the application of shipwreck segmentation in side scan sonar imagery. Our novel segmentation network, STARS, addresses this challenge by fusing a predicted deformation field and anomaly volume, allowing it to generalize better to real sonar images and achieve more effective zero-shot sim-to-real transfer for image segmentation. We evaluate the sim-to-real transfer capabilities of our method on a real, expert-labeled side scan sonar dataset of shipwrecks collected from field work surveys with an autonomous underwater vehicle (AUV). STARS is trained entirely in simulation and performs zero-shot shipwreck segmentation with no additional fine-tuning on real data. Our method provides a significant 20% increase in segmentation performance for the targeted shipwreck class compared to the best baseline.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (40)
  1. Zero-shot semantic segmentation. In Advances in Neural Information Processing Systems, volume 32, 2019.
  2. On-line multi-class segmentation of side-scan sonar imagery using an autonomous underwater vehicle. Journal of Marine Science and Engineering, 8(8):557, 2020.
  3. Blender Online Community. Blender - a 3d modelling and rendering package, 2018. URL http://www.blender.org.
  4. The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3213–3223, June 2016.
  5. Deformable convolutional networks. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 764–773, 2017.
  6. Cut, paste and learn: Surprisingly easy synthesis for instance detection. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 1310–1319, 2017.
  7. A deep learning approach to target recognition in side-scan sonar imagery. In OCEANS 2018 MTS/IEEE Charleston, pages 1–4, 2018.
  8. Pøda: Prompt-driven zero-shot domain adaptation. In ICCV, 2023.
  9. Detection of boulders in side scan sonar mosaics by a neural network. Geosciences, 9(4):159, 2019.
  10. Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17(1):2096–2030, Jan 2016.
  11. Side-scan sonar image classification based on style transfer and pre-trained convolutional neural networks. Electronics, 10(15):1823, 2021.
  12. From pixel to patch: Synthesize context-aware features for zero-shot semantic segmentation. IEEE Transactions on Neural Networks and Learning Systems, pages 1–15, 2022.
  13. CyCADA: Cycle-consistent adversarial domain adaptation. In Proceedings of the 35th International Conference on Machine Learning, volume 80, pages 1989–1998. PMLR, 10–15 Jul 2018.
  14. Hrda: Context-aware high-resolution domain-adaptive semantic segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), page 372–391, Berlin, Heidelberg, 2022.
  15. Side-scan sonar image synthesis based on generative adversarial network for images in multiple frequencies. IEEE Geoscience and Remote Sensing Letters, 18(9):1505–1509, 2021.
  16. L.M. Linnett J.M. Bell. Simulation and analysis of synthetic sidescan sonar images. IEE Proc. - Radar, Sonar and Navigation, 144:219–226(7), August 1997.
  17. Revisiting image pyramid structure for high resolution salient object detection. In Proceedings of the Asian Conference on Computer Vision, pages 108–124, 2022.
  18. Segment anything. arXiv:2304.02643, 2023.
  19. Data augmentation using image translation for underwater sonar image segmentation. PLOS ONE, 17(8):1–15, 08 2022.
  20. Deep learning from shallow dives: Sonar image generation and training for underwater object detection. CoRR, abs/1810.07990, 2018.
  21. Deep learning based object detection via style-transferred underwater sonar images. IFAC-PapersOnLine, 52(21):152–155, 2019.
  22. Machine learning techniques for auv side-scan sonar data feature extraction as applied to intelligent search for underwater archaeological sites. In Field and Service Robotics, pages 219–233, Singapore, 2021.
  23. Panda: Adapting pretrained features for anomaly detection and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2806–2814, 2021.
  24. Blainder—a blender ai add-on for generation of semantically labeled depth-sensing data. Sensors, 21(6), 2021.
  25. Towards total recall in industrial anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 14318–14328, June 2022.
  26. An introduction to the sonar equations with applications. 1976.
  27. Towards sim2real for shipwreck detection in side scan sonar imagery. 3rd Workshop on Closing the Reality Gap in Sim2Real Transfer, Robotics: Science and Systems, 2022.
  28. Synthetic sonar image simulation with various seabed conditions for automatic target recognition. In OCEANS 2022, Hampton Roads, pages 1–8, 2022.
  29. Deep high-resolution representation learning for human pose estimation. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5686–5696, 2019.
  30. Raft: Recurrent all-pairs field transforms for optical flow. In Proceedings of the European Conference on Computer Vision (ECCV), pages 402–419, 2020.
  31. Thunder Bay National Marine Sanctuary. Thunder Bay National Marine Sanctuary. https://thunderbay.noaa.gov/, Accessed online: 2023.
  32. Minet: Efficient deep learning automatic target recognition for small autonomous vehicles. IEEE Geoscience and Remote Sensing Letters, 18(6):1014–1018, 2021.
  33. TURBOSQUID. 3D Models for Professionals. https://www.turbosquid.com, Accessed online: 2022.
  34. Adversarial discriminative domain adaptation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2962–2971, Los Alamitos, CA, USA, Jul 2017.
  35. Sepico: Semantic-guided pixel contrast for domain adaptive semantic segmentation. IEEE Transactions on Pattern Analysis &; Machine Intelligence, (01):1–17, Jan 2023.
  36. Side-scan sonar image segmentation based on multi-channel cnn for auv navigation. Frontiers in Neurorobotics, 16:928206, 2022a.
  37. Semantic segmentation of side-scan sonar images with few samples. Electronics, 11(19), 2022b.
  38. Object-contextual representations for semantic segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), page 173–190, Berlin, Heidelberg, 2020.
  39. Destseg: Segmentation guided denoising student-teacher for anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3914–3923, 2023.
  40. Scene parsing through ade20k dataset. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5122–5130, 2017.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.