Papers
Topics
Authors
Recent
2000 character limit reached

SA-MixNet: Structure-aware Mixup and Invariance Learning for Scribble-supervised Road Extraction in Remote Sensing Images (2403.01381v1)

Published 3 Mar 2024 in cs.CV

Abstract: Mainstreamed weakly supervised road extractors rely on highly confident pseudo-labels propagated from scribbles, and their performance often degrades gradually as the image scenes tend various. We argue that such degradation is due to the poor model's invariance to scenes with different complexities, whereas existing solutions to this problem are commonly based on crafted priors that cannot be derived from scribbles. To eliminate the reliance on such priors, we propose a novel Structure-aware Mixup and Invariance Learning framework (SA-MixNet) for weakly supervised road extraction that improves the model invariance in a data-driven manner. Specifically, we design a structure-aware Mixup scheme to paste road regions from one image onto another for creating an image scene with increased complexity while preserving the road's structural integrity. Then an invariance regularization is imposed on the predictions of constructed and origin images to minimize their conflicts, which thus forces the model to behave consistently on various scenes. Moreover, a discriminator-based regularization is designed for enhancing the connectivity meanwhile preserving the structure of roads. Combining these designs, our framework demonstrates superior performance on the DeepGlobe, Wuhan, and Massachusetts datasets outperforming the state-of-the-art techniques by 1.47%, 2.12%, 4.09% respectively in IoU metrics, and showing its potential of plug-and-play. The code will be made publicly available.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. Slic superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, page 2274–2282, 2012.
  2. Spin road mapper: Extracting roads from aerial images via spatial and interaction space graph reasoning for autonomous driving. pages 343–350, 2022.
  3. Improved road connectivity by joint learning of orientation and segmentation. In CVPR, 2019.
  4. Deepglobe 2018: A challenge to parse the earth through satellite images. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2018.
  5. Fmix: Enhancing mixed sample data augmentation. arXiv preprint arXiv:2002.12047, 2020.
  6. Weakly supervised learning with deep convolutional neural networks for semantic segmentation: Understanding semantic layout of images with minimum human supervision. IEEE Signal Processing Magazine, 34(6):39–49, 2017.
  7. Puzzle mix: Exploiting saliency and local statistics for optimal mixup. International Conference on Machine Learning, 2020.
  8. Co-mixup: Saliency guided joint mixup with supermodular diversity. arXiv preprint arXiv:2102.03065, 2021.
  9. Temporal ensembling for semi-supervised learning. arXiv preprint arXiv:1610.02242, 2016.
  10. Topology-enhanced urban road extraction via a geographic feature-enhanced network. IEEE Transactions on Geoscience and Remote Sensing, page 8819–8830, 2020.
  11. Weakly supervised road segmentation in high-resolution remote sensing images using point annotations. IEEE Transactions on Geoscience and Remote Sensing, PP(99):1–13, 2021.
  12. Scribblesup: Scribble-supervised convolutional networks for semantic segmentation. pages 3159–3167, 2016.
  13. Coanet: Connectivity attention network for road extraction from satellite imagery. IEEE Transactions on Image Processing, page 8540–8552, 2021.
  14. Volodymyr Mnih. Machine learning for aerial image labeling. 2013.
  15. Gated crf loss for weakly supervised semantic image segmentation. arXiv preprint arXiv:1906.04651, 2019.
  16. Semi-supervised semantic segmentation with cross-consistency training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
  17. Learning self-supervised low-rank network for single-stage weakly and semi-supervised semantic segmentation. International Journal of Computer Vision, 130(5):1181–1195, 2022.
  18. Semi-supervised learning with ladder networks. Advances in neural information processing systems, 28, 2015.
  19. Regularization with stochastic transformations and perturbations for deep semi-supervised learning, 2016.
  20. A seeded image segmentation framework unifying graph cuts and random walker which yields a new algorithm. In 2007 IEEE 11th International Conference on Computer Vision, 2007.
  21. Consistency regularization for adversarial robustness. Proceedings of the AAAI Conference on Artificial Intelligence, page 8414–8422, 2022.
  22. Normalized cut loss for weakly-supervised cnn segmentation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.
  23. Stacked u-nets with multi-output for road extraction. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2018.
  24. Boundary perception guidance: A scribble-supervised semantic segmentation approach. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019.
  25. Scribble-based weakly supervised deep learning for road surface extraction from remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, (99), 2021.
  26. Road extraction from very high resolution images using weakly labeled openstreetmap centerline. International Journal of Geo-Information, (11), 2019.
  27. Generative mixup networks for zero-shot learning. IEEE Transactions on Neural Networks and Learning Systems, page 1–12, 2022.
  28. Cutmix: Regularization strategy to train strong classifiers with localizable features. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019.
  29. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412, 2017.
  30. Weakly-supervised salient object detection via scribble annotations. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
  31. Cyclemix: A holistic strategy for medical image segmentation from scribble supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11656–11665, 2022.
  32. Split depth-wise separable graph-convolution network for road extraction in complex environments from high-resolution remote-sensing images. IEEE Transactions on Geoscience and Remote Sensing, page 1–15, 2022a.
  33. D-linknet: Linknet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2018.
  34. Large-scale road extraction from high-resolution remote sensing images based on a weakly-supervised structural and orientational consistency constraint network. ISPRS Journal of Photogrammetry and Remote Sensing, 193:234–251, 2022b.
  35. Unpaired image-to-image translation using cycle-consistent adversarial networks. In 2017 IEEE International Conference on Computer Vision (ICCV), 2017.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.