Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
143 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection (2405.06185v1)

Published 10 May 2024 in cs.CV

Abstract: In everyday indoor navigation, robots often needto detect non-distinctive small-change objects (e.g., stationery,lost items, and junk, etc.) to maintain domain knowledge. Thisis most relevant to ground-view change detection (GVCD), a recently emerging research area in the field of computer vision.However, these existing techniques rely on high-quality class-specific object priors to regularize a change detector modelthat cannot be applied to semantically nondistinctive smallobjects. To address ill-posedness, in this study, we explorethe concept of degree-of-ill-posedness (DoI) from the newperspective of GVCD, aiming to improve both passive and activevision. This novel DoI problem is highly domain-dependent,and manually collecting fine-grained annotated training datais expensive. To regularize this problem, we apply the conceptof self-supervised learning to achieve efficient DoI estimationscheme and investigate its generalization to diverse datasets.Specifically, we tackle the challenging issue of obtaining self-supervision cues for semantically non-distinctive unseen smallobjects and show that novel "oversegmentation cues" from openvocabulary semantic segmentation can be effectively exploited.When applied to diverse real datasets, the proposed DoI modelcan boost state-of-the-art change detection models, and it showsstable and consistent improvements when evaluated on real-world datasets.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. Dr-tanet: Dynamic receptive temporal attention network for street scene change detection. In 2021 IEEE Intelligent Vehicles Symposium (IV), pages 502–509. IEEE, 2021.
  2. Weakly supervised silhouette-based semantic scene change detection. In 2020 IEEE International conference on robotics and automation (ICRA), pages 6861–6867. IEEE, 2020.
  3. Predicting matchability. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.
  4. Segment anything. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 3992–4003. IEEE, 2023.
  5. Fast change detection for camera-based surveillance systems. In 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pages 2481–2486, 2016.
  6. Land cover change detection with vhr satellite imagery based on multi-scale slic-cnn and scae features. IEEE Access, 8:228070–228087, 2020.
  7. Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Transactions on Geoscience and Remote Sensing, 57(1):574–586, 2019.
  8. Landslide inventory mapping from bitemporal images using deep convolutional neural networks. IEEE Geoscience and Remote Sensing Letters, 16(6):982–986, 2019.
  9. Change detection from a street image pair using CNN features and superpixel segmentation. In Xianghua Xie, Mark W. Jones, and Gary K. L. Tam, editors, Proceedings of the British Machine Vision Conference 2015, BMVC 2015, Swansea, UK, September 7-10, 2015, pages 61.1–61.12. BMVA Press, 2015.
  10. Dense optical flow based change detection network robust to difference of camera viewpoints. arXiv preprint arXiv:1712.02941, 2017.
  11. Flownet: Learning optical flow with convolutional networks. In Proceedings of the IEEE international conference on computer vision, pages 2758–2766, 2015.
  12. Domain invariant siamese attention mask for small object change detection via everyday indoor robot navigation. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 739–745. IEEE, 2022.
  13. Lifelong change detection: Continuous domain adaptation for small object change detection in everyday robot navigation. In 2023 18th International Conference on Machine Vision and Applications (MVA), pages 1–5. IEEE, 2023.
  14. Real-time small-object change detection from ground vehicles using a siamese convolutional neural network. Journal of Imaging Science and Technology, 63(6):060402, 2019.
  15. A comprehensive survey on pretrained foundation models: A history from bert to chatgpt, 2023.
  16. Bert: Pre-training of deep bidirectional transformers for language understanding, 2019.
  17. Learning transferable visual models from natural language supervision. In International conference on machine learning, pages 8748–8763. PMLR, 2021.
  18. Scaling up visual and vision-language representation learning with noisy text supervision. In International conference on machine learning, pages 4904–4916. PMLR, 2021.
  19. Improved baselines with visual instruction tuning. arXiv preprint arXiv:2310.03744, 2023.
  20. Alpha-clip: A clip model focusing on wherever you want. arXiv preprint arXiv:2312.03818, 2023.
  21. Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality, March 2023.
  22. Semantic scene difference detection in daily life patroling by mobile robots using pre-trained large-scale vision-language model. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3228–3233. IEEE, 2023.
  23. City-scale change detection in cadastral 3d models using images. In Proceedings of the IEEE Conference on computer Vision and Pattern Recognition, pages 113–120, 2013.
  24. Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740–755. Springer, 2014.
  25. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
  26. Grounding dino: Marrying dino with grounded pre-training for open-set object detection. arXiv preprint arXiv:2303.05499, 2023.
  27. Paddleseg: A high-efficient development toolkit for image segmentation. arXiv preprint arXiv:2101.06175, 2021.
  28. PaddlePaddle Authors. Paddleseg, end-to-end image segmentation kit based on paddlepaddle. https://github.com/PaddlePaddle/PaddleSeg, 2019.
  29. Learning accurate dense correspondences and when to trust them. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5714–5724, 2021.
  30. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  31. Adam: A method for stochastic optimization. In International Conference on Learning Representations (ICLR), San Diega, CA, USA, 2015.

Summary

We haven't generated a summary for this paper yet.