Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PartSLAM: Unsupervised Part-based Scene Modeling for Fast Succinct Map Matching (2306.10782v1)

Published 19 Jun 2023 in cs.CV and cs.RO

Abstract: In this paper, we explore the challenging 1-to-N map matching problem, which exploits a compact description of map data, to improve the scalability of map matching techniques used by various robot vision tasks. We propose a first method explicitly aimed at fast succinct map matching, which consists only of map-matching subtasks. These tasks include offline map matching attempts to find a compact part-based scene model that effectively explains each map using fewer larger parts. The tasks also include an online map matching attempt to efficiently find correspondence between the part-based maps. Our part-based scene modeling approach is unsupervised and uses common pattern discovery (CPD) between the input and known reference maps. This enables a robot to learn a compact map model without human intervention. We also present a practical implementation that uses the state-of-the-art CPD technique of randomized visual phrases (RVP) with a compact bounding box (BB) based part descriptor, which consists of keypoint and descriptor BBs. The results of our challenging map-matching experiments, which use a publicly available radish dataset, show that the proposed approach achieves successful map matching with significant speedup and a compact description of map data that is tens of times more compact. Although this paper focuses on the standard 2D point-set map and the BB-based part representation, we believe our approach is sufficiently general to be applicable to a broad range of map formats, such as the 3D point cloud map, as well as to general bounding volumes and other compact part representations.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Spatial learning for navigation in dynamic environments. IEEE Trans. Systems, Man, and Cybernetics, Part B, 26(3):496–505, 1996.
  2. Sparse local submap joining filter for building large-scale maps. IEEE Trans. Robotics (TRO), 24(5):1121–1130, 2008.
  3. Dense reconstruction on-the-fly. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 1450–1457, 2012.
  4. Mobile robot localization and mapping with uncertainty using scale-invariant visual landmarks. I. J. Robotic Res., 21(8):735–760, 2002.
  5. Highly scalable appearance-only slam - fab-map 2.0. In Robotics: Science and Systems, 2009.
  6. Closing the loop in appearance-guided omnidirectional visual odometry by using vocabulary trees. Robot. Auton. Syst., 58(6):820–827, 2010.
  7. The representation and matching of pictorial structures. IEEE Trans. Computers, C-22(1):67 – 92, 1973.
  8. Combined object categorization and segmentation with an implicit shape model. In Euro. Conf. Compuer Vision (ECCV) workshop on statistical learning in computer vision, pages 17–32, 2004.
  9. Semantic segmentation using regions and parts. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 3378–3385, 2012.
  10. Common pattern discovery using earth mover s distance and local flow maximization. In IEEE Int. Conf. Computer Vision (ICCV), pages 1222–1229, 2005.
  11. Randomized visual phrases for object search. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 3100–3107, 2012.
  12. Common landmark discovery in urban scenes. IAPR Int. Conf. Machine Vision Applications, 2013.
  13. The robotics data set repository (radish), 2003.
  14. Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 32(9):1627–1645, 2010.
  15. Scene recognition and weakly supervised object localization with deformable part-based models. In ICCV, pages 1307–1314, 2011.
  16. Reconfigurable models for scene recognition. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 2775–2782, 2012.
  17. Lsh-ransac: An incremental scheme for scalable localization. In IEEE Int. Conf. Robotics and Automation (ICRA), pages 3523–3530, 2009. http://rc.his.u-fukui.ac.jp/LR.pdf.
  18. An incremental scheme for dictionary-based compressive slam. In IEEE/RSJ Int. Conf. Intelligent Robots and Systems (IROS), pages 872–879, 2011. http://rc.his.u-fukui.ac.jp/ICDCS.pdf.
  19. Dictionary-based compressive slam. SICE JCMSI Journal of SICE, 6(1):54–64, 2013. http://rc.his.u-fukui.ac.jp/DCS.pdf.
  20. Unsupervised object discovery: A comparison. International Journal of Computer Vision, 88(2):284–302, 2010.
  21. Video google: A text retrieval approach to object matching in videos. In IEEE Int. Conf. Computer Vision (ICCV), pages 1470–1477, 2003.
  22. Robust single view room structure segmentation in manhattan-like environments from stereo vision. In IEEE Int. Conf. Robotics and Automation (ICRA), pages 5315–5322, 2011.
  23. A bag-of-bounding-boxes approach to object-level view image retrieval. In Proc. SICE Annual Conference, 2013. http://rc.his.u-fukui.ac.jp/BOBB.pdf.
Citations (17)

Summary

We haven't generated a summary for this paper yet.