PartSLAM: Unsupervised Part-based Scene Modeling for Fast Succinct Map Matching (2306.10782v1)
Abstract: In this paper, we explore the challenging 1-to-N map matching problem, which exploits a compact description of map data, to improve the scalability of map matching techniques used by various robot vision tasks. We propose a first method explicitly aimed at fast succinct map matching, which consists only of map-matching subtasks. These tasks include offline map matching attempts to find a compact part-based scene model that effectively explains each map using fewer larger parts. The tasks also include an online map matching attempt to efficiently find correspondence between the part-based maps. Our part-based scene modeling approach is unsupervised and uses common pattern discovery (CPD) between the input and known reference maps. This enables a robot to learn a compact map model without human intervention. We also present a practical implementation that uses the state-of-the-art CPD technique of randomized visual phrases (RVP) with a compact bounding box (BB) based part descriptor, which consists of keypoint and descriptor BBs. The results of our challenging map-matching experiments, which use a publicly available radish dataset, show that the proposed approach achieves successful map matching with significant speedup and a compact description of map data that is tens of times more compact. Although this paper focuses on the standard 2D point-set map and the BB-based part representation, we believe our approach is sufficiently general to be applicable to a broad range of map formats, such as the 3D point cloud map, as well as to general bounding volumes and other compact part representations.
- Spatial learning for navigation in dynamic environments. IEEE Trans. Systems, Man, and Cybernetics, Part B, 26(3):496–505, 1996.
- Sparse local submap joining filter for building large-scale maps. IEEE Trans. Robotics (TRO), 24(5):1121–1130, 2008.
- Dense reconstruction on-the-fly. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 1450–1457, 2012.
- Mobile robot localization and mapping with uncertainty using scale-invariant visual landmarks. I. J. Robotic Res., 21(8):735–760, 2002.
- Highly scalable appearance-only slam - fab-map 2.0. In Robotics: Science and Systems, 2009.
- Closing the loop in appearance-guided omnidirectional visual odometry by using vocabulary trees. Robot. Auton. Syst., 58(6):820–827, 2010.
- The representation and matching of pictorial structures. IEEE Trans. Computers, C-22(1):67 – 92, 1973.
- Combined object categorization and segmentation with an implicit shape model. In Euro. Conf. Compuer Vision (ECCV) workshop on statistical learning in computer vision, pages 17–32, 2004.
- Semantic segmentation using regions and parts. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 3378–3385, 2012.
- Common pattern discovery using earth mover s distance and local flow maximization. In IEEE Int. Conf. Computer Vision (ICCV), pages 1222–1229, 2005.
- Randomized visual phrases for object search. In IEEE Int. Conf. Computer Vision and Pattern Recognition (CVPR), pages 3100–3107, 2012.
- Common landmark discovery in urban scenes. IAPR Int. Conf. Machine Vision Applications, 2013.
- The robotics data set repository (radish), 2003.
- Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 32(9):1627–1645, 2010.
- Scene recognition and weakly supervised object localization with deformable part-based models. In ICCV, pages 1307–1314, 2011.
- Reconfigurable models for scene recognition. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 2775–2782, 2012.
- Lsh-ransac: An incremental scheme for scalable localization. In IEEE Int. Conf. Robotics and Automation (ICRA), pages 3523–3530, 2009. http://rc.his.u-fukui.ac.jp/LR.pdf.
- An incremental scheme for dictionary-based compressive slam. In IEEE/RSJ Int. Conf. Intelligent Robots and Systems (IROS), pages 872–879, 2011. http://rc.his.u-fukui.ac.jp/ICDCS.pdf.
- Dictionary-based compressive slam. SICE JCMSI Journal of SICE, 6(1):54–64, 2013. http://rc.his.u-fukui.ac.jp/DCS.pdf.
- Unsupervised object discovery: A comparison. International Journal of Computer Vision, 88(2):284–302, 2010.
- Video google: A text retrieval approach to object matching in videos. In IEEE Int. Conf. Computer Vision (ICCV), pages 1470–1477, 2003.
- Robust single view room structure segmentation in manhattan-like environments from stereo vision. In IEEE Int. Conf. Robotics and Automation (ICRA), pages 5315–5322, 2011.
- A bag-of-bounding-boxes approach to object-level view image retrieval. In Proc. SICE Annual Conference, 2013. http://rc.his.u-fukui.ac.jp/BOBB.pdf.