CrossLoc3D: Aerial-Ground Cross-Source 3D Place Recognition (2303.17778v2)
Abstract: We present CrossLoc3D, a novel 3D place recognition method that solves a large-scale point matching problem in a cross-source setting. Cross-source point cloud data consists of point sets captured by depth sensors with different accuracies or from different distances and perspectives. We address the challenge of developing 3D place recognition methods that account for the representation gap between points captured by different sources. Our method handles cross-source data by utilizing multi-grained features and selecting convolution kernel sizes that correspond to the most prominent features. Inspired by diffusion models, our method uses a novel iterative refinement process that gradually shifts the embedding spaces from different sources to a single canonical space for better metric learning. In addition, we present CS-Campus3D, the first 3D aerial-ground cross-source dataset consisting of point cloud data from both aerial and ground LiDAR scans. The point clouds in CS-Campus3D have representation gaps and differ in other respects, such as views, point densities, and noise patterns. We show that our CrossLoc3D algorithm achieves an improvement of 4.74% to 15.37% in top-1 average recall on our CS-Campus3D benchmark and performs comparably to state-of-the-art 3D place recognition methods on the Oxford RobotCar dataset. The code and CS-Campus3D benchmark will be available at github.com/rayguan97/crossloc3d.
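The headline metric above, top-1 average recall, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `top_k_recall`, the use of squared Euclidean distance between global descriptors, and the representation of ground truth as a set of database indices per query are all assumptions made for illustration.

```python
import numpy as np


def top_k_recall(query_emb, db_emb, positives, k=1):
    """Fraction of queries whose k nearest database descriptors contain a true match.

    query_emb: (Q, C) array of query descriptors (hypothetical layout).
    db_emb:    (D, C) array of database descriptors.
    positives: list of Q sets; positives[q] holds the database indices that
               count as correct matches for query q (assumed ground truth).
    """
    query_emb = np.asarray(query_emb, dtype=float)
    db_emb = np.asarray(db_emb, dtype=float)

    # Pairwise squared Euclidean distances, shape (Q, D).
    diff = query_emb[:, None, :] - db_emb[None, :, :]
    dist = np.sum(diff ** 2, axis=-1)

    # Indices of the k closest database entries for every query.
    nearest = np.argsort(dist, axis=1)[:, :k]

    hits = 0
    evaluated = 0
    for q, pos in enumerate(positives):
        if not pos:
            continue  # queries without any true match are skipped
        evaluated += 1
        if pos.intersection(nearest[q].tolist()):
            hits += 1
    return hits / evaluated if evaluated else 0.0
```

In a cross-source setting such as the one described, the queries would typically come from one source (for example ground scans) and the database from the other (aerial scans); with `k=1` the function returns the top-1 recall for that query set.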