Fully Geometric Panoramic Localization (2403.19904v1)
Abstract: We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines. Given a pre-captured 3D map, our approach localizes a panorama image, taking advantage of the holistic 360 view. The system mitigates potential privacy breaches or domain discrepancies by avoiding trained or hand-crafted visual descriptors. However, as lines alone can be ambiguous, we express distinctive yet compact spatial contexts from relationships between lines, namely the dominant directions of parallel lines and the intersection between non-parallel lines. The resulting representations are efficient in processing time and memory compared to conventional visual descriptor-based methods. Given the groups of dominant line directions and their intersections, we accelerate the search process to test thousands of pose candidates in less than a millisecond without sacrificing accuracy. We empirically show that the proposed 2D-3D matching can localize panoramas for challenging scenes with similar structures, dramatic domain shifts or illumination changes. Our fully geometric approach does not involve extensive parameter tuning or neural network training, making it a practical algorithm that can be readily deployed in the real world. Project page including the code is available through this link: https://82magnolia.github.io/fgpl/.
- L2d2: Learnable line detector and descriptor. In 2021 International Conference on 3D Vision (3DV), pages 442–452, 2021.
- Cvxpnpl: A unified convex solution to the absolute pose estimation problem from point and line correspondences. Journal of Mathematical Imaging and Vision, 65:492–512, 2019.
- NetVLAD: CNN architecture for weakly supervised place recognition. In IEEE Conference on Computer Vision and Pattern Recognition, 2016.
- Joint 2d-3d-semantic data for indoor scene understanding. arXiv preprint arXiv:1702.01105, 2017.
- Least-squares fitting of two 3-d point sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-9(5):698–700, 1987.
- Rethinking visual geo-localization for large-scale applications. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4878–4888, 2022.
- Trusting the cloud. SIGACT News, 40(2):81–86, 2009.
- Globally-optimal inlier set maximisation for camera pose and correspondence estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, page preprint, 2018.
- The alignment of the spheres: Globally-optimal spherical mixture alignment for camera pose estimation. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), page to appear, Long Beach, USA, 2019. IEEE.
- Solving the blind perspective-n-point problem end-to-end with robust differentiable geometric optimization. In Proceedings of the European Conference on Computer Vision (ECCV), page preprint. Springer, 2020.
- Analysis and mitigations of reverse engineering attacks on local feature descriptors, 2021.
- Superpoint: Self-supervised interest point detection and description. In CVPR Deep Learning for Visual SLAM Workshop, 2018.
- Privacy-preserving image features via adversarial affine subspace embeddings. In Computer Vision and Pattern Recognition (CVPR 2021). CVF/IEEE, 2021.
- Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 24(6):381–395, 1981.
- Pose refinement with joint optimization of visual points and lines. 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2888–2894, 2021a.
- Pose refinement with joint optimization of visual points and lines. 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2888–2894, 2021b.
- Self-supervising fine-grained region similarities for large-scale image localization. In European Conference on Computer Vision, 2020.
- Light spanners for snowflake metrics. In Proceedings of the Thirtieth Annual Symposium on Computational Geometry, page 387–395, New York, NY, USA, 2014. Association for Computing Machinery.
- Lsd: A fast line segment detector with a false detection control. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(4):722–732, 2010.
- Hdpl: a hybrid descriptor for points and lines based on graph neural networks. Ind. Robot, 48:737–744, 2021.
- Batch effects in single-cell rna-sequencing data are corrected by matching mutual nearest neighbors. Nature biotechnology, 36, 2018.
- Peter Hall. On representatives of subsets. Journal of The London Mathematical Society-second Series, pages 26–30, 1935.
- A direct least-squares (dls) method for pnp. In 2011 International Conference on Computer Vision, pages 383–390, 2011.
- Efficient 3d scene abstraction using line segments. Computer Vision and Image Understanding, 157:167–178, 2017. Large-Scale 3D Modeling of Urban Indoor or Outdoor Scenes from Images and Range Scans.
- Lalaloc++: Global floor plan comprehension for layout localisation in unvisited environments. 2022.
- Lalaloc: Latent layout localisation in dynamic, unvisited environments. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 10107–10116, 2021.
- W. Kabsch. A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A, 32(5):922–923, 1976.
- Stochastic estimation of the maximum of a regression function. Ann. Math. Statist., 23(3):462–466, 1952.
- A novel line matching method based on intersection context. In 2010 IEEE International Conference on Robotics and Automation, pages 1014–1021, 2010.
- Piccolo: Point cloud-centric omnidirectional localization. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 3313–3323, 2021.
- Cpo: Change robust panorama to point cloud localization. ECCV, 2022.
- Ldl: Line distance functions for panoramic localization. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 17882–17892, 2023.
- Adam: A method for stochastic optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015.
- DLD: A Deep Learning Based Line Descriptor for Line Feature Matching. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5910–5915. IEEE, 2019.
- William Leeb. Approximating snowflake metrics by trees. Applied and Computational Harmonic Analysis, 45(2):405–424, 2018.
- Epnp: An accurate o(n) solution to the pnp problem. International Journal Of Computer Vision, 81:155–166, 2009a.
- Epnp: An accurate o(n) solution to the pnp problem. Int. J. Comput. Vision, 81(2):155–166, 2009b.
- LightGlue: Local Feature Matching at Light Speed. In ICCV, 2023.
- 3d line mapping revisited. In Computer Vision and Pattern Recognition (CVPR), 2023.
- David G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60:91–110, 2004.
- Robust line segments matching via graph convolution networks. ArXiv, abs/2004.04993, 2020.
- Wglsm: An end-to-end line matching network based on graph convolution. Neurocomput., 453(C):195–208, 2021.
- Structure from motion with line segments under relaxed endpoint constraints. In 2014 2nd International Conference on 3D Vision, pages 13–19, 2014.
- Descriptor free visual indoor localization with line segments. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3165–3173, 2015.
- Laser: Latent space rendering for 2d visual localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11122–11131, 2022.
- Pytorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32, pages 8024–8035. Curran Associates, Inc., 2019.
- Sold2: Self-supervised occlusion-aware line description and detection. In Computer Vision and Pattern Recognition (CVPR), 2021.
- Deeplsd: Line segment detection and refinement with deep image gradients. In Computer Vision and Pattern Recognition (CVPR), 2023a.
- Vanishing point estimation in uncalibrated images with prior gravity direction. In International Conference on Computer Vision (ICCV), 2023b.
- GlueStick: Robust image matching by sticking points and lines together. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023c.
- Revealing scenes by inverting structure from motion reconstructions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 145–154, 2019.
- Robust and accurate line- and/or point-based pose estimation without manhattan assumptions. In European Conference on Computer Vision, 2016.
- Line-based robust sfm with little image overlap. In 2017 International Conference on 3D Vision (3DV), pages 195–204, Los Alamitos, CA, USA, 2017. IEEE Computer Society.
- SuperGlue: Learning feature matching with graph neural networks. In CVPR, 2020.
- Structure-from-motion revisited. In Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Pixelwise view selection for unstructured multi-view stereo. In European Conference on Computer Vision (ECCV), 2016.
- Privacy preserving image-based localization. 2019a.
- Privacy preserving image queries for camera localization. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 1486–1496, 2019b.
- LoFTR: Detector-free local feature matching with transformers. CVPR, 2021.
- Elsed: Enhanced line segment drawing. Pattern Recognition, 127:108619, 2022.
- InLoc: Indoor Visual Localization with Dense Matching and View Synthesis. In CVPR 2018 - IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, United States, 2018.
- Lcd – line clustering and description for place recognition, 2020.
- Dgc-gnn: Descriptor-free geometric-color graph neural network for 2d-3d matching, 2023.
- Fast 3d line segment detection from unorganized point cloud. arXiv preprint arXiv:1901.02532, 2019.
- Line as a visual sentence: Context-aware line descriptor for visual localization. IEEE Robotics and Automation Letters, 6(4):8726–8733, 2021.
- Vlase: Vehicle localization by aggregating semantic edges. In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3196–3203, 2018.
- Rotation equivariant orientation estimation for omnidirectional localization. In ACCV, 2020.
- An efficient and robust line segment matching approach based on lbd descriptor and pairwise geometric consistency. Journal of Visual Communication and Image Representation, 24(7):794–805, 2013.
- Structured3d: A large photo-realistic dataset for structured 3d modeling. In Proceedings of The European Conference on Computer Vision (ECCV), 2020.
- Is geometry enough for matching in visual localization? In Computer Vision – ECCV 2022, pages 407–425, Cham, 2022. Springer Nature Switzerland.
- Junho Kim (57 papers)
- Jiwon Jeong (9 papers)
- Young Min Kim (37 papers)