Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation (2312.05995v2)

Published 10 Dec 2023 in cs.CV

Abstract: Estimating the relative camera pose from $n \geq 5$ correspondences between two calibrated views is a fundamental task in computer vision. This process typically involves two stages: 1) estimating the essential matrix between the views, and 2) disambiguating among the four candidate relative poses that satisfy the epipolar geometry. In this paper, we demonstrate a novel approach that, for the first time, bypasses the second stage. Specifically, we show that it is possible to directly estimate the correct relative camera pose from correspondences without needing a post-processing step to enforce the cheirality constraint on the correspondences. Building on recent advances in certifiable non-minimal optimization, we frame the relative pose estimation as a Quadratically Constrained Quadratic Program (QCQP). By applying the appropriate constraints, we ensure the estimation of a camera pose that corresponds to a valid 3D geometry and that is globally optimal when certified. We validate our method through exhaustive synthetic and real-world experiments, confirming the efficacy, efficiency and accuracy of the proposed approach. Code is available at https://github.com/javrtg/C2P.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (70)
  1. Fast explicit diffusion for accelerated features in nonlinear scale spaces. In BMVC. British Machine Vision Association, 2013.
  2. Semidefinite relaxations for quadratically constrained quadratic programming: A review and comparisons. Mathematical programming, 129:129–157, 2011.
  3. Navigation using affine structure from motion. In ECCV. Springer, 1994.
  4. Nicolas Boumal. An introduction to optimization on smooth manifolds. Cambridge University Press, 2023.
  5. The non-convex burer-monteiro approach works on smooth semidefinite programs. In NeurIPS. Curran Associates, Inc., 2016.
  6. Convex optimization. Cambridge university press, 2004.
  7. G. Bradski. The OpenCV Library. Dr. Dobb’s Journal of Software Tools, 2000.
  8. Convex global 3d registration with lagrangian duality. In CVPR, 2017a.
  9. Cartan-sync: Fast and global se(d)-synchronization. IEEE RA-L, 2(4):2127–2134, 2017b.
  10. A Certifiably Globally Optimal Solution to the Non-Minimal Relative Pose Problem. In CVPR, 2018.
  11. A nonlinear programming algorithm for solving semidefinite programs via low-rank factorization. Mathematical programming, 95(2):329–357, 2003.
  12. A strengthened sdp relaxation for quadratic optimization over the stiefel manifold. Journal of optimization theory and applications, pages 1–20, 2023.
  13. Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE T-R0, 37(6):1874–1890, 2021.
  14. On the local stability of semidefinite relaxations. Mathematical Programming, pages 1–35, 2022.
  15. Shonan rotation averaging: Global optimality by surfing so (p)^ n so (p) n. In ECCV, pages 292–308. Springer, 2020.
  16. Superpoint: Self-supervised interest point detection and description. In CVPRW, 2018.
  17. DKM: Dense Kernelized Feature Matching for Geometry Estimation. In CVPR, 2023a.
  18. RoMa: Revisiting Robust Lossses for Dense Feature Matching. arXiv preprint arXiv:2305.15404, 2023b.
  19. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381–395, 1981.
  20. Sdpa (semidefinite programming algorithm) user’s manual—version 6.2. 0. Department of Mathematical and Com-puting Sciences, Tokyo Institute of Technology. Research Reports on Mathematical and Computing Sciences Series B: Operations Research, 2002.
  21. Fast and robust certifiable estimation of the relative pose between two calibrated cameras. Journal of Mathematical Imaging and Vision, 63(8):1036–1056, 2021.
  22. Certifiable relative pose estimation. Image and Vision Computing, 109:104142, 2021.
  23. A tighter relaxation for the relative pose problem between cameras. Journal of Mathematical Imaging and Vision, 64(5):493–505, 2022.
  24. Matthew Peter Giamou. Semidefinite Relaxations for Geometric Problems in Robotics. PhD thesis, University of Toronto (Canada), 2023.
  25. Semidefinite relaxations for robust multiview triangulation. In CVPR, 2023.
  26. R.I. Hartley. In defense of the eight-point algorithm. IEEE TPAMI, 19(6):580–593, 1997.
  27. Rotation averaging. IJCV, 103:267–305, 2013.
  28. Global optimization through rotation space search. IJCV, 82(1):64–79, 2009.
  29. Multiple View Geometry in Computer Vision. Cambridge University Press, ISBN: 0521540518, second edition, 2004.
  30. Essential matrix estimation using gauss-newton iterations on a manifold. IJCV, 74:117–136, 2007.
  31. Nicholas J Higham. Matrix nearness problems and applications. Applications of matrix theory, 22, 1989.
  32. Matrix Analysis. Cambridge university press, 2012.
  33. SDPT3 — A Matlab software package for semidefinite programming, Version 1.3. Optimization Methods and Software, 11(1-4):545–581, 1999.
  34. Essential matrix estimation using convex relaxations in orthogonal space. In ICCV, pages 17142–17152, 2023.
  35. Opengv: A unified and generalized approach to real-time calibrated geometric vision. In IEEE ICRA, pages 1–8, 2014.
  36. Direct optimization of frame-to-frame rotation. In ICCV, 2013.
  37. Finding the exact rotation between two images independently of the translation. In ECCV, pages 696–709. Springer, 2012.
  38. Triangulation: Why optimize? In BMVC, 2019.
  39. Geometric interpretations of the normalized epipolar error. arXiv preprint arXiv:2008.01254, 2020.
  40. Hara: A hierarchical approach for robust rotation averaging. In CVPR, pages 15777–15786, 2022.
  41. Lightglue: Local feature matching at light speed. In ICCV, 2023.
  42. H Christopher Longuet-Higgins. A computer algorithm for reconstructing a scene from two projections. Nature, 293(5828):133–135, 1981.
  43. David G Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60:91–110, 2004.
  44. OpenMVG: Open multiple view geometry. In International Workshop on Reproducible Research in Pattern Recognition, pages 60–74. Springer, 2016.
  45. The probabilistic normal epipolar constraint for frame-to-frame rotation optimization under uncertain feature positions. In CVPR, 2022.
  46. Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras. IEEE T-R0, 33(5):1255–1262, 2017.
  47. Orb-slam: A versatile and accurate monocular slam system. IEEE T-R0, 31(5):1147–1163, 2015.
  48. D. Nister. An efficient solution to the five-point relative pose problem. IEEE TPAMI, 26(6):756–770, 2004.
  49. Usac: A universal framework for random sample consensus. IEEE TPAMI, 35(8):2022–2038, 2012.
  50. Se-sync: A certifiably correct algorithm for synchronization over the special euclidean group. The International Journal of Robotics Research, 38(2-3):95–125, 2019.
  51. Orbitopes. Mathematika, 57(2):275–314, 2011.
  52. Semidefinite descriptions of the convex hull of rotation matrices. SIAM Journal on Optimization, 25(3):1314–1343, 2015.
  53. Structure-from-motion revisited. In CVPR, 2016.
  54. Recent developments on direct relative orientation. ISPRS Journal of Photogrammetry and Remote Sensing, 60(4):284–294, 2006.
  55. its applications. Academic Press, New York, 14:181208, 1980.
  56. On benchmarking camera calibration and multi-view stereo for high resolution imagery. In CVPR, 2008.
  57. Jos F. Sturm. Using sedumi 1.02, a MATLAB toolbox for optimization over symmetric cones. Optimization Methods and Software, 11(1-4):625–653, 1999.
  58. Loftr: Detector-free local feature matching with transformers. In CVPR, 2021.
  59. On the quotient representation for the essential manifold. In CVPR, 2014.
  60. The space of essential matrices as a riemannian quotient manifold. SIAM Journal on Imaging Sciences, 10(3):1416–1445, 2017.
  61. PDC-Net+: Enhanced Probabilistic Dense Correspondence Network. IEEE TPAMI, 45(8):10247–10266, 2023.
  62. Charles F Van Loan. The ubiquitous kronecker product. Journal of computational and applied mathematics, 123(1-2):85–100, 2000.
  63. Semidefinite programming. SIAM review, 38(1):49–95, 1996.
  64. A high-performance software package for semidefinite programs: Sdpa 7. Technical report, 2010.
  65. A quaternion-based certifiably optimal solution to the wahba problem with outliers. In ICCV, 2019.
  66. Graduated non-convexity for robust spatial perception: From non-minimal solvers to global outlier rejection. IEEE RA-L, 5(2):1127–1134, 2020.
  67. Teaser: Fast and certifiable point cloud registration. IEEE T-R0, 37(2):314–333, 2021.
  68. Optimal essential matrix estimation via inlier-set maximization. In ECCV. Springer, 2014.
  69. Ji Zhao. An efficient solution to non-minimal case essential matrix estimation. IEEE TPAMI, 44(4):1777–1792, 2022.
  70. A certifiably globally optimal solution to generalized essential matrix estimation. In CVPR, 2020.

Summary

  • The paper introduces a certifiably optimal approach that directly estimates the relative pose from correspondences, bypassing the traditional two-stage method.
  • The methodology frames the pose problem as a Quadratically Constrained Quadratic Program relaxed to a Semidefinite Program to ensure global optimality.
  • Experimental results show improved runtime and accuracy over state-of-the-art methods, enhancing real-time performance in SfM and SLAM applications.

Non-minimal Certifiably Optimal Relative Pose Estimation

This paper presents a novel approach to estimating the relative pose between two calibrated camera views, focusing on bypassing the traditional two-stage process. The authors propose a method that directly estimates the correct relative camera pose from correspondences without requiring the disambiguation step typically necessary to enforce the cheirality constraint, which demands that 3D points lie in front of the camera. This paper frames the relative pose estimation problem as a Quadratically Constrained Quadratic Program (QCQP), utilizing advancements in certifiable non-minimal optimization.

Summary of the Approach

The traditional process of estimating relative camera pose involves two stages: first, estimating the essential matrix from at least five correspondences, and second, disambiguating among the four candidate poses satisfying the epipolar geometry. The novel contribution of this paper is the introduction of a certifiably optimal approach that eliminates the need for the second stage. The method leverages the properties of the camera projection geometry to directly solve the relative pose in a single optimization step.

Key to this approach is the integration of geometric constraints into the optimization problem. The authors utilize a QCQP, which is relaxed to a Semidefinite Program (SDP) to ensure a global and certifiable solution. The mathematical framework shows that by incorporating convex optimization techniques, it is feasible to ensure that the estimated pose corresponds to the valid 3D geometry. This is a substantial advancement over methods that separately handle these steps, typically using non-convex optimization strategies.

Results and Implications

The paper provides comprehensive validation of the proposed method through synthetic and real-world experiments. The results demonstrate that the method is both effective and efficient, achieving high accuracy relative to existing approaches. Performance is compared against state-of-the-art methods such as those presented by Briales et al. and Garc�a-Salguero et al., with the novel approach outperforming these in terms of runtime efficiency and pose accuracy.

One of the significant implications of this research is the potential improvements in applications such as Structure from Motion (SfM) and Simultaneous Localization and Mapping (SLAM). By reducing the computational complexity and eliminating the need for post-processing disambiguation of the pose, this method could enhance real-time performance and robustness of localization and mapping systems in dynamic environments.

Future Developments

This research opens new avenues for enhancements in pose estimation algorithms, particularly in the field of incorporating convex optimization schemes to ensure globally optimal solutions without the overhead of iterative disambiguation. Future work could explore the integration of this method into larger SLAM systems and robotic vision applications, as well as extending the technique to incorporate more complex camera models and multi-view scenarios.

Overall, this paper contributes to the corpus of computer vision by simplifying and improving the computational aspects of a foundational problem. The authors have laid groundwork for potentially significant improvements in practical computer vision systems by efficiently solving the relative pose estimation problem with a novel one-step globally optimal approach.

Youtube Logo Streamline Icon: https://streamlinehq.com