Consistent and Asymptotically Statistically-Efficient Solution to Camera Motion Estimation (2403.01174v1)
Abstract: Given 2D point correspondences between an image pair, inferring the camera motion is a fundamental problem in computer vision. Existing works generally start from the epipolar constraint and estimate the essential matrix, which is not optimal in the maximum likelihood (ML) sense. In this paper, we return to the original measurement model with respect to the rotation matrix and normalized translation vector and formulate the ML problem. We then propose a two-step algorithm to solve it. In the first step, we estimate the variance of the measurement noise and devise a consistent estimator based on bias elimination. In the second step, we execute a one-step Gauss-Newton iteration on the manifold to refine the consistent estimate. We prove that the proposed estimate has the same asymptotic statistical properties as the ML estimate. The first is consistency, i.e., the estimate converges to the ground truth as the number of points increases. The second is asymptotic efficiency, i.e., the mean squared error of the estimate converges to the theoretical lower bound, the Cramér-Rao bound. In addition, we show that our algorithm has linear time complexity. These properties give our estimator a great advantage in the case of dense point correspondences. Experiments on both synthetic data and real images demonstrate that when the number of points reaches the order of hundreds, our estimator outperforms state-of-the-art ones in terms of estimation accuracy and CPU time.
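The epipolar constraint that the abstract names as the usual starting point can be illustrated with a minimal sketch. This is not the estimator proposed in the paper. It only checks that noise-free correspondences satisfy x2ᵀ E x1 = 0 with E = [t]× R, and that the residuals become nonzero once measurement noise is added. The convention X2 = R X1 + t, the synthetic scene, the noise level, and the helper names `skew`, `rotation_z`, and `epipolar_residuals` are all assumptions made for illustration.

```python
import numpy as np

def skew(t):
    """Cross-product matrix [t]_x, so that skew(t) @ v equals np.cross(t, v)."""
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

def rotation_z(angle):
    """Rotation by `angle` radians about the z axis (illustrative choice of R)."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s, 0.0],
                     [s, c, 0.0],
                     [0.0, 0.0, 1.0]])

def epipolar_residuals(R, t, x1, x2):
    """Return x2_i^T E x1_i for each row, with E = [t]_x R.

    x1, x2: (n, 3) arrays of homogeneous normalized image points.
    """
    E = skew(t) @ R
    return np.einsum("ni,ij,nj->n", x2, E, x1)

rng = np.random.default_rng(0)

# Assumed ground-truth motion: X2 = R @ X1 + t, with t normalized to unit length.
R = rotation_z(0.1)
t = np.array([1.0, 0.2, 0.0])
t /= np.linalg.norm(t)

# Synthetic 3D points in front of the first camera (depth between 4 and 8).
X1 = rng.uniform([-1.0, -1.0, 4.0], [1.0, 1.0, 8.0], size=(200, 3))
X2 = X1 @ R.T + t

# Project to normalized image coordinates (third component equal to 1).
x1 = X1 / X1[:, 2:3]
x2 = X2 / X2[:, 2:3]

clean = epipolar_residuals(R, t, x1, x2)

# Perturb the second-image points with small Gaussian noise (assumed level).
noisy_x2 = x2.copy()
noisy_x2[:, :2] += rng.normal(scale=1e-3, size=(200, 2))
noisy = epipolar_residuals(R, t, x1, noisy_x2)

print("max |residual| without noise:", np.max(np.abs(clean)))
print("max |residual| with noise:   ", np.max(np.abs(noisy)))
```

With noise present, the constraint holds only approximately, so minimizing these algebraic residuals is not the same as maximizing the likelihood of the measured image points. That gap is what the abstract refers to when it calls essential-matrix estimation not optimal in the ML sense.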
Authors:
- Guangyang Zeng
- Qingcheng Zeng
- Xinghan Li
- Biqiang Mu
- Jiming Chen
- Ling Shi
- Junfeng Wu