Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash 96 tok/s
Gemini 2.5 Pro 49 tok/s Pro
GPT-5 Medium 24 tok/s
GPT-5 High 36 tok/s Pro
GPT-4o 102 tok/s
GPT OSS 120B 434 tok/s Pro
Kimi K2 198 tok/s Pro
2000 character limit reached

Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods (2408.00117v1)

Published 31 Jul 2024 in cs.CV, cs.LG, cs.RO, cs.SY, and eess.SY

Abstract: This work addresses the certification of the local robustness of vision-based two-stage 6D object pose estimation. The two-stage method for object pose estimation achieves superior accuracy by first employing deep neural network-driven keypoint regression and then applying a Perspective-n-Point (PnP) technique. Despite advancements, the certification of these methods' robustness remains scarce. This research aims to fill this gap with a focus on their local robustness on the system level--the capacity to maintain robust estimations amidst semantic input perturbations. The core idea is to transform the certification of local robustness into neural network verification for classification tasks. The challenge is to develop model, input, and output specifications that align with off-the-shelf verification tools. To facilitate verification, we modify the keypoint detection model by substituting nonlinear operations with those more amenable to the verification processes. Instead of injecting random noise into images, as is common, we employ a convex hull representation of images as input specifications to more accurately depict semantic perturbations. Furthermore, by conducting a sensitivity analysis, we propagate the robustness criteria from pose to keypoint accuracy, and then formulating an optimal error threshold allocation problem that allows for the setting of a maximally permissible keypoint deviation thresholds. Viewing each pixel as an individual class, these thresholds result in linear, classification-akin output specifications. Under certain conditions, we demonstrate that the main components of our certification framework are both sound and complete, and validate its effects through extensive evaluations on realistic perturbations. To our knowledge, this is the first study to certify the robustness of large-scale, keypoint-based pose estimation given images in real-world scenarios.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (58)
  1. Perception Contracts for Safety of ML-Enabled Systems. Proceedings of the ACM on Programming Languages 7, OOPSLA2 (2023), 2196–2223.
  2. Improved geometric path enumeration for verifying relu neural networks. In Computer Aided Verification: 32nd International Conference, CAV 2020, Los Angeles, CA, USA, July 21–24, 2020, Proceedings, Part I 32. Springer, 66–96.
  3. Measuring neural net robustness with constraints. Advances in neural information processing systems 29 (2016).
  4. Mehdi Behroozi. 2019. Largest Inscribed Rectangles in Geometric Convex Sets. arXiv preprint arXiv:1905.13246 (2019).
  5. The fourth international verification of neural networks competition (vnn-comp 2023): Summary and results. arXiv preprint arXiv:2312.16760 (2023).
  6. Sensitivity analysis in optimization and reliability problems. Reliability Engineering & System Safety 93, 12 (2008), 1788–1800.
  7. Towards safety verification of direct perception neural networks. In 2020 Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1640–1643.
  8. The moped framework: Object recognition and pose estimation for manipulation. The international journal of robotics research 30, 10 (2011), 1284–1306.
  9. Self-supervised 6d object pose estimation for robot manipulation. In 2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 3665–3671.
  10. So-pose: Exploiting self-occlusion for direct 6d pose estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 12396–12405.
  11. Harnessing Neuron Stability to Improve DNN Verification. arXiv preprint arXiv:2401.14412 (2024).
  12. Reachability analysis for neural feedback systems using regressive polynomial rule inference. In Proceedings of the 22nd ACM International Conference on Hybrid Systems: Computation and Control. 157–168.
  13. Michael Everett. 2021. Neural network verification in control. In 2021 60th IEEE Conference on Decision and Control (CDC). IEEE, 6326–6340.
  14. Deep learning on monocular object pose detection and tracking: A comprehensive overview. Comput. Surveys 55, 4 (2022), 1–40.
  15. Ai2: Safety and robustness certification of neural networks with abstract interpretation. In 2018 IEEE symposium on security and privacy (SP). IEEE, 3–18.
  16. Ffb6d: A full flow bidirectional fusion network for 6d pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3003–3013.
  17. Hikami3150. 2024. Boeing 777-300ER Model. https://sketchfab.com/3d-models/boeing-777-300er-model-322e7961d2024bba834887878d2d49a2. 3D model.
  18. Verifying controllers with vision-based perception using safe approximate abstractions. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 41, 11 (2022), 4205–4216.
  19. Robustness verification for perception models against camera motion perturbations. In ICML Workshop on Formal Verification of Machine Learning (WFVML).
  20. Verisig 2.0: Verification of neural network controllers using taylor model preconditioning. In International Conference on Computer Aided Verification. Springer, 249–262.
  21. Case study: verifying the safety of an autonomous racing car with a neural network controller. In Proceedings of the 23rd International Conference on Hybrid Systems: Computation and Control. 1–7.
  22. Compositional learning and verification of neural network controllers. ACM Transactions on Embedded Computing Systems (TECS) 20, 5s (2021), 1–26.
  23. The marabou framework for verification and analysis of deep neural networks. In Computer Aided Verification: 31st International Conference, CAV 2019, New York City, NY, USA, July 15-18, 2019, Proceedings, Part I 31. Springer, 443–452.
  24. Verification of image-based neural network controllers using generative models. Journal of Aerospace Information Systems 19, 9 (2022), 574–584.
  25. Ssd-6d: Making rgb-based 3d detection and 6d pose estimation great again. In Proceedings of the IEEE international conference on computer vision. 1521–1529.
  26. Verification of semantic key point detection for aircraft pose estimation. In Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning, Vol. 19. 757–762.
  27. Ruyi Lian and Haibin Ling. 2023. Checkerpose: Progressive dense keypoint localization for object pose estimation with graph neural network. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 14022–14033.
  28. Algorithms for verifying deep neural networks. Foundations and Trends® in Optimization 4, 3-4 (2021), 244–404.
  29. Deep Learning-Based Object Pose Estimation: A Comprehensive Survey. arXiv preprint arXiv:2405.07801 (2024).
  30. Numerical coordinate regression with convolutional neural networks. arXiv preprint arXiv:1801.07372 (2018).
  31. Making deep heatmaps robust to partial occlusions for 3d object pose estimation. In Proceedings of the European conference on computer vision (ECCV). 119–134.
  32. Pix2pose: Pixel-wise coordinate regression of objects for 6d pose estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 7668–7677.
  33. Pvnet: Pixel-wise voting network for 6dof pose estimation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 4561–4570.
  34. Generative adversarial perturbations. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4422–4431.
  35. Mahdi Rad and Vincent Lepetit. 2017. Bb8: A scalable, accurate, robust to partial occlusion method for predicting the 3d poses of challenging objects without using depth. In Proceedings of the IEEE international conference on computer vision. 3828–3836.
  36. Ulices Santa Cruz and Yasser Shoukry. 2022. Nnlander-verif: A neural network formal verification framework for vision-based autonomous aircraft landing. In NASA Formal Methods Symposium. Springer, 213–230.
  37. Ulices Santa Cruz and Yasser Shoukry. 2023. Certified vision-based state estimation for autonomous landing systems using reachability analysis. In 2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 6052–6057.
  38. A correct-and-certify approach to self-supervise object pose estimators via ensemble self-training. arXiv preprint arXiv:2302.06019 (2023).
  39. Dpodv2: Dense correspondence-based 6 dof pose estimation. IEEE transactions on pattern analysis and machine intelligence 44, 11 (2021), 7417–7435.
  40. Formal verification of stochastic systems with relu neural network controllers. In 2022 International Conference on Robotics and Automation (ICRA). IEEE, 6800–6806.
  41. Formal verification of neural network controlled autonomous systems. In Proceedings of the 22nd ACM International Conference on Hybrid Systems: Computation and Control. 147–156.
  42. Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training. IEEE Transactions on Robotics (2023).
  43. 3D mapping and 6D pose computation for real time augmented reality on cylindrical objects. IEEE Transactions on Circuits and Systems for Video Technology 30, 9 (2019), 2887–2899.
  44. Challenges for monocular 6d object pose estimation in robotics. arXiv preprint arXiv:2307.12172 (2023).
  45. Evaluating Robustness of Neural Networks with Mixed Integer Programming. In International Conference on Learning Representations.
  46. Safety verification of cyber-physical systems with reinforcement learning control. ACM Transactions on Embedded Computing Systems (TECS) 18, 5s (2019), 1–22.
  47. NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems. In International Conference on Computer Aided Verification. Springer, 3–17.
  48. Normalized object coordinate space for category-level 6d object pose and size estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2642–2651.
  49. Beta-crown: Efficient bound propagation with per-neuron split constraints for neural network robustness verification. Advances in Neural Information Processing Systems 34 (2021), 29909–29921.
  50. Convex Bounds on the Softmax Function with Applications to Robustness Verification. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 206), Francisco Ruiz, Jennifer Dy, and Jan-Willem van de Meent (Eds.). PMLR, 6853–6878. https://proceedings.mlr.press/v206/wei23c.html
  51. ModelVerification. jl: a Comprehensive Toolbox for Formally Verifying Deep Neural Networks. arXiv preprint arXiv:2407.01639 (2024).
  52. Marabou 2.0: A Versatile Formal Analyzer of Neural Networks. arXiv preprint arXiv:2401.14461 (2024).
  53. Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation. arXiv preprint arXiv:2403.07741 (2024).
  54. Output reachable set estimation and verification for multilayer neural networks. IEEE transactions on neural networks and learning systems 29, 11 (2018), 5777–5783.
  55. Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers. In International Conference on Learning Representations.
  56. Heng Yang and Marco Pavone. 2023. Object pose estimation with statistical guarantees: Conformal keypoint detection and geometric uncertainty propagation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8947–8958.
  57. Deep learning-based human pose estimation: A survey. Comput. Surveys 56, 1 (2023), 1–37.
  58. Hs-pose: Hybrid scope feature extraction for category-level object pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 17163–17173.
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube