SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation (2303.05308v2)

Published 9 Mar 2023 in cs.CV and cs.RO

Abstract: Object pose estimation is a core computer vision problem and often an essential component in robotics. Pose estimation is usually approached by seeking the single best estimate of an object's pose, but this approach is ill-suited for tasks involving visual ambiguity. In such cases, it is desirable to estimate the uncertainty as a pose distribution so that downstream tasks can make informed decisions. Pose distributions can have arbitrary complexity, which motivates estimating unparameterized distributions; however, until now they have only been used for orientation estimation on SO(3) due to the difficulty of training on and normalizing over SE(3). We propose a novel method for pose distribution estimation on SE(3). We use a hierarchical grid, a pyramid, which enables efficient importance sampling during training and sparse evaluation of the pyramid at inference, allowing real-time 6D pose distribution estimation. Our method outperforms state-of-the-art methods on SO(3), and to the best of our knowledge, we provide the first quantitative results on pose distribution estimation on SE(3). Code will be available at spyropose.github.io.
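
The idea of sparse coarse-to-fine evaluation on a hierarchical grid can be illustrated with a toy sketch. The example below is not the authors' implementation and does not operate on SE(3): it assumes a 1D domain [0, 1), a hypothetical unnormalized log-density `bimodal` standing in for a learned model, and a hypothetical helper `sparse_pyramid_eval` that expands only the highest-scoring cells at each level and normalizes over the surviving leaves.

```python
import math


def bimodal(x):
    """Toy unnormalized log-density with modes near 0.2 and 0.7 (assumption)."""
    return max(-0.5 * ((x - 0.2) / 0.02) ** 2, -0.5 * ((x - 0.7) / 0.02) ** 2)


def sparse_pyramid_eval(log_density, levels=6, base_cells=4, keep=4):
    """Coarse-to-fine evaluation on a 1D hierarchical grid over [0, 1).

    Each level halves the cell width. Only the `keep` highest-scoring cells
    are expanded per level, so the number of evaluations grows linearly with
    `levels` instead of with the number of cells at the finest resolution.
    """
    cells = list(range(base_cells))  # cell indices at the coarsest level
    n_cells = base_cells
    for _ in range(levels - 1):
        # Score every surviving cell at its centre and keep the best ones.
        ranked = sorted(
            cells,
            key=lambda i: log_density((i + 0.5) / n_cells),
            reverse=True,
        )
        top = ranked[:keep]
        # Each kept cell splits into two children at the next level.
        cells = [2 * i + child for i in top for child in (0, 1)]
        n_cells *= 2

    # Normalize over the retained leaf cells (softmax with max-subtraction).
    centres = [(i + 0.5) / n_cells for i in cells]
    logits = [log_density(c) for c in centres]
    peak = max(logits)
    weights = [math.exp(v - peak) for v in logits]
    total = sum(weights)
    probs = [w / total for w in weights]
    return centres, probs, 1.0 / n_cells


centres, probs, width = sparse_pyramid_eval(bimodal)
for c, p in sorted(zip(centres, probs)):
    print(f"cell centre {c:.4f} (width {width:.4f}): mass {p:.3f}")
```

In this toy setting, both modes survive the pruning because the coarsest cells that contain them score highest; whether that holds in general depends on the density and on how many cells are kept per level.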
