Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions (2404.01812v1)

Published 2 Apr 2024 in cs.RO and cs.AI

Abstract: Manipulating unseen objects is challenging without a 3D representation, as objects generally have occluded surfaces. This requires physical interaction with objects to build their internal representations. This paper presents an approach that enables a robot to rapidly learn the complete 3D model of a given object for manipulation in unfamiliar orientations. We use an ensemble of partially constructed NeRF models to quantify model uncertainty to determine the next action (a visual or re-orientation action) by optimizing informativeness and feasibility. Further, our approach determines when and how to grasp and re-orient an object given its partial NeRF model and re-estimates the object pose to rectify misalignments introduced during the interaction. Experiments with a simulated Franka Emika Robot Manipulator operating in a tabletop environment with benchmark objects demonstrate an improvement of (i) 14% in visual reconstruction quality (PSNR), (ii) 20% in the geometric/depth reconstruction of the object surface (F-score) and (iii) 71% in the task success rate of manipulating objects a-priori unseen orientations/stable configurations in the scene; over current methods. The project page can be found here: https://actnerf.github.io.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. “NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis” In ECCV, 2020
  2. “pixelNeRF: Neural Radiance Fields from One or Few Images” In CVPR, 2021
  3. M.M. Johari, Y. Lepoittevin and F. Fleuret “GeoNeRF: Generalizing NeRF with Geometry Priors” In IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022
  4. “Uncertainty guided policy for active robotic 3d reconstruction using neural radiance fields” In IEEE Robotics and Automation Letters 7.4 IEEE, 2022, pp. 12070–12077
  5. “Active View Planning for Radiance Fields” In Robotics Science and Systems, 2022
  6. “Dex-NeRF: Using a Neural Radiance Field to Grasp Transparent Objects” In ArXiv abs/2110.14217, 2021 URL: https://api.semanticscholar.org/CorpusID:239998474
  7. “Evo-nerf: Evolving nerf for sequential robot grasping of transparent objects” In 6th Annual Conference on Robot Learning, 2022
  8. “Vision-Only Robot Navigation in a Neural Radiance World” website: https://mikh3x4.github.io/nerf-navigation/ In IEEE Robotics and Automation Letters (RA-L) 7.2, 2022, pp. 4606–4613
  9. “Learning Multi-Object Dynamics with Compositional Neural Radiance Fields” In Conf. on Robot Learning 205, Proceedings of Machine Learning Research, 2023, pp. 1755–1768
  10. Michael Krainin, Brian Curless and Dieter Fox “Autonomous generation of complete 3D object models using next best view manipulation planning” In 2011 IEEE international conference on robotics and automation, 2011, pp. 5031–5037 IEEE
  11. “An information gain formulation for active volumetric 3D reconstruction” In 2016 IEEE International Conference on Robotics and Automation (ICRA), 2016, pp. 3477–3484 IEEE
  12. “An adaptable, probabilistic, next-best view algorithm for reconstruction of unknown 3-d objects” In IEEE Robotics and Automation Letters 2.3 IEEE, 2017, pp. 1540–1547
  13. “Neu-nbv: Next best view planning using uncertainty estimation in image-based neural rendering” In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023, pp. 11305–11312 IEEE
  14. “ActiveNeRF: Learning Where to See with Uncertainty Estimation” In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXIII, 2022, pp. 230–246 Springer
  15. “Stochastic neural radiance fields: Quantifying uncertainty in implicit 3d representations” In 2021 International Conference on 3D Vision (3DV), 2021, pp. 972–981 IEEE
  16. “Probnerf: Uncertainty-aware inference of 3d shapes from 2d images” In International Conference on Artificial Intelligence and Statistics, 2023, pp. 10425–10444 PMLR
  17. Niko Sünderhauf, Jad Abou-Chakra and Dimity Miller “Density-aware nerf ensembles: Quantifying predictive uncertainty in neural radiance fields” In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023, pp. 9370–9376 IEEE
  18. Nelson Max “Optical models for direct volume rendering” In IEEE Transactions on Visualization and Computer Graphics 1.2 IEEE, 1995, pp. 99–108
  19. “Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks”, 2024 arXiv:2401.14159 [cs.CV]
  20. “Grounding dino: Marrying dino with grounded pre-training for open-set object detection” In arXiv preprint arXiv:2303.05499, 2023
  21. “Segment Anything” In arXiv:2304.02643, 2023
  22. “Anygrasp: Robust and efficient grasp perception in spatial and temporal domains” In IEEE Transactions on Robotics IEEE, 2023
  23. “inerf: Inverting neural radiance fields for pose estimation” In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 1323–1330 IEEE
  24. John A Nelder and Roger Mead “A simplex method for function minimization” In The computer journal 7.4 The British Computer Society, 1965, pp. 308–313
  25. Michael JD Powell “A direct search optimization method that models the objective and constraint functions by linear interpolation” Springer, 1994
  26. Michael JD Powell “Direct search algorithms for optimization calculations” In Acta numerica 7 Cambridge University Press, 1998, pp. 287–336
  27. Michael JD Powell “An efficient method for finding the minimum of a function of several variables without calculating derivatives” In The computer journal 7.2 Oxford University Press, 1964, pp. 155–162
  28. “Yale-CMU-Berkeley dataset for robotic manipulation research” In The International Journal of Robotics Research 36.3 SAGE Publications Sage UK: London, England, 2017, pp. 261–268
  29. “Kaolin wisp: A pytorch library and engine for neural fields research”, 2022
  30. “Tanks and temples: Benchmarking large-scale scene reconstruction” In ACM Transactions on Graphics (ToG) 36.4 ACM New York, NY, USA, 2017, pp. 1–13
  31. William E Lorensen and Harvey E Cline “Marching cubes: A high resolution 3D surface construction algorithm” In Seminal graphics: pioneering efforts that shaped the field, 1998, pp. 347–353
  32. “Instant neural graphics primitives with a multiresolution hash encoding” In ACM transactions on graphics (TOG) 41.4 ACM New York, NY, USA, 2022, pp. 1–15
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com