Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions (2404.01812v1)
Abstract: Manipulating unseen objects is challenging without a 3D representation, as objects generally have occluded surfaces. This requires physical interaction with objects to build their internal representations. This paper presents an approach that enables a robot to rapidly learn the complete 3D model of a given object for manipulation in unfamiliar orientations. We use an ensemble of partially constructed NeRF models to quantify model uncertainty and determine the next action (a visual or re-orientation action) by optimizing informativeness and feasibility. Further, our approach determines when and how to grasp and re-orient an object given its partial NeRF model, and re-estimates the object pose to rectify misalignments introduced during the interaction. Experiments with a simulated Franka Emika Robot Manipulator operating in a tabletop environment with benchmark objects demonstrate improvements over current methods of (i) 14% in visual reconstruction quality (PSNR), (ii) 20% in the geometric/depth reconstruction of the object surface (F-score), and (iii) 71% in the task success rate of manipulating objects in a priori unseen orientations/stable configurations in the scene. The project page can be found here: https://actnerf.github.io.
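The idea of using ensemble disagreement to choose the next action can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the exact objective, so the scoring rule (mean per-pixel variance across ensemble members minus a weighted cost), the names `ensemble_uncertainty` and `select_next_action`, the `cost_weight` parameter, and the toy candidate data are all assumptions made for illustration.

```python
from statistics import pvariance


def ensemble_uncertainty(renders):
    """Mean per-pixel variance across ensemble members.

    renders: list of M flattened renders (equal-length lists of pixel
    intensities), one per ensemble member, all for the same candidate view.
    """
    per_pixel = [pvariance(values) for values in zip(*renders)]
    return sum(per_pixel) / len(per_pixel)


def select_next_action(candidates, cost_weight=1.0):
    """Pick the candidate with the highest (uncertainty - weighted cost).

    Assumption: informativeness is approximated by ensemble disagreement and
    feasibility by a scalar cost; the paper's actual objective may differ.
    """
    def score(candidate):
        informativeness = ensemble_uncertainty(candidate["renders"])
        return informativeness - cost_weight * candidate["cost"]

    return max(candidates, key=score)


# Toy data: three ensemble members, two pixels per render (hypothetical values).
candidates = [
    # All members agree, so this view is unlikely to add information.
    {"name": "view_front",
     "renders": [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]],
     "cost": 0.0},
    # Members disagree strongly, e.g. a surface hidden until the object is flipped.
    {"name": "flip_object",
     "renders": [[0.1, 0.9], [0.9, 0.1], [0.5, 0.5]],
     "cost": 0.05},
]

chosen = select_next_action(candidates)
print(chosen["name"])
```

In this toy setup the re-orientation candidate wins because the ensemble members disagree about what it would reveal, and that disagreement outweighs its higher cost. Raising `cost_weight` shifts the choice toward cheaper visual actions.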