FlowBot++: Learning Generalized Articulated Objects Manipulation via Articulation Projection (2306.12893v4)
Abstract: Understanding and manipulating articulated objects, such as doors and drawers, is crucial for robots operating in human environments. We wish to develop a system that can learn to articulate novel objects with no prior interaction, after training on other articulated objects. Previous approaches for articulated object manipulation rely on either modular methods which are brittle or end-to-end methods, which lack generalizability. This paper presents FlowBot++, a deep 3D vision-based robotic system that predicts dense per-point motion and dense articulation parameters of articulated objects to assist in downstream manipulation tasks. FlowBot++ introduces a novel per-point representation of the articulated motion and articulation parameters that are combined to produce a more accurate estimate than either method on their own. Simulated experiments on the PartNet-Mobility dataset validate the performance of our system in articulating a wide range of objects, while real-world experiments on real objects' point clouds and a Sawyer robot demonstrate the generalizability and feasibility of our system in real-world scenarios.
- ScrewNet: Category-Independent articulation model estimation from depth images using screw theory. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 13670–13677, May 2021.
- Structure from action: Learning interactions for articulated object 3d structure discovery. arXiv preprint arXiv:2207.08997, 2022.
- Task space regions: A framework for pose-constrained manipulation planning. The International Journal of Robotics Research, 30(12):1435–1460, 2011.
- Deep part induction from articulated object pairs. arXiv preprint arXiv:1809.07417, 2018.
- Rpm-net: recurrent prediction of motion and parts from point cloud. arXiv preprint arXiv:2006.14865, 2020.
- Flowbot3d: Learning 3d articulation flow to manipulate articulated objects. Robotics: Science and Systems (RSS), 2022.
- Immobilizing Hinged Polygons. Int. J. Comput. Geom. Appl., 17(01):45–69, Feb. 2007.
- Learning to manipulate articulated objects in unstructured environments using a grounded relational representation. In Robotics: Science and Systems IV. Robotics: Science and Systems Foundation, June 2008.
- Partnet: A large-scale benchmark for fine-grained and hierarchical part-level 3d object understanding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 909–918, 2019.
- Sapien: A simulated part-based interactive environment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11097–11107, 2020.
- Where2act: From pixels to actions for articulated 3d objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 6813–6823, 2021.
- Umpnet: Universal manipulation policy network for articulated objects. IEEE Robotics and Automation Letters, 2022.
- ManiSkill: Learning-from-Demonstrations benchmark for generalizable manipulation skills. arXiv e-prints, pages arXiv–2107, 2021.
- Shape2motion: Joint analysis of motion parts and attributes from 3d shapes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8876–8884, 2019.
- Learning to predict part mobility from a single static snapshot. ACM Trans. Graph., 36(6):1–13, Nov. 2017.
- Category-Level articulated object pose estimation, 2020.
- Visual identification of articulated object parts. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2443–2450. IEEE, 2020.
- V. Narayanan and M. Likhachev. Task-oriented planning for manipulating articulated mechanisms under model uncertainty. In 2015 IEEE International Conference on Robotics and Automation (ICRA), pages 3095–3101, May 2015.
- Whole-body motion planning for manipulation of articulated objects. In 2013 IEEE International Conference on Robotics and Automation, pages 1656–1662, May 2013.
- Planning for autonomous door opening with a mobile manipulator. In 2010 IEEE International Conference on Robotics and Automation, pages 1799–1806, May 2010.
- Determining optical flow. Artif. Intell., 17(1):185–203, Aug. 1981.
- FlowNet: Learning optical flow with convolutional networks, 2015.
- Flownet 2.0: Evolution of optical flow estimation with deep networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2462–2470, 2017.
- Z. Teed and J. Deng. RAFT: Recurrent All-Pairs field transforms for optical flow. In Computer Vision – ECCV 2020, pages 402–419. Springer International Publishing, 2020.
- Tactile-rl for insertion: Generalization to objects of unknown geometry. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 6437–6443. IEEE, 2021.
- Motion perception in reinforcement learning with dynamic objects. In A. Billard, A. Dragan, J. Peters, and J. Morimoto, editors, Proceedings of The 2nd Conference on Robot Learning, volume 87 of Proceedings of Machine Learning Research, pages 156–168. PMLR, 2018.
- Fabricflownet: Bimanual cloth manipulation with a flow-based policy. In Conference on Robot Learning, pages 192–202. PMLR, 2022.
- Tax-pose: Task-specific cross-pose estimation for robot manipulation. In Conference on Robot Learning, pages 1783–1792. PMLR, 2023.
- Toolflownet: Robotic manipulation with tools via predicting tool flow from point clouds. In Conference on Robot Learning, pages 1038–1049. PMLR, 2023.
- Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 652–660, 2017.
- A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the fourteenth international conference on artificial intelligence and statistics, pages 627–635. JMLR Workshop and Conference Proceedings, 2011.
- Dex-Net AR: Distributed deep grasp planning using a commodity cellphone and augmented reality app. In 2020 IEEE International Conference on Robotics and Automation (ICRA), pages 552–558, May 2020.
- Open3d: A modern library for 3d data processing. arXiv preprint arXiv:1801.09847, 2018.
- PointNet++: Deep hierarchical feature learning on point sets in a metric space. June 2017.
- D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Harry Zhang (37 papers)
- Ben Eisner (13 papers)
- David Held (81 papers)