Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
194 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Generalizable Tool-use Skills through Trajectory Generation (2310.00156v5)

Published 29 Sep 2023 in cs.RO and cs.AI

Abstract: Autonomous systems that efficiently utilize tools can assist humans in completing many common tasks such as cooking and cleaning. However, current systems fall short of matching human-level of intelligence in terms of adapting to novel tools. Prior works based on affordance often make strong assumptions about the environments and cannot scale to more complex, contact-rich tasks. In this work, we tackle this challenge and explore how agents can learn to use previously unseen tools to manipulate deformable objects. We propose to learn a generative model of the tool-use trajectories as a sequence of tool point clouds, which generalizes to different tool shapes. Given any novel tool, we first generate a tool-use trajectory and then optimize the sequence of tool poses to align with the generated trajectory. We train a single model on four different challenging deformable object manipulation tasks, using demonstration data from only one tool per task. The model generalizes to various novel tools, significantly outperforming baselines. We further test our trained policy in the real world with unseen tools, where it achieves the performance comparable to human. Additional materials can be found on our project website: https://sites.google.com/view/toolgen.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. Improvisation through physical understanding: Using novel objects as tools with visual foresight. Robotics: Science and Systems (RSS), 2019.
  2. Learning task-oriented grasping for tool manipulation from simulated self-supervision. The International Journal of Robotics Research (IJRR), 2020.
  3. Keto: Learning keypoint representations for tool manipulation. In IEEE International Conference on Robotics and Automation (ICRA), 2020.
  4. Gift: Generalizable interaction-aware functional tool affordances without labels. In Robotics: Science and Systems (RSS), 2021.
  5. Diffskill: Skill abstraction from differentiable physics for deformable object manipulations with tools. International Conference on Learning Representations (ICLR), 2022.
  6. Learning closed-loop dough manipulation using a differentiable reset module. IEEE Robotics and Automation Letters, 7(4):9857–9864, 2022.
  7. Planning with spatial-temporal abstraction from point clouds for deformable object manipulation. In Conference on Robot Learning (CoRL), 2022.
  8. Understanding tools: Task-oriented object modeling, learning and recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
  9. Understanding physical effects for effective tool-use. IEEE Robotics and Automation Letters (R-AL), 2022.
  10. Virdo++: Real-world, visuo-tactile dynamics and perception of deformable objects. 2022.
  11. Disect: A differentiable simulation engine for autonomous robotic cutting. arXiv preprint arXiv:2105.12244, 2021.
  12. Roboninja: Learning an adaptive cutting policy for multi-material objects. Robotics: Science and Systems (RSS), 2023.
  13. Plasticinelab: A soft-body manipulation benchmark with differentiable physics. In International Conference on Learning Representations, 2021.
  14. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems, 30, 2017.
  15. A smooth representation of belief over so (3) for deep rotation learning with uncertainty. arXiv preprint arXiv:2006.01031, 2020.
  16. On the continuity of rotation representations in neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5745–5753, 2019.
  17. Projective manifold gradient layer for deep rotation regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6646–6655, 2022.
  18. Pointflow: 3d point cloud generation with continuous normalizing flows. In Proceedings of the IEEE/CVF international conference on computer vision, pages 4541–4550, 2019.
  19. Toolflownet: Robotic manipulation with tools via predicting tool flow from point clouds. In Conference on Robot Learning (CoRL), 2022.
  20. Sinkhorn divergences for unbalanced optimal transport. arXiv preprint arXiv:1910.12958, 2019.
  21. O. Sorkine-Hornung and M. Rabinovich. Least-squares rigid motion using svd. Computing, 1(1):1–5, 2017.
  22. An analysis of svd for deep rotation estimation. Advances in Neural Information Processing Systems, 33:22554–22565, 2020.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com