Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
173 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

OGMP: Oracle Guided Multi-mode Policies for Agile and Versatile Robot Control (2403.04205v3)

Published 7 Mar 2024 in cs.RO

Abstract: The efficacy of reinforcement learning for robot control relies on the tailored integration of task-specific priors and heuristics for effective exploration, which challenges their straightforward application to complex tasks and necessitates a unified approach. In this work, we define a general class for priors called oracles that generate state references when queried in a closed-loop manner during training. By bounding the permissible state around the oracle's ansatz, we propose a task-agnostic oracle-guided policy optimization. To enhance modularity, we introduce task-vital modes, showing that a policy mastering a compact set of modes and transitions can handle infinite-horizon tasks. For instance, to perform parkour on an infinitely long track, the policy must learn to jump, leap, pace, and transition between these modes effectively. We validate this approach in challenging bipedal control tasks: parkour and diving using a 16 DoF dynamic bipedal robot, HECTOR. Our method results in a single policy per task, solving parkour across diverse tracks and omnidirectional diving from varied heights up to 2m in simulation, showcasing versatile agility. We demonstrate successful sim-to-real transfer of parkour, including leaping over gaps up to 105 % of the leg length, jumping over blocks up to 20 % of the robot's nominal height, and pacing at speeds of up to 0.6 m/s, along with effective transitions between these modes in the real robot.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (20)
  1. R. McN. Alexander. The gaits of bipedal and quadrupedal animals. IJRR, 1984.
  2. Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors. arXiv preprint arXiv:2203.1713, 2022.
  3. Extreme parkour with legged robots. In RoboLetics: Workshop @CoRL 2023, 2023.
  4. Diffusion policy: Visuomotor policy learning via action diffusion, 2023.
  5. The mit humanoid robot: Design, motion planning, and control for acrobatic behaviors. In 2020 IEEE-RAS 20th International Conference on Humanoid Robots (Humanoids), 2021.
  6. Opt-mimic: Imitation of optimized trajectories for dynamic quadruped behaviors. In ICRA, 2023.
  7. Repetitive extreme-acceleration (14-g) spatial jumping with salto-1p. In IROS, 2017.
  8. CoMic: Complementary task learning &; mimicry for reusable skills. In ICML, Proceedings of Machine Learning Research, 2020.
  9. Dtc: Deep tracking control. Science Robotics, 2024.
  10. Rl + model-based control: Using on-demand optimal control to learn versatile legged locomotion. IEEE RAL, 2023.
  11. Learning multimodal bipedal locomotion and implicit transitions: A versatile policy approach. In IEEE IROS, 2023.
  12. Force-and-moment-based model predictive control for achieving highly dynamic locomotion on bipedal robots. In IEEE CDC, 2021.
  13. Rapid Locomotion via Reinforcement Learning. In Proceedings of RSS, 2022.
  14. Learning robust perceptive locomotion for quadrupedal robots in the wild. Science Robotics, 2022.
  15. Centroidal trajectory generation and stabilization based on preview control for humanoid multi-contact motion. IEEE RAL, 2022.
  16. Efficient multicontact pattern generation with sequential convex approximations of the centroidal dynamics. IEEE T-RO, 2021.
  17. Advanced skills by learning locomotion and local navigation end-to-end. In IEEE IROS, 2022.
  18. Dynamics-aware unsupervised discovery of skills. In ICLR, 2020.
  19. Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning. In RSS, 2021.
  20. Robot parkour learning. In CoRL, 2023.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com