CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller (2306.09557v2)
Abstract: We present CAJun, a novel hierarchical learning and control framework that enables legged robots to jump continuously with adaptive jumping distances. CAJun consists of a high-level centroidal policy and a low-level leg controller. We use reinforcement learning (RL) to train the centroidal policy, which specifies the gait timing, base velocity, and swing foot position for the leg controller. Following the gait timing, the leg controller uses optimal control to compute motor commands for the swing and stance legs that track the swing foot target and base velocity commands. Additionally, we reformulate the stance leg optimizer in the leg controller to speed up policy training by an order of magnitude. By combining RL with optimal control, our system achieves the versatility of learning while retaining the robustness of control methods, making it easily transferable to real robots. We show that after 20 minutes of training on a single GPU, CAJun can achieve continuous, long jumps with adaptive distances on a Go1 robot with small sim-to-real gaps. Moreover, the robot can jump across gaps with a maximum width of 70 cm, which is over 40% wider than existing methods.
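The two-level structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: `CentroidalCommand`, `placeholder_policy`, `leg_mode`, and all numeric values are hypothetical names and numbers chosen for illustration. The "gait timing" output is assumed here to be a stepping frequency, and the leg controller is reduced to choosing which target each phase tracks rather than solving an optimal control problem.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class CentroidalCommand:
    """Hypothetical container for the three outputs the abstract names."""
    stepping_frequency_hz: float  # assumed stand-in for "gait timing"
    base_velocity: Vec3           # desired base velocity (m/s), illustrative
    swing_foot_target: Vec3       # desired swing foot position (m), illustrative


def placeholder_policy(observation: List[float]) -> CentroidalCommand:
    """Stand-in for the RL-trained centroidal policy; returns fixed values."""
    return CentroidalCommand(2.0, (1.0, 0.0, 0.0), (0.2, 0.0, -0.25))


def advance_phase(phase: float, frequency_hz: float, dt: float) -> float:
    """Advance a normalized gait phase in [0, 1) by one control step."""
    return (phase + frequency_hz * dt) % 1.0


def leg_mode(phase: float, stance_fraction: float = 0.5) -> str:
    """Assumed convention: the first part of each cycle is stance, the rest swing."""
    return "stance" if phase < stance_fraction else "swing"


def leg_controller_step(command: CentroidalCommand, phase: float) -> Dict[str, object]:
    """Pick the tracking target by gait phase (a real controller would optimize motor commands)."""
    mode = leg_mode(phase)
    if mode == "stance":
        # Stance legs track the commanded base velocity.
        return {"mode": mode, "target": command.base_velocity}
    # Swing legs track the commanded foot position.
    return {"mode": mode, "target": command.swing_foot_target}


def run(steps: int = 4, dt: float = 0.125) -> List[str]:
    """Run the high-level / low-level loop and record the leg mode at each step."""
    phase = 0.0
    modes = []
    for _ in range(steps):
        command = placeholder_policy([phase])
        modes.append(str(leg_controller_step(command, phase)["mode"]))
        phase = advance_phase(phase, command.stepping_frequency_hz, dt)
    return modes


if __name__ == "__main__":
    print(run())
```

The sketch only shows how the interface between the two levels might look: the policy emits a compact command, and the gait phase derived from it decides whether a leg tracks the base velocity or the swing foot target.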