Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response (2312.11460v3)
Abstract: Robust locomotion control depends on accurate state estimations. However, the sensors of most legged robots can only provide partial and noisy observations, making the estimation particularly challenging, especially for external states like terrain frictions and elevation maps. Inspired by the classical Internal Model Control principle, we consider these external states as disturbances and introduce Hybrid Internal Model (HIM) to estimate them according to the response of the robot. The response, which we refer to as the hybrid internal embedding, contains the robot's explicit velocity and implicit stability representation, corresponding to two primary goals for locomotion tasks: explicitly tracking velocity and implicitly maintaining stability. We use contrastive learning to optimize the embedding to be close to the robot's successor state, in which the response is naturally embedded. HIM has several appealing benefits: It only needs the robot's proprioceptions, i.e., those from joint encoders and IMU as observations. It innovatively maintains consistent observations between simulation reference and reality that avoids information loss in mimicking learning. It exploits batch-level information that is more robust to noises and keeps better sample efficiency. It only requires 1 hour of training on an RTX 4090 to enable a quadruped robot to traverse any terrain under any disturbances. A wealth of real-world experiments demonstrates its agility, even in high-difficulty tasks and cases never occurred during the training process, revealing remarkable open-world generalizability.
- Legged locomotion in challenging terrains using egocentric vision. In Conference on Robot Learning (CoRL), 2023.
- Vision-aided dynamic quadrupedal locomotion on discrete terrain using motion libraries. In International Conference on Robotics and Automation (ICRA), 2022.
- Mit cheetah 3: Design and control of a robust, dynamic quadruped robot. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018.
- The mit super mini cheetah: A small, low-cost quadrupedal robot for dynamic locomotion. In 2015 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), 2015.
- Barkour: Benchmarking animal-level agility with quadruped robots. arXiv preprint arXiv:2305.14654, 2023.
- Unsupervised learning of visual features by contrasting cluster assignments. Advances in neural information processing systems, 2020.
- Learning by cheating. In Conference on Robot Learning (CoRL), 2019.
- Legs as manipulator: Pushing quadrupedal agility beyond locomotion. In IEEE International Conference on Robotics and Automation (ICRA), 2023.
- Marco Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. Advances in neural information processing systems, 2013.
- Dreamerpro: Reconstruction-free model-based reinforcement learning with prototypical representations. In International Conference on Machine Learning, 2022.
- Adversarial motion priors make good substitutes for complex reward functions. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022.
- Minimizing energy consumption leads to the emergence of gaits in legged robots. In Conference on Robot Learning (CoRL), 2021.
- Deep whole-body control: Learning a unified policy for manipulation and locomotion. In Conference on Robot Learning (CoRL), 2022.
- Legged robot state-estimation through combined forward kinematic and preintegrated contact factors. In 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 4422–4429. IEEE, 2018.
- Anymal-a highly mobile and dynamic quadrupedal robot. In IEEE/RSJ international conference on intelligent robots and systems (IROS), 2016.
- Learning agile and dynamic motor skills for legged robots. Science Robotics, 2019.
- Dribblebot: Dynamic legged manipulation in the wild. In IEEE International Conference on Robotics and Automation (ICRA), 2023.
- Auto-encoding variational bayes. In The Second International Conference on Learning Representations, 2014.
- Rma: Rapid motor adaptation for legged robots. In Robotics: Science and Systems, 2021.
- Yann LeCun. A path towards autonomous machine intelligence. Open Review, 2022.
- Learning quadrupedal locomotion over challenging terrain. Science robotics, 2020.
- Isaac gym: High performance gpu-based physics simulation for robot learning. Advances in neural information processing systems, 2021.
- Walk these ways: Tuning robot control for generalization with multiplicity of behavior. In Conference on Robot Learning (CoRL), 2023.
- Rapid locomotion via reinforcement learning. In Robotics: Science and Systems, 2022.
- Learning robust perceptive locomotion for quadrupedal robots in the wild. Science Robotics, 2022a.
- Elevation mapping for locomotion and navigation using gpu. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022b.
- Dreamwaq: Learning robust quadrupedal locomotion with implicit terrain imagination via deep reinforcement learning. In IEEE International Conference on Robotics and Automation (ICRA), 2023.
- Learning to fly—a gym environment with pybullet physics for reinforcement learning of multi-agent quadcopter control. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021.
- Learning agile robotic locomotion skills by imitating animals. In Robotics: Science and Systems, 2020.
- Internal model control: Pid controller design. Industrial & engineering chemistry process design and development, 1986.
- Learning to walk in minutes using massively parallel deep reinforcement learning. In Conference on Robot Learning (CoRL), 2022.
- A compliant hybrid zero dynamics controller for stable, efficient and fast bipedal walking on mabel. The International Journal of Robotics Research, 2011.
- Sim-to-real: Learning agile locomotion for quadruped robots. In Robotics: Science and Systems, 2018.
- Mujoco: A physics engine for model-based control. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2012.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 2008.
- Learning robust and agile legged locomotion using adversarial motion priors. IEEE Robotics and Automation Letters, 2022.
- Daydreamer: World models for physical robot learning. In Conference on Robot Learning (CoRL), 2023.
- Physics-based modeling and simulation of human walking: a review of optimization-based and other approaches. Structural and multidisciplinary optimization, 2010.
- Dynamics randomization revisited: A case study for quadrupedal locomotion. In IEEE International Conference on Robotics and Automation (ICRA), 2021.
- Reinforcement learning with prototypical representations. In International Conference on Machine Learning, 2021.
- Simbicon: Simple biped locomotion control. ACM Transactions on Graphics (TOG), 2007.
- Visual-locomotion: Learning to walk on complex terrains with vision. In Conference on Robot Learning (CoRL), 2021.
- Robot parkour learning. In Conference on Robot Learning (CoRL), 2023.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.