HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation (2403.10506v2)
Abstract: Humanoid robots hold great promise in assisting humans in diverse environments and tasks, due to their flexibility and adaptability leveraging human-like morphology. However, research in humanoid robots is often bottlenecked by the costly and fragile hardware setups. To accelerate algorithmic research in humanoid robots, we present a high-dimensional, simulated robot learning benchmark, HumanoidBench, featuring a humanoid robot equipped with dexterous hands and a variety of challenging whole-body manipulation and locomotion tasks. Our findings reveal that state-of-the-art reinforcement learning algorithms struggle with most tasks, whereas a hierarchical learning approach achieves superior performance when supported by robust low-level policies, such as walking or reaching. With HumanoidBench, we provide the robotics community with a platform to identify the challenges arising when solving diverse tasks with humanoid robots, facilitating prompt verification of algorithms and ideas. The open-source code is available at https://humanoid-bench.github.io.
- Exploring kinodynamic fabrics for reactive whole-body control of underactuated humanoid robots. In IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 10397–10404. IEEE, 2023.
- Solving rubik’s cube with a robot hand. arXiv preprint arXiv:1910.07113, 2019.
- Locomujoco: A comprehensive imitation learning benchmark for locomotion. 6th Robot Learning Workshop at NeurIPS, 2023.
- The arcade learning environment: An evaluation platform for general agents. Journal of Artificial Intelligence Research, 47:253–279, jun 2013.
- SAR: Generalization of Physiological Dexterity via Synergistic Action Representation. In Robotics: Science and Systems, 2023.
- Openai gym. arXiv preprint arXiv:1606.01540, 2016.
- Myosuite: A contact-rich simulation suite for musculoskeletal motor control. In Learning for Dynamics and Control, pages 492–507. PMLR, 2022.
- Myodex: a generalizable prior for dexterous manipulation. In International Conference on Machine Learning, pages 3327–3346. PMLR, 2023.
- Bi-dexhands: Towards human-level bimanual dexterous manipulation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023.
- Diffusion policy: Visuomotor policy learning via action diffusion. In Robotics: Science and Systems, 2023.
- One-shot imitation learning. In Advances in Neural Information Processing Systems, pages 1087–1098, 2017.
- Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation. arXiv preprint arXiv:2401.02117, 2024.
- Dibya Ghosh. dibyaghosh/jaxrl_m, 2023. URL https://github.com/dibyaghosh/jaxrl_m.
- Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning. Conference on Robot Learning, 2019.
- Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International Conference on Machine Learning, pages 1856–1865, 2018.
- Learning agile soccer skills for a bipedal robot with deep reinforcement learning. arXiv preprint arXiv:2304.13653, 2023.
- Mastering diverse domains through world models. arXiv preprint arXiv:2301.04104, 2023.
- Td-mpc2: Scalable, robust world models for continuous control. In International Conference on Learning Representations, 2024.
- Furniturebench: Reproducible real-world benchmark for long-horizon complex manipulation. In Robotics: Science and Systems, 2023.
- Learning agile and dynamic motor skills for legged robots. Science Robotics, 4(26):eaau5872, 2019.
- Rlbench: The robot learning benchmark & learning environment. IEEE Robotics and Automation Letters, 2020.
- Robodesk: A multi-task reinforcement learning benchmark. https://github.com/google-research/robodesk, 2021.
- Rma: Rapid motor adaptation for legged robots. In Robotics: Science and Systems, 2021.
- Scalable muscle-actuated human simulation and control. ACM Transactions on Graphics, 38(4):1–13, 2019a.
- Composing complex skills by learning transition policies. In International Conference on Learning Representations, 2019b. URL https://openreview.net/forum?id=rygrBhC5tQ.
- Learning to coordinate manipulation skills via skill behavior diversification. In International Conference on Learning Representations, 2020.
- IKEA furniture assembly environment for long-horizon complex manipulation tasks. In IEEE International Conference on Robotics and Automation, 2021. URL https://clvrai.com/furniture.
- Behavior-1k: A benchmark for embodied ai with 1,000 everyday activities and realistic simulation. In Conference on Robot Learning, 2022.
- Softgym: Benchmarking deep reinforcement learning for deformable object manipulation. In Conference on Robot Learning, 2020.
- Discovered policy optimisation. Advances in Neural Information Processing Systems, 35:16455–16468, 2022.
- Isaac gym: High performance gpu based physics simulation for robot learning. In Neural Information Processing Systems Datasets and Benchmarks Track, 2021.
- What matters in learning from offline human demonstrations for robot manipulation. In Conference on Robot Learning, 2021.
- Mimo: A multi-modal infant model for studying cognitive development. IEEE Transactions on Cognitive and Developmental Systems, 2024.
- Calvin: A benchmark for language-conditioned policy learning for long-horizon robot manipulation tasks. IEEE Robotics and Automation Letters, 2022.
- Catch & carry: reusable neural controllers for vision-guided whole-body tasks. ACM Transactions on Graphics, 39(4):39–1, 2020.
- Humanoid multimodal tactile-sensing modules. IEEE Transactions on robotics, 27(3):401–410, 2011.
- Factory: Fast contact for robotic assembly. In Robotics: Science and Systems, 2022.
- Learning dexterous in-hand manipulation. The International Journal of Robotics Research, 39(1):3–20, 2020.
- Deepmimic: Example-guided deep reinforcement learning of physics-based character skills. ACM Transactions on Graphics, 37(4):1–14, 2018a.
- Sfv: Reinforcement learning of physical skills from videos. ACM Transactions on Graphics, 37(6):1–14, 2018b.
- Amp: Adversarial motion priors for stylized physics-based character control. ACM Transactions on Graphics, 40(4):1–20, 2021.
- Multi-goal reinforcement learning: Challenging robotics environments and request for research. arXiv preprint arXiv:1802.09464, 2018.
- Learning humanoid locomotion with transformers. arXiv preprint arXiv:2303.03381, 2023.
- Stable baselines3, 2019.
- Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
- Sim-to-real for high-resolution optical tactile sensing: From images to three-dimensional contact force distributions. Soft Robotics, 9(5):926–937, 2022.
- The power of the senses: Generalizable manipulation from vision and touch through masked multimodal learning. arXiv preprint arXiv:2311.00924, 2023.
- Behavior: Benchmark for everyday household activities in virtual, interactive, and ecological environments. In Conference on Robot Learning, 2021.
- Habitat 2.0: Training home assistants to rearrange their habitat. In Neural Information Processing Systems, 2021.
- Deepmind control suite. arXiv preprint arXiv:1801.00690, 2018.
- dm_control: Software and tasks for continuous control. arXiv preprint arXiv:2006.12983, 2020.
- Mujoco: A physics engine for model-based control. In IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 5026–5033, 2012.
- Physhoi: Physics-based imitation of dynamic human-object interaction. arXiv preprint arXiv:2312.04393, 2023.
- Approximate convex decomposition for 3d meshes with collision-aware concavity and tree search. ACM Transactions on Graphics (TOG), 41(4):1–18, 2022.
- Hierarchical planning and control for box loco-manipulation. Symposium on Computer Animation, 2023.
- Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning. In Conference on Robot Learning, 2019.
- Robot synesthesia: In-hand manipulation with visuotactile sensing. arXiv preprint arXiv:2312.01853, 2023.
- MuJoCo Menagerie: A collection of high-quality simulation models for MuJoCo, 2022. URL http://github.com/google-deepmind/mujoco_menagerie.
- Robopianist: Dexterous piano playing with deep reinforcement learning. In Conference on Robot Learning, pages 2975–2994. PMLR, 2023.
- Learning physically simulated tennis skills from broadcast videos. ACM Transactions on Graphics, 42(4):1–14, 2023.
- Learning fine-grained bimanual manipulation with low-cost hardware. In Robotics: Science and Systems, 2023.
- robosuite: A modular simulation framework and benchmark for robot learning. arXiv preprint arXiv:2009.12293, 2020.
- Robot parkour learning. In Conference on Robot Learning, 2023.