PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction (2312.04393v1)

Published 7 Dec 2023 in cs.CV, cs.GR, and cs.RO

Abstract: Humans interact with objects all the time. Enabling a humanoid to learn human-object interaction (HOI) is a key step for future smart animation and intelligent robotics systems. However, recent progress in physics-based HOI requires carefully designed task-specific rewards, making the system unscalable and labor-intensive. This work focuses on dynamic HOI imitation: teaching humanoid dynamic interaction skills through imitating kinematic HOI demonstrations. It is quite challenging because of the complexity of the interaction between body parts and objects and the lack of dynamic HOI data. To handle the above issues, we present PhysHOI, the first physics-based whole-body HOI imitation approach without task-specific reward designs. Except for the kinematic HOI representations of humans and objects, we introduce the contact graph to model the contact relations between body parts and objects explicitly. A contact graph reward is also designed, which proved to be critical for precise HOI imitation. Based on the key designs, PhysHOI can imitate diverse HOI tasks simply yet effectively without prior knowledge. To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills. We validate PhysHOI on diverse HOI tasks, including whole-body grasping and basketball skills.

Authors (6)

Yinhuai Wang (11 papers)
Jing Lin (52 papers)
Ailing Zeng (58 papers)
Zhengyi Luo (28 papers)
Jian Zhang (543 papers)
Lei Zhang (1689 papers)

Citations (19)

View on Semantic Scholar

Summary

Exploration of Dynamic Human-Object Interaction through PhysHOI

Introduction to PhysHOI

Recent efforts in enabling humanoid robots to learn human-object interaction (HOI) skills directly from kinematic demonstrations without the need for task-specific reward designs mark a significant step forward in the field. The paper introduces PhysHOI, a novel framework that focuses on imitating dynamic interactions between humans and objects in various scenarios. Unlike previous works that rely on hand-crafted rewards for each task, making systems unscalable and labor-intensive, PhysHOI employs a task-agnostic approach. The introduction of a contact graph (CG) plays a pivotal role in this process, enhancing the representation of interaction by modeling contact relationships between body parts and objects.

Related Work

The paper positions itself within the spectrum of physics-based humanoids and human-object interaction learning, comparing its approach against existing methods that primarily target isolated motion imitation or are heavily dependent on task-specific rewards. Through a meticulous comparison, it becomes evident that PhysHOI establishes itself as the first whole-body HOI imitation approach that operates effectively across diverse tasks without prior knowledge, setting a new paradigm in the field of dynamic HOI imitation.

Methodology

At the heart of PhysHOI is the introduction of the generalized contact graph, which accurately models the nuanced relationships between humanoid body segments and objects. This CG, alongside the contact-aware HOI representation, forms the basis of the task-agnostic HOI imitation reward, pivotal in guiding the humanoid to manipulate objects accurately. Additionally, the formulation of the BallPlay dataset addresses the scarcity of dynamic HOI scenarios, providing a variety of basketball skills for effective training and validation of the proposed method.

Experiments and Results

PhysHOI's capabilities were rigorously assessed through a series of whole-body grasping cases and complex basketball skills. The quantitative and qualitative comparisons demonstrate PhysHOI's superior performance over existing state-of-the-art methods, showcasing significantly improved success rates and reduced object trajectory errors. These results highlight PhysHOI's robustness and versatility in imitating complex interactions, substantiating its efficacy as a scalable solution for learning general HOI skills.

Implications and Future Directions

The implications of PhysHOI extend beyond the immediate advancements in humanoid robotics. Theoretically, it offers a new lens through which dynamic interactions can be understood and synthesized, fostering further research into more nuanced interactions, such as those involving multiple objects or more complex environments. Practically, PhysHOI's approach could catalyze the development of more autonomous robots capable of performing a broader array of tasks with minimal human oversight. Looking ahead, the potential to expand PhysHOI to accommodate interactions involving multiple humanoids or to extend its application to real-world robotics presents fertile grounds for future investigations.

Conclusion

In summary, PhysHOI represents a significant milestone in the pursuit of scalable and effective methods for teaching humanoid robots to imitate human-object interactions. By eschewing the need for task-specific rewards and introducing a general-purpose contact graph, the method achieves remarkable success in imitating a wide range of dynamic interactions. The BallPlay dataset further enriches the field, providing a valuable resource for ongoing and future research. As humanoid robotics continues to evolve, the foundational work laid by PhysHOI and its innovative approach to imitation learning will undoubtedly inspire continued advancements and applications in the field.

PDF Markdown

Related Papers

GitHub

PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

Tweets

https://twitter.com/1172752147265441792/status/1740280877152927750