Realizing Learned Quadruped Locomotion Behaviors through Kinematic Motion Primitives

Published 9 Oct 2018 in cs.RO and cs.LG | (1810.03842v2)

Abstract: Humans and animals are believed to use a very minimal set of trajectories to perform a wide variety of tasks including walking. Our main objective in this paper is two fold 1) Obtain an effective tool to realize these basic motion patterns for quadrupedal walking, called the kinematic motion primitives (kMPs), via trajectories learned from deep reinforcement learning (D-RL) and 2) Realize a set of behaviors, namely trot, walk, gallop and bound from these kinematic motion primitives in our custom four legged robot, called the `Stoch'. D-RL is a data driven approach, which has been shown to be very effective for realizing all kinds of robust locomotion behaviors, both in simulation and in experiment. On the other hand, kMPs are known to capture the underlying structure of walking and yield a set of derived behaviors. We first generate walking gaits from D-RL, which uses policy gradient based approaches. We then analyze the resulting walking by using principal component analysis. We observe that the kMPs extracted from PCA followed a similar pattern irrespective of the type of gaits generated. Leveraging on this underlying structure, we then realize walking in Stoch by a straightforward reconstruction of joint trajectories from kMPs. This type of methodology improves the transferability of these gaits to real hardware, lowers the computational overhead on-board, and also avoids multiple training iterations by generating a set of derived behaviors from a single learned gait.

Abstract PDF Upgrade to Chat

Citations (20)

View on Semantic Scholar

Summary

The paper introduces a hybrid framework combining deep reinforcement learning with PCA-derived kinematic motion primitives to generate diverse quadruped gaits.
It employs the PyBullet simulator and proximal policy optimization to produce walking trajectories that are reduced to four to five essential motion primitives via PCA.
Empirical analysis shows over 90% cross-covariance among kMPs, highlighting the method's robustness and potential for bio-inspired robotic locomotion.

Overview of Kinematic Motion Primitives in Learned Quadruped Locomotion

The paper introduces a compelling methodology for implementing learned locomotion behaviors in quadrupedal robots using kinematic motion primitives (kMPs). The authors capitalize on deep reinforcement learning (D-RL) to generate locomotion trajectories and employ principal component analysis (PCA) to extract kMPs, facilitating the realization of walking patterns such as trot, walk, gallop, and bound in a custom-designed quadruped robot called "Stoch."

Methodology and Contributions

The primary contribution of this work is the combination of D-RL and kMPs to efficiently generate and transfer locomotion patterns to real hardware. The process begins with generating walking gaits through D-RL using policy gradient methods, specifically proximal policy optimization (PPO). The simulated environment is meticulously constructed using the Pybullet simulator to closely mimic the real robot's physical attributes.

Once the gaits are generated, the authors employ PCA to distill the kMPs from the joint angle trajectories in the simulation data. Remarkably, only four to five kMPs are sufficient to represent various locomotion behaviors, affirming the low-dimensional character of these movements. This dimensionality reduction significantly enhances the transferability of these gaits to real-world robotic platforms by simplifying the joint trajectory reconstruction process and reducing computational demands. Interestingly, the derived kMPs exhibit comparable structure irrespective of the gait, embodying a characteristic waveform prevalent across different models and environments.

Numerical Results and Robustness

The paper provides empirical data showcasing the efficacy of the proposed methodology. The authors report robust and efficient gait generation in simulation within a feasible computational timeframe. Stoch successfully demonstrates flexibility and adaptability in executing multiple gaits using the extracted kMPs, reducing reliance on multiple training episodes common in traditional methods.

The numerical results underscore the robustness of the walking patterns derived, as indicated by the strong correlation between kMPs from learned gaits and biological counterparts, such as horse gaits. Additionally, statistical analyses reveal an impressive cross-covariance exceeding 90% between different gait kMPs, validating the approach.

Implications and Speculations

On a practical level, this research substantiates the viability of synthesizing complex robot behaviors with minimal computational overhead, promoting agile and energy-efficient robotic locomotion systems. Theoretically, the alignment of kMP structure across divergent locomotion types posits intriguing parallels between synthetic and biological motion control systems, suggesting avenues for further exploration into bio-inspired robotics.

Future developments in AI for robotics may involve deeper integration of kMP-inspired control schemes, potentially leveraging neural networks designed to capture dynamic contexts beyond the current optimization of trajectories. Moreover, advancements in hardware design may further facilitate seamless translation of simulated learning to physical enactment, enhancing the fidelity and applicability of these models across various terrains and operational challenges.

In conclusion, this paper presents a methodologically sound and practically relevant approach to quadrupedal locomotion through the innovative convergence of D-RL and kMPs, offering valuable insights for the advancement of legged robotics research.

Markdown