Extension of Lipschitz-Constrained Policies Beyond Locomotion
Determine how well Lipschitz-constrained reinforcement learning policies, which impose a gradient penalty on the likelihood of a control action to encourage smoothness, extend to challenging physics-based character animation scenarios beyond locomotion.
References
While this method has been effective for locomotion tasks, it remains unclear how well it extends to more challenging scenarios.
— Learning Smooth Time-Varying Linear Policies with an Action Jacobian Penalty
(2602.18312 - Xie et al., 20 Feb 2026) in Section 1: Introduction