Papers
Topics
Authors
Recent
2000 character limit reached

Combining Sampling- and Gradient-based Planning for Contact-rich Manipulation

Published 7 Oct 2023 in cs.RO | (2310.04822v2)

Abstract: Planning over discontinuous dynamics is needed for robotics tasks like contact-rich manipulation, which presents challenges in the numerical stability and speed of planning methods when either neural network or analytical models are used. On the one hand, sampling-based planners require higher sample complexity in high-dimensional problems and cannot describe safety constraints such as force limits. On the other hand, gradient-based solvers can suffer from local optima and convergence issues when the Hessian is poorly conditioned. We propose a planning method with both sampling- and gradient-based elements, using the Cross-entropy Method to initialize a gradient-based solver, providing better search over local minima and the ability to handle explicit constraints. We show the approach allows smooth, stable contact-rich planning for an impedance-controlled robot making contact with a stiff environment, benchmarking against gradient-only MPC and CEM.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. M. Anitescu and F. A. Potra, “Formulating Dynamic Multi-rigid-body Contact Problems with Friction as Solvable Linear Complementarity Problems,” Nonlinear Dynamics, vol. 14, pp. 231–247, 1997.
  2. M. Posa, C. Cantu, and R. Tedrake, “A direct method for trajectory optimization of rigid bodies through contact,” The International Journal of Robotics Research, vol. 33, no. 1, pp. 69–81, 2014.
  3. S. Le Cleac’h, T. A. Howell, S. Yang, C.-Y. Lee, J. Zhang, A. Bishop, M. Schwager, and Z. Manchester, “Fast contact-implicit model predictive control,” IEEE Transactions on Robotics, 2024.
  4. A. Aydinoglu and M. Posa, “Real-time multi-contact model predictive control via admm,” in 2022 International Conference on Robotics and Automation (ICRA).   IEEE, 2022, pp. 3414–3421.
  5. W. Jin and M. Posa, “Task-driven hybrid model reduction for dexterous manipulation,” IEEE Transactions on Robotics, 2024.
  6. F. R. Hogan and A. Rodriguez, “Reactive planar non-prehensile manipulation with hybrid model predictive control,” The International Journal of Robotics Research, vol. 39, no. 7, pp. 755–773, Jun. 2020.
  7. C. Chen, P. Culbertson, M. Lepert, M. Schwager, and J. Bohg, “TrajectoTree: Trajectory Optimization Meets Tree Search for Planning Multi-contact Dexterous Manipulation,” in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Sep. 2021, pp. 8262–8268.
  8. A. Wu, S. Sadraddini, and R. Tedrake, “R3T: Rapidly-exploring Random Reachable Set Tree for Optimal Kinodynamic Planning of Nonlinear Hybrid Systems,” in 2020 IEEE International Conference on Robotics and Automation (ICRA), May 2020, pp. 4245–4251.
  9. T. Pang, H. T. Suh, L. Yang, and R. Tedrake, “Global planning for contact-rich manipulation via local smoothing of quasi-dynamic contact models,” IEEE Transactions on Robotics, 2023.
  10. H. Bharadhwaj, K. Xie, and F. Shkurti, “Model-predictive control via cross-entropy and gradient-based optimization,” in Learning for Dynamics and Control.   PMLR, 2020, pp. 277–286.
  11. K. Huang, S. Lale, U. Rosolia, Y. Shi, and A. Anandkumar, “CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning,” in arXiv.   arXiv, Dec. 2021, p. arXiv:2112.07746.
  12. H. J. T. Suh, T. Pang, and R. Tedrake, “Bundled gradients through contact via randomized smoothing,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 4000–4007, 2022.
  13. A. Ö. Önol, R. Corcodel, P. Long, and T. Padır, “Tuning-Free Contact-Implicit Trajectory Optimization,” in 2020 IEEE International Conference on Robotics and Automation (ICRA), May 2020, pp. 1183–1189.
  14. D. E. Stewart, “Rigid-Body Dynamics with Friction and Impact,” SIAM Review, vol. 42, no. 1, pp. 3–39, Jan. 2000.
  15. A. Nagabandi, G. Kahn, R. S. Fearing, and S. Levine, “Neural network dynamics for model-based deep reinforcement learning with model-free fine-tuning,” CoRL, vol. abs/1708.02596, 2017.
  16. G. Williams, N. Wagener, B. Goldfain, P. Drews, J. M. Rehg, B. Boots, and E. A. Theodorou, “Information theoretic MPC for model-based reinforcement learning,” in 2017 IEEE International Conference on Robotics and Automation (ICRA), May 2017, pp. 1714–1721.
  17. R. Y. Rubinstein, “Optimization of computer simulation models with rare events,” European Journal of Operational Research, vol. 99, no. 1, pp. 89–112, May 1997.
  18. M. Okada and T. Taniguchi, “Variational inference MPC for bayesian model-based reinforcement learning,” CoRL, vol. abs/1907.04202, 2019.
  19. C. Pinneri, S. Sawant, S. Blaes, J. Achterhold, J. Stueckler, M. Rolinek, and G. Martius, “Sample-efficient cross-entropy method for real-time planning,” in Conference on Robot Learning.   PMLR, 2021, pp. 1049–1065.
  20. L. Roveda, J. Maskani, P. Franceschi, A. Abdi, F. Braghin, L. M. Tosatti, and N. Pedrocchi, “Model-based reinforcement learning variable impedance control for human-robot collaboration,” Journal of Intelligent & Robotic Systems, pp. 1–17, 2020.
  21. K. Chua, R. Calandra, R. McAllister, and S. Levine, “Deep reinforcement learning in a handful of trials using probabilistic dynamics models,” Advances in neural information processing systems, vol. 31, 2018.
  22. B. Ichter, J. Harrison, and M. Pavone, “Learning Sampling Distributions for Robot Motion Planning,” in 2018 IEEE International Conference on Robotics and Automation (ICRA).   Brisbane, QLD: IEEE, May 2018, pp. 7087–7094.
  23. A. M. Castro, F. N. Permenter, and X. Han, “An unconstrained convex formulation of compliant contact,” IEEE Transactions on Robotics, vol. 39, no. 2, pp. 1301–1320, 2022.
  24. K. Haninger, K. Samuel, F. Rozzi, S. Oh, and L. Roveda, “Differentiable Compliant Contact Primitives for Estimation and Model Predictive Control,” in ICRA 2024, To Be Presented, Oct. 2023.
  25. S. Thrun, “Probabilistic robotics,” Communications of the ACM, vol. 45, no. 3, pp. 52–57, 2002.
Citations (2)

Summary

  • The paper introduces a hybrid method that combines CEM for initial guesses and gradient-based solvers for explicit constraint handling in contact-rich tasks.
  • Simulation results in the MuJoCo environment show improved solve times and reduced performance variance compared to traditional methods.
  • Real-world experiments validate robust performance in handling discontinuous dynamics and varying stiffness during robotic manipulation.

Combining Sampling- and Gradient-based Planning for Contact-rich Manipulation

This essay explores a hybrid planning methodology integrating both sampling- and gradient-based approaches for contact-rich robotic manipulation tasks. The proposed method is evaluated through rigorous simulations and real-world experiments to address challenges associated with discontinuous dynamics and state constraints in manipulating environments.

Problem Statement and Proposed Methodology

Contact-rich manipulation tasks, such as door opening, present significant challenges due to discontinuous environmental dynamics and constraints like force limits. Traditional planning methods—sampling-based planners like CEM and gradient-based solvers—each have limitations. Sampling-based methods face sample complexity issues in high-dimensional spaces and struggle with state constraints, while gradient-based methods may suffer from convergence to local optima and poor performance on non-smooth problems.

The paper introduces a planning approach that combines these two paradigms. It utilizes the Cross-Entropy Method (CEM) to provide initial guesses for a gradient-based solver. This initialization helps overcome the sensitivity of gradient-based methods to initial conditions, allowing explicit handling of state constraints. The proposal includes integrating a particle filter for online estimation of contact modes to improve contact handling.

The implementation of the proposed approach in simulation environments demonstrates its effectiveness in improving robustness and solve time under various contact conditions and planning horizons.

Simulation Experiments and Performance Metrics

The performance of the hybrid method is extensively evaluated in the MuJoCo simulation environment, demonstrating better effectiveness than standalone planning methods in contact-rich tasks. The following figures depict relevant findings from the simulations:

The block diagram of the proposed approach outlines the integration of CEM and gradient-based solvers, showcasing the hybrid architecture. Figure 1

Figure 1: Block diagram of the proposed approach showing the integration of CEM with gradient-based solvers.

Performance evaluations in MuJoCo with a planning horizon h=20h=20 highlight how the proposed method competes favorably with traditional methods like CEM and MPC alone, particularly in tasks where discontinuous dynamics play a critical role. Figure 2

Figure 2: Performance on the MuJoCo environments with planning horizon h=20h=20.

In tasks marked by stiff dynamics, comparisons between state and control trajectories from CEM versus MPC reveal that the proposed approach maintains lower variance in performance and faster convergence, even under challenging conditions. Figure 3

Figure 3: Comparing state and control trajectory from CEM vs MPC for stiff dynamics, the x-, y-, and z-positions shown in red, green, blue, respectively.

Real-world Validation and Analysis

To further affirm the efficacy of the approach, real-world experiments were conducted on a robot executing defined contact-rich tasks, such as transitioning between free space and contact with environments of varying stiffness. The effect of stiffness on contact detection was evident, with quicker adaptation in stiffer environments due to the particle filter's efficiency. Figure 4

Figure 4: Effect of stiffness on contact detection highlights the sensitivity of mode detection to environment stiffness.

The method's real-world applicability is further confirmed in a pivot task, where the CEM+MPC approach outperforms a simple MPC method by reducing discontinuities observed in the impedance rest position during contact transitions. Figure 5

Figure 5

Figure 5: Comparison between the simple MPC and CEM+MPC approaches for the pivot task.

Conclusion

The combination of the Cross-Entropy Method with a gradient-based planner offers a robust framework for handling contact-rich manipulation tasks with discontinuous dynamics. The hybrid approach benefits from the strengths of both sampling- and gradient-based methods, enabling better initialization, explicit constraint handling, and improved solve times in environments requiring adaptive contact management. Future research directions may explore further optimization of sampling and gradient steps and apply the framework to a broader set of robotic manipulation challenges.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.