Task-Driven Hybrid Model Reduction for Dexterous Manipulation (2211.16657v2)
Abstract: In contact-rich tasks, like dexterous manipulation, the hybrid nature of making and breaking contact creates challenges for model representation and control. For example, choosing and sequencing contact locations for in-hand manipulation, where there are thousands of potential hybrid modes, is not generally tractable. In this paper, we are inspired by the observation that far fewer modes are actually necessary to accomplish many tasks. Building on our prior work learning hybrid models, represented as linear complementarity systems, we find a reduced-order hybrid model requiring only a limited number of task-relevant modes. This simplified representation, in combination with model predictive control, enables real-time control yet is sufficient for achieving high performance. We demonstrate the proposed method first on synthetic hybrid systems, reducing the mode count by multiple orders of magnitude while achieving task performance loss of less than 5%. We also apply the proposed method to a three-fingered robotic hand manipulating a previously unknown object. With no prior knowledge, we achieve state-of-the-art closed-loop performance within a few minutes of online learning, by collecting only a few thousand environment samples.
- M. Parmar, M. Halm, and M. Posa, “Fundamental challenges in deep learning for stiff contact dynamics,” in IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2021, pp. 5181–5188.
- B. Bianchini, M. Halm, N. Matni, and M. Posa, “Generalization bounded implicit learning of nearly discontinuous functions,” in Learning for Dynamics and Control Conference. PMLR, 2022, pp. 1112–1124.
- S. Pfrommer, M. Halm, and M. Posa, “Contactnets: Learning discontinuous contact dynamics with smooth, implicit representations,” in Conference on Robot Learning, 2020.
- W. Jin, T. D. Murphey, D. Kulić, N. Ezer, and S. Mou, “Learning from sparse demonstrations,” IEEE Transactions on Robotics, 2022.
- Y.-M. Chen and M. Posa, “Optimal reduced-order modeling of bipedal locomotion,” in IEEE International Conference on Robotics and Automation. IEEE, 2020, pp. 8753–8760.
- A. Aydinoglu and M. Posa, “Real-time multi-contact model predictive control via admm,” in IEEE International Conference on Robotics and Automation, 2022, pp. 3414–3421.
- S. L. Cleac’h, T. Howell, M. Schwager, and Z. Manchester, “Fast contact-implicit model-predictive control,” arXiv preprint arXiv:2107.05616, 2021.
- F. de Avila Belbute-Peres, K. Smith, K. Allen, J. Tenenbaum, and J. Z. Kolter, “End-to-end differentiable physics for learning and control,” Advances in neural information processing systems, vol. 31, 2018.
- M. Geilinger, D. Hahn, J. Zehnder, M. Bächer, B. Thomaszewski, and S. Coros, “Add: Analytically differentiable dynamics for multi-body systems with frictional contact,” ACM Transactions on Graphics, vol. 39, no. 6, pp. 1–15, 2020.
- E. Heiden, D. Millard, E. Coumans, Y. Sheng, and G. S. Sukhatme, “Neuralsim: Augmenting differentiable simulators with neural networks,” in IEEE International Conference on Robotics and Automation. IEEE, 2021, pp. 9474–9481.
- T. A. Howell, S. L. Cleac’h, J. Z. Kolter, M. Schwager, and Z. Manchester, “Dojo: A differentiable simulator for robotics,” arXiv preprint arXiv:2203.00806, 2022.
- H. J. Suh, M. Simchowitz, K. Zhang, and R. Tedrake, “Do differentiable simulators give better policy gradients?” in International Conference on Machine Learning. PMLR, 2022, pp. 20 668–20 696.
- E. Todorov, T. Erez, and Y. Tassa, “Mujoco: A physics engine for model-based control,” in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012, pp. 5026–5033.
- D. Stewart and J. C. Trinkle, “An implicit time-stepping scheme for rigid body dynamics with coulomb friction,” in IEEE International Conference on Robotics and Automation, vol. 1, 2000, pp. 162–169.
- E. Coumans et al., “Bullet physics library,” Open source: bulletphysics. org, vol. 15, no. 49, p. 5, 2013.
- R. Tedrake and the Drake Development Team, “Drake: Model-based design and verification for robotics,” 2019. [Online]. Available: https://drake.mit.edu
- W. P. Heemels, B. De Schutter, and A. Bemporad, “Equivalence of hybrid dynamical models,” Automatica, vol. 37, no. 7, pp. 1085–1091, 2001.
- A. Bemporad, F. Borrelli, and M. Morari, “Piecewise linear optimal controllers for hybrid systems,” in American Control Conference, vol. 2. IEEE, 2000, pp. 1190–1194.
- T. Marcucci and R. Tedrake, “Mixed-integer formulations for optimal control of piecewise-affine systems,” in ACM International Conference on Hybrid Systems: Computation and Control, 2019, pp. 230–239.
- F. Lauer, “On the complexity of piecewise affine system identification,” Automatica, vol. 62, pp. 148–153, 2015.
- G. Ferrari-Trecate, M. Muselli, D. Liberati, and M. Morari, “A clustering technique for the identification of piecewise affine systems,” Automatica, vol. 39, no. 2, pp. 205–217, 2003.
- A. Bemporad, “A piecewise linear regression and classification algorithm with application to learning and model predictive control of hybrid systems,” IEEE Transactions on Automatic Control, 2022.
- J.-P. Sleiman, F. Farshidian, M. V. Minniti, and M. Hutter, “A unified mpc framework for whole-body dynamic locomotion and manipulation,” IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 4688–4695, 2021.
- C. Mastalli, R. Budhiraja, W. Merkt, G. Saurel, B. Hammoud, M. Naveau, J. Carpentier, L. Righetti, S. Vijayakumar, and N. Mansard, “Crocoddyl: An efficient and versatile framework for multi-contact optimal control,” in IEEE International Conference on Robotics and Automation. IEEE, 2020, pp. 2536–2542.
- A. W. Winkler, C. D. Bellicoso, M. Hutter, and J. Buchli, “Gait and trajectory optimization for legged systems through phase-based end-effector parameterization,” IEEE Robotics and Automation Letters, vol. 3, no. 3, pp. 1560–1567, 2018.
- F. R. Hogan and A. Rodriguez, “Reactive planar non-prehensile manipulation with hybrid model predictive control,” The International Journal of Robotics Research, vol. 39, no. 7, pp. 755–773, 2020.
- M. Posa, C. Cantu, and R. Tedrake, “A direct method for trajectory optimization of rigid bodies through contact,” The International Journal of Robotics Research, vol. 33, no. 1, pp. 69–81, 2014.
- J. A. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl, “Casadi: a software framework for nonlinear optimization and optimal control,” Mathematical Programming Computation, vol. 11, no. 1, pp. 1–36, 2019.
- A. Nagabandi, K. Konolige, S. Levine, and V. Kumar, “Deep dynamics models for learning dexterous manipulation,” in Conference on Robot Learning. PMLR, 2020, pp. 1101–1112.
- A. Allshire, M. Mittal, V. Lodaya, V. Makoviychuk, D. Makoviichuk, F. Widmaier, M. Wüthrich, S. Bauer, A. Handa, and A. Garg, “Transferring dexterous manipulation from gpu simulation to a remote real-world trifinger,” arXiv preprint arXiv:2108.09779, 2021.
- O. M. Andrychowicz, B. Baker, M. Chociej, R. Jozefowicz, B. McGrew, J. Pachocki, A. Petron, M. Plappert, G. Powell, A. Ray et al., “Learning dexterous in-hand manipulationnagabandi2020deep,” The International Journal of Robotics Research, vol. 39, no. 1, pp. 3–20, 2020.
- P.-T. De Boer, D. P. Kroese, S. Mannor, and R. Y. Rubinstein, “A tutorial on the cross-entropy method,” Annals of operations research, vol. 134, no. 1, pp. 19–67, 2005.
- T. Wang, X. Bao, I. Clavera, J. Hoang, Y. Wen, E. Langlois, S. Zhang, G. Zhang, P. Abbeel, and J. Ba, “Benchmarking model-based reinforcement learning,” arXiv preprint arXiv:1907.02057, 2019.
- S. Kajita and K. Tani, “Study of dynamic biped locomotion on rugged terrain-derivation and application of the linear inverted pendulum mode,” in IEEE International Conference on Robotics and Automation. IEEE Computer Society, 1991, pp. 1405–1406.
- D. E. Orin, A. Goswami, and S.-H. Lee, “Centroidal dynamics of a humanoid robot,” Autonomous robots, vol. 35, no. 2, pp. 161–176, 2013.
- A. Pandala, R. T. Fawcett, U. Rosolia, A. D. Ames, and K. A. Hamed, “Robust predictive control for quadrupedal locomotion: Learning to close the gap between reduced-and full-order models,” IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 6622–6629, 2022.
- D. E. Stewart, “Rigid-body dynamics with friction and impact,” SIAM review, vol. 42, no. 1, pp. 3–39, 2000.
- E. Todorov, “A convex, smooth and invertible contact model for trajectory optimization,” in IEEE International Conference on Robotics and Automation, 2011, pp. 1071–1076.
- M. J. Tsatsomeros, “Generating and detecting matrices with positive principal minors,” Asian Information-Science-Life: An International Journal, vol. 1, no. 2, pp. 115–132, 2002.
- W. Jin, A. Aydinoglu, M. Halm, and M. Posa, “Learning linear complementarity systems,” Annual Learning for Dynamics and Control Conference, 2022.
- B. Stellato, G. Banjac, P. Goulart, A. Bemporad, and S. Boyd, “OSQP: an operator splitting solver for quadratic programs,” Mathematical Programming Computation, vol. 12, no. 4, pp. 637–672, 2020. [Online]. Available: https://doi.org/10.1007/s12532-020-00179-2
- S. Afriat, “Theory of maxima and the method of lagrange,” SIAM Journal on Applied Mathematics, vol. 20, no. 3, pp. 343–357, 1971.
- Y.-x. Yuan, “A review of trust region algorithms for optimization,” in International Council for Industrial and Applied Mathematics, vol. 99, no. 1, 2000, pp. 271–282.
- A. Wächter and L. T. Biegler, “On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming,” Mathematical programming, vol. 106, no. 1, pp. 25–57, 2006.
- M. Wüthrich, F. Widmaier, F. Grimminger, J. Akpo, S. Joshi, V. Agrawal, B. Hammoud, M. Khadiv, M. Bogdanovic, V. Berenz et al., “Trifinger: An open-source robot for learning dexterity,” arXiv preprint arXiv:2008.03596, 2020.
- P. M. Wensing and D. E. Orin, “Generation of dynamic humanoid behaviors through task-space control with conic optimization,” in IEEE International Conference on Robotics and Automation, 2013, pp. 3103–3109.
- H. Dai, G. Izatt, and R. Tedrake, “Global inverse kinematics via mixed-integer convex optimization,” The International Journal of Robotics Research, vol. 38, no. 12-13, pp. 1420–1441, 2019.
- A. Rajeswaran, I. Mordatch, and V. Kumar, “A game theoretic framework for model based reinforcement learning,” in International conference on machine learning, 2020, pp. 7953–7963.