2000 character limit reached
Deep Predictive Model Learning with Parametric Bias: Handling Modeling Difficulties and Temporal Model Changes (2404.15726v1)
Published 24 Apr 2024 in cs.RO
Abstract: When a robot executes a task, it is necessary to model the relationship among its body, target objects, tools, and environment, and to control its body to realize the target state. However, it is difficult to model them using classical methods if the relationship is complex. In addition, when the relationship changes with time, it is necessary to deal with the temporal changes of the model. In this study, we have developed Deep Predictive Model with Parametric Bias (DPMPB) as a more human-like adaptive intelligence to deal with these modeling difficulties and temporal model changes. We categorize and summarize the theory of DPMPB and various task experiments on the actual robots, and discuss the effectiveness of DPMPB.
- H. Kobayashi, K. Hyodo, and D. Ogane, “On Tendon-Driven Robotic Mechanisms with Redundant Tendons,” The International Journal of Robotics Research, vol. 17, no. 5, pp. 561–571, 1998.
- C. C. Kemp and A. Edsinger, “Robot manipulation of human tools: Autonomous detection and control of task relevant features,” in Proceeding of the 2006 International Conference on Development and Learning, 2006, pp. 1–6.
- C. Lee, M. Kim, Y. J. Kim, N. Hong, S. Ryu, H. J. Kim, and S. Kim, “Soft robot review,” International Journal of Control, Automation and Systems, vol. 15, no. 1, pp. 3–15, 2017.
- D. Tanaka, S. Arnold, and K. Yamazaki, “EMD Net: An Encode-Manipulate-Decode Network for Cloth Manipulation,” IEEE Robotics and Automation Letters, vol. 3, no. 3, pp. 1771–1778, 2018.
- Y. Kuniyoshi and S. Suzuki, “Dynamic emergence and adaptation of behavior through embodiment as coupled chaotic field,” in Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004, pp. 2042–2049.
- J. Lee, J. Hwangbo, L. Wellhausen, V. Koltun, and M. Hutter, “Learning quadrupedal locomotion over challenging terrain,” Science Robotics, vol. 5, no. 47, 2020.
- K. Kawaharazuka, K. Tsuzuki, M. Onitsuka, Y. Asano, K. Okada, K. Kawasaki, and M. Inaba, “Object Recognition, Dynamic Contact Simulation, Detection, and Control of the Flexible Musculoskeletal Hand Using a Recurrent Neural Network With Parametric Bias,” IEEE Robotics and Automation Letters, vol. 5, no. 3, pp. 4580–4587, 2020.
- K. Kawaharazuka, N. Kanazawa, K. Okada, and M. Inaba, “Self-Supervised Learning of Visual Servoing for Low-Rigidity Robots Considering Temporal Body Changes,” IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7881–7887, 2022.
- K. Kawaharazuka, K. Shinjo, Y. Kawamura, K. Okada, and M. Inaba, “Environmentally Adaptive Control Including Variance Minimization Using Stochastic Predictive Network with Parametric Bias: Application to Mobile Robots,” in Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021, pp. 8381–8387.
- K. Kawaharazuka, Y. Kawamura, K. Okada, and M. Inaba, “Imitation Learning with Additional Constraints on Motion Style using Parametric Bias,” IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 5897–5904, 2021.
- K. Kawaharazuka, Y. Ribayashi, A. Miki, Y. Toshimitsu, T. Suzuki, K. Okada, and M. Inaba, “Learning of Balance Controller Considering Changes in Body State for Musculoskeletal Humanoids,” in Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022, pp. 5809–5816.
- K. Kawaharazuka, A. Miki, M. Bando, K. Okada, and M. Inaba, “Dynamic Cloth Manipulation Considering Variable Stiffness and Material Change Using Deep Predictive Model With Parametric Bias,” Frontiers in Neurorobotics, vol. 16, pp. 1–16, 2022.
- J. Tani, “Self-organization of behavioral primitives as multiple attractor dynamics: a robot experiment,” in Proceedings of the 2002 International Joint Conference on Neural Networks, 2002, pp. 489–494.
- T. Ogata, H. Ohba, J. Tani, K. Komatani, and H. G. Okuno, “Extracting multi-modal dynamics of objects using RNNPB,” in Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005, pp. 966–971.
- R. Tedrake, T. W. Zhang, and H. S. Seung, “Stochastic policy gradient reinforcement learning on a simple 3D biped,” in Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004, pp. 2849–2854.
- T. Zhang, Z. McCarthy, O. Jow, D. Lee, X. Chen, K. Goldberg, and P. Abbeel, “Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation,” in Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018, pp. 5628–5635.
- Y. Yang, K. Caluwaerts, A. Iscen, T. Zhang, J. Tan, and V. Sindhwani, “Data Efficient Reinforcement Learning for Legged Robots,” in Proceedings of the 2019 Conference on Robot Learning, 2019, pp. 1–10.
- K. Schmeckpeper, A. Xie, O. Rybkin, S. Tian, K. Daniilidis, S. Levine, and C. Finn, “Learning Predictive Models from Observation and Interaction,” in Proceedings of 2020 European Conference on Computer Vision, 2020, pp. 708–725.
- K. Kawaharazuka, T. Ogawa, J. Tamura, and C. Nabeshima, “Dynamic Manipulation of Flexible Objects with Torque Sequence Using a Deep Neural Network,” in Proceedings of the 2019 IEEE International Conference on Robotics and Automation, 2019, pp. 2139–2145.
- K. Kawaharazuka, K. Tsuzuki, M. Onitsuka, Y. Asano, K. Okada, K. Kawasaki, and M. Inaba, “Musculoskeletal AutoEncoder: A Unified Online Acquisition Method of Intersensory Networks for State Estimation, Control, and Simulation of Musculoskeletal Humanoids,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 2411–2418, 2020.
- S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997.
- D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” in Proceedings of the 3rd International Conference on Learning Representations, 2015, pp. 1–15.
- K. Kawaharazuka, S. Makino, K. Tsuzuki, M. Onitsuka, Y. Nagamatsu, K. Shinjo, T. Makabe, Y. Asano, K. Okada, K. Kawasaki, and M. Inaba, “Component Modularized Design of Musculoskeletal Humanoid Platform Musashi to Investigate Learning Control Systems,” in Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 7294–7301.
- K. Kawaharazuka, K. Tsuzuki, S. Makino, M. Onitsuka, K. Shinjo, Y. Asano, K. Okada, K. Kawasaki, and M. Inaba, “Task-specific Self-body Controller Acquisition by Musculoskeletal Humanoids: Application to Pedal Control in Autonomous Driving,” in Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 813–818.