Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Body Schema Acquisition through Active Learning (2402.06067v1)

Published 8 Feb 2024 in cs.RO

Abstract: We present an active learning algorithm for the problem of body schema learning, i.e. estimating a kinematic model of a serial robot. The learning process is done online using Recursive Least Squares (RLS) estimation, which outperforms gradient methods usually applied in the literature. In addiction, the method provides the required information to apply an active learning algorithm to find the optimal set of robot configurations and observations to improve the learning process. By selecting the most informative observations, the proposed method minimizes the required amount of data. We have developed an efficient version of the active learning algorithm to select the points in real-time. The algorithms have been tested and compared using both simulated environments and a real humanoid robot.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. A. D’Souza, S. Vijayakumar, and S. Schaal, “Learning inverse kinematics,” in International Conference on Intelligent Robots and Systems, Hawaii, USA, 2001, pp. 298–303.
  2. M. Lopes and B. Damas, “The manifold structure of sensory-motor coordination,” in IEEE - Intelligent Robotic Systems (IROS’07), USA, 2007.
  3. D. Nguyen-Tuong and J. Peters, “Local gaussian process regression for real-time model-based robot control,” in Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2008, 2008, pp. 380–385.
  4. D. Bennett and J. M. Hollerbach, “Self-calibration of single-loop, closed kinematic chains formed by dualor redundant manipulators,” in 27th IEEE Conference on Decision and Control, vol. 1, Dec 1988, pp. 627 – 629.
  5. P. Marić and V. Potkonjak, “Geometrical parameter estimation for industrial manipulators using two-stepestimation schemes,” Journal of Intelligent and Robotic Systems, vol. 24, no. 1, pp. 1573–0409, January 1999.
  6. A. Maravita and A. Iriki, “Tools for the body (schema),” Trends in Cognitive Sciences, vol. 8, no. 2, pp. 79–86, February 2004.
  7. M. Hersch, E. Sauser, and A. Billard, “Online learning of the body schema,” International Journal of Humanoid Robotics, vol. 5, no. 2, pp. 161–181, 2008.
  8. J. Sturm, C. Plagemann, and W. Burgard, “Adaptive body scheme models for robust robotic manipulation,” in RSS - Robotics Science and Systems IV, Zurich, Switzerland, june 2008.
  9. J. Bongard, V. Zykov, and H. Lipson, “Resilient machines through continuous self-modeling,” Science, vol. 314, no. 5802, pp. 1118–21, 2006.
  10. H. Choset, “Coverage for robotics – a survey of recent results,” Annals of Mathematics and Artificial Intelligence, vol. 31, no. 1, pp. 113–126, 2001.
  11. D. Katz, Y. Pyuro, and O. Brock, “Learning to manipulate articulated objects in unstructured environmentsusing a grounded relational representation,” in RSS - Robotics Science and Systems IV, Zurich, Switzerland, june 2008.
  12. K. Chaloner and I. Verdinelli, “Bayesian experimental design: A review,” J. of Statistical Science, vol. 10, pp. 273–304, 1995.
  13. D. Jones, C. Perttunen, and B. Stuckman, “Lipschitzian optimization without the Lipschitz constant,” J. of Optimiz. Theory. App., vol. 79, no. 1, pp. 157–181, October 1993.
  14. R. Sim and N. Roy, “Global A-optimal robot exploration in SLAM,” in Proc. of the IEEE Int. Conf. on Robotics & Automation, 2005.
  15. R. Martinez-Cantin, N. de Freitas, E. Brochu, J. Castellanos, and A. Doucet, “A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot.” Autonomous Robots - Special Issue on Robot Learning, Part B, 2009.
  16. A. Ng and M. Jordan, “PEGASUS: A policy search method for large MDPs and POMDPs.” in Proc. of the Sixteenth Conf. on Uncertainty in Artificial Intelligence, 2000.
  17. R. Martinez-Cantin, N. de Freitas, and J. Castellanos, “Active policy learning for robot planning and exploration under uncertainty,” in Proc. of Robotics: Science and Systems, 2007.
  18. R. Smallwood and E. Sondik, “The optimal control of partially observable Markov processes over a finite horizon,” Operations Research, vol. 21, pp. 1071–1088, 1973.
  19. J. Gablonsky, “Modification of the DIRECT algorithm,” Ph.D. dissertation, Department of Mathematics, North Carolina State University, Raleigh, North Carolina, 2001.
  20. M. Lopes, R. Beira, M. Praça, and J. Santos-Victor, “An anthropomorphic robot torso for imitation: design and experiments.” in International Conference on Intelligent Robots and Systems, Sendai, Japan, 2004.
  21. I. Poupyrev, H. Kato, and M. Billinghurst, “Artoolkit user manual, version 2.33,” University of Washington, Tech. Rep., 2000.
Citations (52)

Summary

We haven't generated a summary for this paper yet.