Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation (2307.03659v1)
Abstract: What makes generalization hard for imitation learning in visual robotic manipulation? This question is difficult to approach at face value, but the environment from the perspective of a robot can often be decomposed into enumerable factors of variation, such as the lighting conditions or the placement of the camera. Empirically, generalization to some of these factors has presented a greater obstacle than to others, but existing work sheds little light on precisely how much each factor contributes to the generalization gap. Towards an answer to this question, we study imitation learning policies in simulation and on a real-robot language-conditioned manipulation task to quantify the difficulty of generalizing to different (sets of) factors. We also design a new simulated benchmark of 19 tasks with 11 factors of variation to facilitate more controlled evaluations of generalization. From our study, we determine an ordering of the factors by generalization difficulty that is consistent across simulation and our real-robot setup.
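The evaluation protocol the abstract describes can be made concrete: hold the policy fixed, shift one environment factor at a time out of the training distribution, measure the resulting drop in success rate, and order the factors by that drop. The following is a minimal sketch of that idea, not the authors' released code; `make_env`, `rollout_success_rate`, the stub `policy`, and the factor list shown are all hypothetical stand-ins for a real simulator and a trained language-conditioned policy.

```python
import random

# Illustrative subset of factors of variation; the paper's benchmark
# varies 11 such factors across 19 tasks.
FACTORS = [
    "lighting", "camera_pose", "table_texture",
    "distractor_objects", "background",
]

def make_env(shift=None):
    """Hypothetical environment factory; `shift` names the one factor
    placed out of the training distribution (None = in-distribution)."""
    return {"shift": shift}

def policy(env):
    """Stub standing in for a trained policy: returns True on success.
    (Here, a coin flip whose bias drops under a factor shift.)"""
    return random.random() < (0.9 if env["shift"] is None else 0.6)

def rollout_success_rate(pi, env, n_episodes):
    """Fraction of episodes the policy completes successfully."""
    return sum(pi(env) for _ in range(n_episodes)) / n_episodes

def generalization_gap(pi, factor, n_episodes=100):
    """Per-factor gap: in-distribution success minus shifted success."""
    in_dist = rollout_success_rate(pi, make_env(shift=None), n_episodes)
    shifted = rollout_success_rate(pi, make_env(shift=factor), n_episodes)
    return in_dist - shifted

# Order factors by generalization difficulty (largest gap first).
gaps = {f: generalization_gap(policy, f) for f in FACTORS}
for factor, gap in sorted(gaps.items(), key=lambda kv: -kv[1]):
    print(f"{factor:20s} gap = {gap:+.2f}")
```

Under this protocol, a factor's difficulty is simply the success-rate gap it induces, which is what makes the resulting ordering directly comparable between simulation and the real-robot setup.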