2000 character limit reached
Meta-Learning Parameterized Skills (2206.03597v4)
Published 7 Jun 2022 in cs.LG and cs.AI
Abstract: We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL combined with a trajectory-centric smoothness term to learn a set of parameterized skills. Our agent can use these learned skills to construct a three-level hierarchical framework that models a Temporally-extended Parameterized Action Markov Decision Process. We empirically demonstrate that the proposed algorithms enable an agent to solve a set of difficult long-horizon (obstacle-course and robot manipulation) tasks.
- Haotian Fu (22 papers)
- Shangqun Yu (12 papers)
- Saket Tiwari (6 papers)
- Michael Littman (17 papers)
- George Konidaris (71 papers)