
Auto-Encoding Adversarial Imitation Learning (2206.11004v5)

Published 22 Jun 2022 in cs.LG

Abstract: Reinforcement learning (RL) provides a powerful framework for decision-making, but applying it in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) enables automatic policy acquisition without access to a reward signal from the environment. In this work, we propose Auto-Encoding Adversarial Imitation Learning (AEAIL), a robust and scalable AIL framework. To induce expert policies from demonstrations, AEAIL uses the reconstruction error of an auto-encoder as the reward signal, which carries more information for policy optimization than prior discriminator-based rewards. We then use the derived objective functions to train the auto-encoder and the agent policy. Experiments show that AEAIL outperforms state-of-the-art methods in both state-based and image-based environments. More importantly, AEAIL is substantially more robust when the expert demonstrations are noisy.
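The core idea can be sketched compactly: the auto-encoder plays the role of the discriminator, trained adversarially (in the spirit of energy-based GANs) to reconstruct expert samples well and policy samples poorly, while the policy is rewarded for visiting states the auto-encoder reconstructs well. The following minimal PyTorch sketch illustrates this loop; the network sizes, the plain difference-of-errors objective, and the negative-error reward transform are illustrative assumptions, not the paper's exact architecture or hyperparameters.

```python
# Minimal sketch of the AEAIL idea: auto-encoder reconstruction error as reward.
# Architecture, loss weighting, and reward shaping are assumptions for illustration.
import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self, obs_dim, latent_dim=32):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU(),
                                     nn.Linear(128, latent_dim))
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 128), nn.ReLU(),
                                     nn.Linear(128, obs_dim))

    def recon_error(self, x):
        # Per-sample squared reconstruction error ||x - D(E(x))||^2.
        return ((self.decoder(self.encoder(x)) - x) ** 2).mean(dim=-1)

ae = AutoEncoder(obs_dim=17)  # e.g. a MuJoCo state vector
opt = torch.optim.Adam(ae.parameters(), lr=3e-4)

def update_autoencoder(expert_batch, policy_batch):
    # Adversarial objective (EBGAN-style): drive reconstruction error down on
    # expert samples and up on policy samples, so that low error marks
    # expert-like states.
    loss = ae.recon_error(expert_batch).mean() - ae.recon_error(policy_batch).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

def reward(policy_batch):
    # The policy is rewarded for states the auto-encoder reconstructs well,
    # i.e. states that look expert-like. The negative-error transform here is
    # one simple choice; the exact shaping in the paper may differ.
    with torch.no_grad():
        return -ae.recon_error(policy_batch)
```

In use, `reward` replaces the environment reward inside any standard RL algorithm (e.g. PPO or SAC), and `update_autoencoder` alternates with policy updates, mirroring the discriminator/generator alternation in GAIL-style methods.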

Authors (4)
  1. Kaifeng Zhang
  2. Rui Zhao
  3. Ziming Zhang
  4. Yang Gao