Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Imitation Learning from Imperfect Demonstration (1901.09387v3)

Published 27 Jan 2019 in cs.LG, cs.AI, and stat.ML

Abstract: Imitation learning (IL) aims to learn an optimal policy from demonstrations. However, such demonstrations are often imperfect since collecting optimal ones is costly. To effectively learn from imperfect demonstrations, we propose a novel approach that utilizes confidence scores, which describe the quality of demonstrations. More specifically, we propose two confidence-based IL methods, namely two-step importance weighting IL (2IWIL) and generative adversarial IL with imperfect demonstration and confidence (IC-GAIL). We show that confidence scores given only to a small portion of sub-optimal demonstrations significantly improve the performance of IL both theoretically and empirically.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yueh-Hua Wu (18 papers)
  2. Nontawat Charoenphakdee (21 papers)
  3. Han Bao (77 papers)
  4. Voot Tangkaratt (18 papers)
  5. Masashi Sugiyama (286 papers)
Citations (147)