Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios (2212.11419v2)

Published 21 Dec 2022 in cs.AI and cs.RO

Abstract: Imitation learning (IL) is a simple and powerful way to use high-quality human driving data, which can be collected at scale, to produce human-like behavior. However, policies based on imitation learning alone often fail to sufficiently account for safety and reliability concerns. In this paper, we show how imitation learning combined with reinforcement learning using simple rewards can substantially improve the safety and reliability of driving policies over those learned from imitation alone. In particular, we train a policy on over 100k miles of urban driving data, and measure its effectiveness in test scenarios grouped by different levels of collision likelihood. Our analysis shows that while imitation can perform well in low-difficulty scenarios that are well-covered by the demonstration data, our proposed approach significantly improves robustness on the most challenging scenarios (over 38% reduction in failures). To our knowledge, this is the first application of a combined imitation and reinforcement learning approach in autonomous driving that utilizes large amounts of real-world human driving data.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Yiren Lu (17 papers)
  2. Justin Fu (20 papers)
  3. George Tucker (45 papers)
  4. Xinlei Pan (13 papers)
  5. Eli Bronstein (7 papers)
  6. Rebecca Roelofs (19 papers)
  7. Benjamin Sapp (16 papers)
  8. Brandyn White (7 papers)
  9. Aleksandra Faust (60 papers)
  10. Shimon Whiteson (122 papers)
  11. Dragomir Anguelov (73 papers)
  12. Sergey Levine (531 papers)
Citations (70)

Summary

We haven't generated a summary for this paper yet.