Driving Policy Transfer via Modularity and Abstraction (1804.09364v3)

Published 25 Apr 2018 in cs.RO, cs.CV, and cs.LG

Abstract: End-to-end approaches to autonomous driving have high sample complexity and are difficult to scale to realistic urban driving. Simulation can help end-to-end driving systems by providing a cheap, safe, and diverse training environment. Yet training driving policies in simulation brings up the problem of transferring such policies to the real world. We present an approach to transferring driving policies from simulation to reality via modularity and abstraction. Our approach is inspired by classic driving systems and aims to combine the benefits of modular architectures and end-to-end deep learning approaches. The key idea is to encapsulate the driving policy such that it is not directly exposed to raw perceptual input or low-level vehicle dynamics. We evaluate the presented approach in simulated urban environments and in the real world. In particular, we transfer a driving policy trained in simulation to a 1/5-scale robotic truck that is deployed in a variety of conditions, with no finetuning, on two continents. The supplementary video can be viewed at https://youtu.be/BrMDJqI6H5U

Authors (4)

Matthias Müller (41 papers)
Alexey Dosovitskiy (49 papers)
Bernard Ghanem (256 papers)
Vladlen Koltun (114 papers)

Citations (217)

View on Semantic Scholar

Summary

The paper presents a novel modular architecture that transfers driving policies from simulation to the real world by training on abstracted perception outputs.
It decomposes the system into three distinct modules—perception, driving policy, and low-level control—to enhance interpretability and robustness.
Experimental results demonstrate high route completion rates and reduced failures in both simulated and scaled real-world environments.

Analysis of "Driving Policy Transfer via Modularity and Abstraction"

The paper "Driving Policy Transfer via Modularity and Abstraction" introduces a novel approach towards enhancing the transferability of driving policies from simulation environments to real-world applications. The focus is on modular architectures to encapsulate the driving policy, thereby addressing challenges associated with the direct exposure to raw perceptual input and low-level vehicle dynamics. This technique stands as an alternative to end-to-end deep learning methods which, while promising, demand high sample complexity and present difficulties in scaling to complex environments such as urban driving.

Overview of Methodology

The presented architecture consists of three modular components: perception, driving policy, and low-level control. The perception system processes raw sensor inputs into semantic segmentations, providing a high-level abstraction of the driving scene. The driving policy consumes these segmentation outputs to formulate a trajectory plan characterized by waypoints. The low-level control module uses these waypoints to enact the desired vehicle movements.

Key to the strategy is training the driving policy on the output of the perception module, rather than idealized ground-truth data. This nuance facilitates policy resilience to perceptual noise, enhancing the reality-transfer capability. Both perception and driving modules are predominantly trained in simulation, enabling extensive exploration of diverse driving scenarios. Nevertheless, the policy outputs high-level semantic concepts, such as waypoints, which inherently possess superior generalization capacities compared to low-level control details.

Experimental Evaluation

The effectiveness of this modular approach is rigorously evaluated both in simulated and real-world environments. In the simulation experiments, the modular approach demonstrated robustness to various environmental conditions, outperforming monolithic end-to-end systems. Notably, the system successfully transferred policies trained entirely in simulation to a real-world 1/5-scale robotic truck operating in diverse conditions without requiring further finetuning.

In the physical world tests, the driving policy navigated roads with varying geometric complexity and environmental characteristics, evidencing the method’s comparative advantage against traditionally structured autonomous driving systems. The paper reports a high success rate in route completion and reduced system failures, highlighting the superiority of modular designs for sim-to-real transfer.

Implications and Future Directions

This research marks significant progress in the development of transferable autonomous driving systems. By relying on well-defined interfaces between modules, the system benefits from enhanced interpretability and diagnostic capabilities—a critical advantage over traditional approaches where error propagation across modules can obscure system performance analysis.

The modular architecture not only alleviates some limitations of deep learning approaches but also presents new avenues for integrating advanced learning techniques tailored to each module. Future research can extend this model to incorporate additional semantic data, such as obstacle detection and traffic signals, thus broadening its applicability to more varied real-world scenarios. Reinforcement learning could further be deployed within this modular framework to optimize component-specific control strategies.

This paper’s contributions lie in demonstrating a practical and scalable path for deploying autonomous driving policies across the sim-to-real gap, with prospective implications for commercial deployment where safety, reliability, and flexibility are paramount. The modular approach aligns with a growing recognition of hybrid systems, blending the best of both deep learning and classic engineering paradigms to tackle complex, real-world problems.

PDF Markdown

Related Papers

YouTube

Show All Videos