Papers

Topics

Authors

Recent

View all

Gemini 2.5 Flash

80 tokens/sec

GPT-4o

59 tokens/sec

Gemini 2.5 Pro Pro

43 tokens/sec

o3 Pro

7 tokens/sec

GPT-4.1 Pro

50 tokens/sec

DeepSeek R1 via Azure Pro

28 tokens/sec

2000 character limit reached

A Survey of Deep Learning Techniques for Autonomous Driving (1910.07738v2)

Published 17 Oct 2019 in cs.LG and cs.RO

Abstract: The last decade witnessed increasingly rapid progress in self-driving vehicle technology, mainly backed up by advances in the area of deep learning and artificial intelligence. The objective of this paper is to survey the current state-of-the-art on deep learning technologies used in autonomous driving. We start by presenting AI-based self-driving architectures, convolutional and recurrent neural networks, as well as the deep reinforcement learning paradigm. These methodologies form a base for the surveyed driving scene perception, path planning, behavior arbitration and motion control algorithms. We investigate both the modular perception-planning-action pipeline, where each module is built using deep learning methods, as well as End2End systems, which directly map sensory information to steering commands. Additionally, we tackle current challenges encountered in designing AI architectures for autonomous driving, such as their safety, training data sources and computational hardware. The comparison presented in this survey helps to gain insight into the strengths and limitations of deep learning and AI approaches for autonomous driving and assist with design choices

PDF Abstract

A Survey of Deep Learning Techniques for Autonomous Driving

Introduction

The paper "A Survey of Deep Learning Techniques for Autonomous Driving" provides a comprehensive examination of the current state-of-the-art deep learning methodologies applied to various facets of self-driving car technology. These include driving scene perception, path planning, behavior arbitration, and motion control. The authors analyze several architectures, including convolutional and recurrent neural networks and the deep reinforcement learning paradigm. The paper contrasts the modular perception-planning-action pipeline with End2End systems, elaborating on the strengths, limitations, and future challenges of each approach.

Deep Learning-Based Decision-Making Architectures

The modular pipeline approach is decomposed into four primary components: Perception and Localization, High-Level Path Planning, Behavior Arbitration, and Motion Controllers. The perception system processes data from multiple sensors, including cameras, radars, LiDARs, GPS, and ultrasonic sensors, to understand and localize the vehicle within its environment. The high-level path planning component devises a route while considering dynamic and static obstacles. Behavior arbitration anticipates future actions based on environmental cues, and motion controllers execute the planned trajectory, rectifying any deviations.

Conversely, End2End learning systems leverage deep neural networks to directly map sensory inputs to control outputs, thereby potentially simplifying the pipeline and reducing latency associated with intermediate computations. The paper explores the intricate details of both approaches, providing a nuanced view of their implementation and comparative effectiveness.

Methodologies for Deep Learning in Autonomous Driving

The survey extensively discusses the three primary deep learning technologies—Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Deep Reinforcement Learning (DRL).

CNNs: Primarily used for image processing, CNNs in autonomous driving facilitate object detection, lane detection, and semantic segmentation. The paper highlights the efficiency of CNNs in learning spatial hierarchies through convolutions and pooling operations, which make them suitable for tasks involving spatial data representations.
RNNs and LSTM Networks: These networks capture temporal dependencies and are particularly useful for tasks involving sequential data, such as tracking the dynamic changes in a driving environment. Long Short-Term Memory (LSTM) networks mitigate the vanishing gradient problem associated with traditional RNNs, making them effective for long-term dependencies.
DRL: This methodology is framed as a Partially Observable Markov Decision Process (POMDP) problem, where the agent learns to navigate from the starting state to the destination by maximizing cumulative rewards. The paper explores the integration of DRL with other deep learning methodologies to enhance decision-making processes in complex driving environments.

Driving Scene Perception and Localization

A critical aspect of autonomous driving is the vehicle's ability to perceive its surroundings. The paper surveys various methods for object detection, semantic segmentation, and environment perception using cameras and LiDAR sensors. It also explores the debate between the efficacy of camera-based and LiDAR-based sensors, referencing studies and real-world implementations from leading industry players like Tesla and Waymo.

Path Planning and Behavior Arbitration

The paper evaluates traditional and deep learning-based path planning methods. Imitation Learning (IL) and Deep Reinforcement Learning (DRL) are highlighted as two prominent approaches. IL leverages human driving data to train models, whereas DRL learns optimal driving policies through simulation and interaction with the environment. The review considers the strengths and limitations of these methods in navigating complex driving scenarios and adapting to dynamic environments.

Motion Control Systems

The discussion on motion control encompasses both traditional model-based controllers and learning-based controllers such as Iterative Learning Control (ILC) and Model Predictive Control (MPC). The paper argues that hybrid approaches combining model-based control with learning components can yield better efficiency and adaptability, accommodating the nuanced dynamics encountered in various driving conditions.

Safety Considerations

Safety is paramount in autonomous driving, warranting rigorous validation and verification of AI systems. The paper critiques the current inadequacies of safety standards like ISO 26262 in encompassing the complexities and operational idiosyncrasies of deep learning systems. It stresses the need for new safety frameworks that address deep learning's unique failure modes and uncertainty factors.

Data Sources and Computational Infrastructure

Effective training of deep learning models necessitates extensive and high-quality datasets. The paper provides an overview of publicly available datasets such as KITTI, nuScenes, and others, elaborating on their attributes and utility in training perception and control algorithms. It also discusses the computational hardware required for deploying these models, highlighting solutions from NVIDIA and Renesas suited for real-time inference in automotive applications.

Conclusion

The paper concludes by identifying key challenges and future directions for deep learning in autonomous driving, emphasizing the need for robust perception systems, effective path planning under uncertainty, scalable training data acquisition, and real-time computational capabilities. It underscores the evolving landscape of AI in autonomous vehicles and anticipates significant advancements driven by integrated deep learning techniques.

By providing a thorough survey of current methodologies and their applications, the paper serves as a valuable resource for researchers and practitioners in the field of autonomous driving, guiding future developments and innovations in this rapidly advancing domain.

PDF Markdown Bookmark Chat (Pro)

Authors (4)

Sorin Grigorescu (16 papers)
Bogdan Trasnea (10 papers)
Tiberiu Cocias (4 papers)
Gigel Macesanu (3 papers)

Citations (1,268)

View on Semantic Scholar