Exploration of Automated Augmentation Techniques for Traffic Sign Detection
The paper "Automated Augmentation with Reinforcement Learning and GANs for Robust Identification of Traffic Signs using Front Camera Images" presents an innovative approach to enhance the training datasets for traffic sign detection systems within the domain of autonomous vehicles. Given the critical importance of accurate traffic sign recognition for autonomous driving and navigation, this paper's focus on using machine learning methodologies, specifically reinforcement learning (RL) and Generative Adversarial Networks (GANs), to amplify data augmentation appears to be well justified.
Familiar challenges in traffic sign recognition, such as image distortions caused by poor lighting, blurriness, and vandalism, underscore the need for diverse and sufficiently large training datasets. The authors emphasize that collecting and annotating extensive datasets across multiple domains is costly and labor-intensive. The paper therefore proposes an automated augmentation framework that combines reinforcement learning policies with GAN models to extend existing traffic sign datasets, mapping training data across domains (specifically from daylight to nighttime scenes) and thereby improving the detection system's precision and recall.
A major contribution of the paper is the use of a reinforcement learning model alongside several GAN variants, notably a Bounding Box GAN (BBGAN). These models generate day-to-night translations while ensuring that the Regions of Interest (ROIs) containing traffic signs retain their critical features. Such preservation is vital for classification tasks that demand high precision across variations in lighting and occlusion. The framework is evaluated on the LISA Traffic Sign and BDD-Nexar datasets and improves classification performance from a precision/recall of 0.70/0.66 to 0.83/0.71 under challenging nighttime conditions.
The manuscript also discusses specific GAN implementations, including CycleGAN, StyleGAN, and the proposed BBGAN, focusing on their capacity for domain transfer without paired image datasets. One notable observation is the performance limitation of standard CycleGAN: its overly dark transformations can obscure traffic sign content. The BBGAN, by contrast, suppresses transformations around the traffic sign regions, preserving sign integrity even as lighting conditions change.
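The paper's exact BBGAN objective is not reproduced here, but its core idea, constraining the translation inside the sign bounding boxes, can be sketched as an extra loss term. The PyTorch snippet below is a minimal illustration assuming a CycleGAN-style setup with an LSGAN adversarial term; the function names (`roi_mask`, `bbgan_generator_loss`) and the loss weights are hypothetical, not taken from the paper.

```python
import torch
import torch.nn.functional as F

def roi_mask(shape, boxes):
    """Binary mask that is 1 inside each sign bounding box, 0 elsewhere.

    shape: (N, C, H, W) of the image batch; boxes: one (x1, y1, x2, y2)
    pixel box per image (a single sign per image, for simplicity).
    """
    n, _, h, w = shape
    mask = torch.zeros(n, 1, h, w)
    for i, (x1, y1, x2, y2) in enumerate(boxes):
        mask[i, :, y1:y2, x1:x2] = 1.0
    return mask

def bbgan_generator_loss(day, fake_night, rec_day, d_fake, boxes,
                         lambda_cyc=10.0, lambda_roi=5.0):
    """Illustrative generator objective: an LSGAN adversarial term, a
    CycleGAN cycle-consistency term, and an L1 penalty restricted to the
    sign ROIs so their content survives the day-to-night translation.
    Weights are placeholders, not the paper's values.
    """
    mask = roi_mask(day.shape, boxes).to(day.device)
    adv = F.mse_loss(d_fake, torch.ones_like(d_fake))   # fool the night discriminator
    cyc = F.l1_loss(rec_day, day)                       # day -> night -> day consistency
    roi = (mask * (fake_night - day).abs()).sum() \
          / (mask.sum() * day.shape[1]).clamp(min=1.0)  # mean L1 inside the boxes
    return adv + lambda_cyc * cyc + lambda_roi * roi
```

The ROI term pulls translated sign pixels back toward the source image while the adversarial and cycle terms are free to darken everything else, which matches the behavior the paper attributes to BBGAN.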
Reinforcement learning-based augmentation (RLAUG) complements this approach by searching for transformation policies, enabling alterations such as shear, color tuning, and occlusion effects that collectively improve the diversity and robustness of the training data. The combination of RLAUG and BBGAN notably outperformed either method alone by better distributing data samples across conditions, as evidenced by precision rising to 0.916 and recall to 0.913 on nighttime images.
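As a rough illustration of how such a learned policy is applied (RLAUG follows the AutoAugment line of work, where a policy is a list of operations with probabilities and magnitudes), the sketch below hard-codes one sub-policy using PIL. The operation set, probabilities, and magnitudes are placeholders, not the values learned in the paper.

```python
import random
from PIL import Image, ImageDraw, ImageEnhance

def shear_x(img, mag):
    """Horizontal shear by factor `mag` (affine transform)."""
    return img.transform(img.size, Image.AFFINE, (1, mag, 0, 0, 1, 0))

def color_tune(img, mag):
    """Scale color saturation by (1 + mag); mag < 0 desaturates."""
    return ImageEnhance.Color(img).enhance(1.0 + mag)

def occlude(img, mag):
    """Paste a gray square covering `mag` of the shorter side (assumes RGB)."""
    img = img.copy()
    w, h = img.size
    side = int(mag * min(w, h))
    x, y = random.randint(0, w - side), random.randint(0, h - side)
    ImageDraw.Draw(img).rectangle([x, y, x + side, y + side], fill=(128, 128, 128))
    return img

# One hypothetical sub-policy: (operation, probability, magnitude) triples.
# The RL controller searches over these values; the numbers here are
# illustrative, not the policy reported in the paper.
POLICY = [(shear_x, 0.6, 0.3), (color_tune, 0.8, -0.4), (occlude, 0.5, 0.2)]

def apply_policy(img, policy=POLICY):
    """Apply each operation with its probability and magnitude."""
    for op, prob, mag in policy:
        if random.random() < prob:
            img = op(img, mag)
    return img

# Usage: augmented = apply_policy(Image.open("sign.png").convert("RGB"))
```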
The implications of these findings are substantial for machine learning models used in autonomous driving. They highlight an efficient and scalable route to diversifying datasets without incurring excessive manual annotation costs. Furthermore, the authors suggest the automated augmenter has utility beyond traffic sign detection and could enhance other object recognition models within broader autonomous driving systems.
In conclusion, the paper contributes a practical data augmentation technique that balances data transformation against preservation of essential feature integrity using reinforcement learning and GANs. Future extensions could explore broader applications across diverse environmental conditions, such as varied weather, further pushing the envelope for AI-driven perception in complex real-world settings.