Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving (1904.04620v2)

Published 9 Apr 2019 in cs.CV

Abstract: The use of object detection algorithms is becoming increasingly important in autonomous vehicles, and object detection at high accuracy and a fast inference speed is essential for safe autonomous driving. A false positive (FP) from a false localization during autonomous driving can lead to fatal accidents and hinder safe and efficient driving. Therefore, a detection algorithm that can cope with mislocalizations is required in autonomous driving applications. This paper proposes a method for improving the detection accuracy while supporting a real-time operation by modeling the bounding box (bbox) of YOLOv3, which is the most representative of one-stage detectors, with a Gaussian parameter and redesigning the loss function. In addition, this paper proposes a method for predicting the localization uncertainty that indicates the reliability of bbox. By using the predicted localization uncertainty during the detection process, the proposed schemes can significantly reduce the FP and increase the true positive (TP), thereby improving the accuracy. Compared to a conventional YOLOv3, the proposed algorithm, Gaussian YOLOv3, improves the mean average precision (mAP) by 3.09 and 3.5 on the KITTI and Berkeley deep drive (BDD) datasets, respectively. Nevertheless, the proposed algorithm is capable of real-time detection at faster than 42 frames per second (fps) and shows a higher accuracy than previous approaches with a similar fps. Therefore, the proposed algorithm is the most suitable for autonomous driving applications.

Authors (4)

Jiwoong Choi (7 papers)
Dayoung Chun (1 paper)
Hyun Kim (17 papers)
Hyuk-Jae Lee (14 papers)

Citations (376)

View on Semantic Scholar

Summary

Analysis of Gaussian YOLOv3: A Robust Object Detection Algorithm for Autonomous Driving

The paper "Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving" presents an enhanced version of the YOLOv3 object detection architecture, specifically designed to meet the challenges of autonomous driving. The authors propose a novel approach by incorporating Gaussian modeling to improve localization accuracy while maintaining high inference speeds essential for real-time applications.

Key Contributions

The Gaussian YOLOv3 introduces several innovations over the standard YOLOv3 architecture:

Gaussian Modeling of Bounding Boxes: The authors replace deterministic bounding box predictions with Gaussian distributions characterized by mean and variance parameters. This approach allows the model to predict not only object locations but also the uncertainty associated with these predictions, which is crucial for reducing false positives in safety-critical applications like autonomous driving.
Redesigned Loss Function: The paper adopts a negative log likelihood loss for training the Gaussian bounding box parameters. This change not only accommodates probabilistic predictions but also provides robustness against noisy training data through a loss attenuation mechanism.
Utilization of Localization Uncertainty: By incorporating the predicted localization uncertainty into the detection criteria, the model effectively reduces false positives and increases true positives. This utilization significantly enhances the reliability of the detections, aligning it closely with the requirements of autonomous driving systems.

Strong Numerical Results

The Gaussian YOLOv3 demonstrates a substantial improvement over the conventional YOLOv3, as evidenced by the reported results:

An increase in the mean average precision (mAP) of 3.09 on the KITTI dataset and 3.5 on the Berkeley Deep Drive (BDD) dataset.
A real-time detection speed exceeding 42 frames per second (fps) with a 512 × 512 input resolution, ensuring suitability for autonomous driving scenarios.
The algorithm also achieves a reduction in false positives by over 40% on both datasets and increases true positives by 7.26% (KITTI) and 4.3% (BDD), which is critical in preventing erroneous control decisions in self-driving applications.

Practical and Theoretical Implications

The improvements offered by Gaussian YOLOv3 facilitate safer and more efficient autonomous driving. By addressing false localizations, the algorithm reduces the risk of incorrect responses to environmental stimuli, such as unnecessary braking due to false object detection. The introduction of uncertainty predictions into the decision-making process enhances robustness and reliability, which are pivotal for deployment in real-world scenarios.

From a theoretical standpoint, the integration of Gaussian models within a one-stage object detector like YOLOv3 opens avenues for further research into uncertainty-aware deep learning models. Such approaches can be generalized beyond autonomous driving, potentially impacting other domains requiring real-time and reliable object detection.

Speculations on Future Developments

The methods introduced in Gaussian YOLOv3 suggest several directions for future exploration:

Extending the uncertainty modeling approach to encompass classification predictions, potentially increasing the robustness of scene understanding in autonomous vehicles.
Investigating the impact of different types of uncertainty (e.g., epistemic versus aleatoric) on detection performance in diverse driving conditions.
Exploring combinations of Gaussian YOLOv3 with sensor fusion techniques (e.g., Lidar and Radar) to further enhance detection accuracy and reliability in challenging environments.

Overall, the proposed Gaussian YOLOv3 represents a significant step toward more accurate and reliable object detection systems for autonomous vehicles, contributing to the advancement of safer self-driving technologies.

PDF Markdown

Related Papers

Find Related Papers