UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking (1511.04136v4)

Published 13 Nov 2015 in cs.CV

Abstract: In recent years, numerous effective multi-object tracking (MOT) methods are developed because of the wide range of applications. Existing performance evaluations of MOT methods usually separate the object tracking step from the object detection step by using the same fixed object detection results for comparisons. In this work, we perform a comprehensive quantitative study on the effects of object detection accuracy to the overall MOT performance, using the new large-scale University at Albany DETection and tRACking (UA-DETRAC) benchmark dataset. The UA-DETRAC benchmark dataset consists of 100 challenging video sequences captured from real-world traffic scenes (over 140,000 frames with rich annotations, including occlusion, weather, vehicle category, truncation, and vehicle bounding boxes) for object detection, object tracking and MOT system. We evaluate complete MOT systems constructed from combinations of state-of-the-art object detection and object tracking methods. Our analysis shows the complex effects of object detection accuracy on MOT system performance. Based on these observations, we propose new evaluation tools and metrics for MOT systems that consider both object detection and object tracking for comprehensive analysis.

Citations (525)

View on Semantic Scholar

Summary

The paper introduces a novel UA-DETRAC benchmark that advances the evaluation of multi-object detection and tracking through a rich, vehicle-focused dataset.
It presents a detailed evaluation protocol featuring the PR-MOTA metric to integrate detection precision and tracking accuracy across varying conditions.
Empirical results highlight that deep learning detectors outperform classical methods, while challenges persist in scenarios with severe occlusion and adverse lighting.

UA-DETRAC: A Benchmark for Multi-Object Detection and Tracking

The paper "UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking" presents a comprehensive dataset and evaluation protocol aimed at advancing the research in Multi-Object Tracking (MOT) and detection systems, particularly in the context of real-world vehicle tracking. This dataset, UA-DETRAC, is thoroughly annotated and designed for evaluating the performance of various state-of-the-art detection and tracking algorithms. With over 140,000 frames encompassing diverse vehicle types and challenging conditions, this benchmark offers a robust foundation for empirical studies in both detection and MOT tasks.

Dataset Overview

The UA-DETRAC dataset includes 100 videos captured with rich annotations covering aspects such as illumination, occlusion, and vehicle types. This dataset is divided into training and testing sets containing about 60% and 40% of the videos respectively, ensuring that the sequences used for training and evaluation are shot at different locations. A salient feature of this dataset is its detailed annotation of vehicle types and conditions, providing researchers with a broad spectrum of challenges that mimic real-world conditions, including varied levels of occlusion and different lighting conditions like cloudy, sunny, rainy, and night scenes.

Evaluation Protocols

The paper proposes a new evaluation protocol for object detection that captures the comprehensive effects of detection accuracy on overall MOT system performance through PR curves. A significant contribution is the introduction of the PR-MOTA metric, which integrates tracking accuracy over the precision-recall range for different detectors, providing a more nuanced understanding of a tracker's effectiveness when coupled with various detectors.

Empirical Evaluation

Numerous state-of-the-art object detection algorithms, such as DPM, ACF, R-CNN, CompACT, and Faster R-CNN, are evaluated using the UA-DETRAC dataset. The analysis reveals that advancements in detection methods, especially those leveraging deep learning frameworks, significantly outperform older, more classical approaches in terms of AP scores. However, challenges remain, particularly in scenarios involving heavy occlusions or adverse lighting conditions.

As for the evaluation of MOT algorithms, the paper contrasts the performance of ten multi-object tracking algorithms using combinations with the aforementioned detectors. The tracking-by-detection paradigm is reinforced as the dominant framework, though the evaluation highlights its sensitivity to the quality of detections provided—a key reason behind the introduction of the PR-MOTA metric.

Implications and Future Directions

The UA-DETRAC benchmark not only fills the gap for a comprehensive, vehicle-focused dataset but also sets a standard for evaluating MOT methods by accounting for the intrinsic link between detection quality and tracking performance. The paper’s emphasis on practical application scenarios, such as traffic monitoring, underscores the real-world relevance of the benchmark.

This work suggests several avenues for future research, including the development of joint detection and tracking frameworks that can simultaneously enhance both modules' performance. Furthermore, enhancements in feature learning and computational efficiency are imperative as the field progresses towards real-time applications.

In summary, this paper presents a well-structured benchmark complete with an innovative evaluation protocol that positions UA-DETRAC as a pivotal resource for researchers in computer vision, particularly those focused on improving MOT and object detection systems in dynamic and challenging environments.

PDF Markdown