
AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization (2003.12949v1)

Published 29 Mar 2020 in cs.CV

Abstract: Most existing trackers based on discriminative correlation filters (DCF) try to introduce predefined regularization term to improve the learning of target objects, e.g., by suppressing background learning or by restricting change rate of correlation filters. However, predefined parameters introduce much effort in tuning them and they still fail to adapt to new situations that the designer did not think of. In this work, a novel approach is proposed to online automatically and adaptively learn spatio-temporal regularization term. Spatially local response map variation is introduced as spatial regularization to make DCF focus on the learning of trust-worthy parts of the object, and global response map variation determines the updating rate of the filter. Extensive experiments on four UAV benchmarks have proven the superiority of our method compared to the state-of-the-art CPU- and GPU-based trackers, with a speed of ~60 frames per second running on a single CPU. Our tracker is additionally proposed to be applied in UAV localization. Considerable tests in the indoor practical scenarios have proven the effectiveness and versatility of our localization method. The code is available at https://github.com/vision4robotics/AutoTrack.

Citations (276)

Summary

  • The paper introduces an adaptive spatio-temporal regularization method for DCF-based tracking, eliminating the need for manual hyper-parameter tuning.
  • It achieves high efficiency at around 60 fps on a single CPU, outperforming many state-of-the-art trackers in UAV applications.
  • The approach enhances UAV localization in GPS-denied settings by robustly tracking objects amidst rapid motion, occlusions, and illumination changes.

Overview of "AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization"

The paper "AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization" introduces a novel approach to visual tracking tailored for Unmanned Aerial Vehicle (UAV) applications. The methodology developed focuses on enhancing the performance of Discriminative Correlation Filter (DCF)-based trackers by introducing automatic and adaptive spatio-temporal regularization. This innovation sets AutoTrack apart by eliminating the need for manually tuned hyper-parameters, which often remain static and fall short in adapting to dynamic real-world conditions.

Key Contributions

AutoTrack's major contribution lies in its introduction of a spatio-temporal regularization framework that utilizes both local and global response map variations. This dual focus allows the model to:

  1. Adaptively and automatically determine hyper-parameters for the spatio-temporal regularization, which traditionally require manual tuning. The local response variations guide spatial regularization by identifying credible segments of the object, while global variations direct the learning rate of the filter to improve the adaptability and robustness of the tracker.
  2. Maintain high efficiency with a speed of approximately 60 frames per second on a single CPU, thus outperforming many state-of-the-art trackers that are based on both CPU and GPU platforms. This is crucial for on-board UAV scenarios where computational resources are limited.
  3. Demonstrate practical applicability by deploying AutoTrack in UAV localization systems within GPS-denied environments. The work shows that AutoTrack can surpass infrared LED-based systems in versatility across diverse scenarios, since it can track arbitrary objects defined in the first frame.
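The core mechanism can be illustrated with a short sketch. The paper defines its own formulas for mapping response-map variation to regularization weights and learning rate; the function below is a simplified, hypothetical rendering of the idea (the function name, the `delta` and `theta_init` parameters, and the specific mappings are illustrative assumptions, not the paper's exact formulation):

```python
import numpy as np

def spatiotemporal_weights(resp_prev, resp_curr, delta=0.2, theta_init=0.02):
    """Illustrative sketch of AutoTrack-style adaptive regularization.

    resp_prev, resp_curr: correlation response maps from consecutive frames
    (assumed already aligned to the target's position).
    Returns a per-pixel spatial regularization weight and a global learning rate.
    NOTE: delta and theta_init are hypothetical tuning constants, and these
    mappings only approximate the idea described in the paper.
    """
    # Local variation: element-wise relative change between aligned response maps.
    local_var = np.abs(resp_curr - resp_prev) / (np.abs(resp_prev) + 1e-8)

    # Spatial weight: penalize filter coefficients more where the response
    # fluctuates, so learning concentrates on trustworthy parts of the object.
    spatial_weight = delta * np.log1p(local_var)

    # Global variation: aggregate change over the whole map, normalized by size.
    global_var = np.linalg.norm(resp_curr - resp_prev) / resp_prev.size

    # Learning rate shrinks when the response changes abruptly (e.g., occlusion
    # or severe UAV motion), restricting how fast the filter updates.
    learning_rate = theta_init / (1.0 + global_var)

    return spatial_weight, learning_rate
```

With identical consecutive responses the spatial weights vanish and the learning rate stays at its base value; a large frame-to-frame change raises the spatial penalty and throttles the filter update, which is the qualitative behavior the paper attributes to its local and global variation terms.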

Experimental Evaluation

The paper evaluates AutoTrack across several UAV-specific datasets, namely DTB70, UAVDT, UAV123@10fps, and VisDrone2018-test-dev. Through rigorous testing, AutoTrack consistently shows superior performance in terms of precision and speed compared to existing state-of-the-art tracking methods, including deep learning-based approaches. Notably, the method demonstrates robustness in scenarios characterized by severe UAV motion, illumination variations, occlusions, and rapid background changes.

Implications and Future Directions

From a theoretical standpoint, the ability to adaptively regulate the parameters of the regularization term highlights the potential of AutoTrack to serve as a foundation for future tracking algorithms. Practically, its integration into UAV systems marks a significant stride toward autonomous navigation, particularly in environments where traditional localization solutions may falter.

The research opens several avenues for future inquiries, including further exploration of adaptive learning rates and response-based feedback mechanisms in other computer vision tasks. Additionally, extending the framework to incorporate deeper neural architectures could harmonize the robustness of deep models with the high efficiency of DCF trackers, offering a comprehensive solution for real-time applications.

Overall, the AutoTrack approach exemplifies how meticulous optimization of regularization parameters can substantially improve tracking performance in resource-constrained environments, marking a crucial development in UAV-based object tracking.