
UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation (2312.08952v2)

Published 14 Dec 2023 in cs.CV

Abstract: Multi-object tracking (MOT) in video sequences remains a challenging task, especially in scenarios with significant camera movements. This is because targets can drift considerably on the image plane, leading to erroneous tracking outcomes. Addressing such challenges typically requires supplementary appearance cues or Camera Motion Compensation (CMC). While these strategies are effective, they also introduce a considerable computational burden, posing challenges for real-time MOT. In response to this, we introduce UCMCTrack, a novel motion model-based tracker robust to camera movements. Unlike conventional CMC that computes compensation parameters frame-by-frame, UCMCTrack consistently applies the same compensation parameters throughout a video sequence. It employs a Kalman filter on the ground plane and introduces the Mapped Mahalanobis Distance (MMD) as an alternative to the traditional Intersection over Union (IoU) distance measure. By leveraging projected probability distributions on the ground plane, our approach efficiently captures motion patterns and adeptly manages uncertainties introduced by homography projections. Remarkably, UCMCTrack, relying solely on motion cues, achieves state-of-the-art performance across a variety of challenging datasets, including MOT17, MOT20, DanceTrack and KITTI. More details and code are available at https://github.com/corfyi/UCMCTrack


Summary

  • The paper introduces a novel method using uniform camera motion compensation on the ground plane, replacing traditional IoU with the Mapped Mahalanobis Distance.
  • The framework leverages a Kalman filter to achieve over 1000 FPS on a single CPU while delivering robust performance on datasets like MOT17, MOT20, DanceTrack, and KITTI.
  • The approach reduces computational overhead and improves resilience to camera-induced errors, paving the way for efficient real-time multi-object tracking.

Introduction to Multi-Object Tracking

Multi-Object Tracking (MOT) within video sequences is a complex challenge in computer vision, particularly in environments with significant camera motion. Traditional methods often employ additional appearance cues or Camera Motion Compensation (CMC) to address inaccuracies arising from camera movements. These methods, while effective, can substantially increase computational overhead, making real-time tracking more difficult.

A Novel Approach to Motion-Based Tracking

Enter UCMCTrack, a novel tracking framework designed to withstand camera movements. This motion-based tracker diverges from conventional methods by applying the same compensation parameters uniformly across an entire video sequence, rather than recomputing them frame by frame. UCMCTrack is built on a Kalman filter that operates on the ground plane rather than the image plane, and it introduces the Mapped Mahalanobis Distance (MMD) in place of the traditional Intersection over Union (IoU) measure. MMD captures ground-plane motion patterns and manages the uncertainties introduced by homography projections.
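The core of the MMD idea can be sketched as a standard Mahalanobis distance on the ground plane, where both the track's Kalman prediction and the detection (after projection through the homography) carry their own covariances. This is a minimal illustration, not the paper's exact formulation; the function name and interface are assumptions for the sketch.

```python
import numpy as np

def mapped_mahalanobis(track_mean, track_cov, det_xy, det_cov):
    """Squared Mahalanobis distance between a track's predicted
    ground-plane position and a detection mapped onto the ground plane.

    track_mean: (2,) predicted (x, y) on the ground plane
    track_cov:  (2, 2) covariance of the Kalman prediction
    det_xy:     (2,) detection projected to the ground plane
    det_cov:    (2, 2) detection noise propagated through the homography
    """
    residual = det_xy - track_mean
    S = track_cov + det_cov                  # combined uncertainty
    return float(residual @ np.linalg.inv(S) @ residual)
```

The key difference from a plain Euclidean gate is that `det_cov` grows for detections whose projection is more uncertain (e.g. far from the camera), so the distance automatically down-weights those matches.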

Advantages of Ground Plane Motion Modeling

Assessing motion patterns on the ground plane offers greater resilience to camera-induced errors. Where IoU can fail when detection and track boxes no longer overlap (particularly in dynamic scenes), ground-plane association minimizes the impact of camera movements. Shifting from the image plane to the ground plane both improves tracking accuracy and simplifies the association step.
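The mapping from image to ground plane can be illustrated with a homography applied to a box's bottom-center point (a common choice for anchoring a pedestrian box to its footprint). This is a hedged sketch; the function name and box convention are assumptions, not taken from the paper.

```python
import numpy as np

def bbox_to_ground(bbox, H):
    """Project a bounding box's bottom-center point onto the ground
    plane with a 3x3 homography H (image -> ground).

    bbox: (x1, y1, x2, y2) in pixels.
    """
    u = (bbox[0] + bbox[2]) / 2.0        # horizontal center of the box
    v = bbox[3]                          # bottom edge (the feet)
    p = H @ np.array([u, v, 1.0])        # homogeneous projection
    return p[:2] / p[2]                  # dehomogenize
```

Because the Kalman filter then tracks this ground-plane point, a camera pan that shifts every box by hundreds of pixels leaves the targets' physical positions, and hence the association, largely unchanged.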

UCMCTrack Performance and Contributions

UCMCTrack has demonstrated impressive efficiency and speed, surpassing 1000 frames per second (FPS) on a single CPU. It achieves state-of-the-art performance on multiple challenging datasets, including MOT17, MOT20, DanceTrack, and KITTI, using motion cues alone. The paper outlines three primary contributions: a non-IoU distance measure based purely on motion cues, a uniform application of camera motion compensation parameters that reduces computational load, and the UCMCTrack tracker itself, which can also complement existing distance metrics for improved MOT performance.

Experimentation and Results

Experiments conducted across various datasets corroborate the efficacy of UCMCTrack. The model's reliance on the ground plane and the innovative use of MMD have proven effective in various scenarios, including irregular target motion (DanceTrack) and intense camera motion (KITTI). The tracker performs well even in the presence of camera parameter estimation errors, highlighting its robustness.

A series of ablation studies affirm the importance of individual components within the UCMCTrack system. The tracker's adaptability is further demonstrated through its capacity to adjust to different scenes (dynamic vs. static) by altering process noise compensation factors. The results emphasize the potential advantages of pairing UCMCTrack with established MOT methodologies, setting the stage for future research.
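The scene-dependent tuning mentioned above can be pictured as scaling the Kalman filter's process noise: a larger compensation factor tells the filter to trust its motion model less in dynamic-camera scenes. The sketch below uses a standard constant-acceleration noise model for one ground-plane axis; the factor's name and exact role here are illustrative assumptions, not the paper's precise formulation.

```python
import numpy as np

def process_noise(dt, sigma_a, compensation):
    """Process noise covariance for one ground-plane axis with state
    (position, velocity), scaled by a scene-dependent compensation
    factor (larger for dynamic scenes, smaller for static ones).
    """
    G = np.array([[0.5 * dt**2], [dt]])  # maps acceleration noise to (pos, vel)
    Q = G @ G.T * sigma_a**2             # constant-acceleration noise model
    return compensation * Q
```

Raising the factor widens the predicted covariance each step, which in turn loosens the Mahalanobis gate, a plausible mechanism for absorbing unmodeled camera motion.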

UCMCTrack showcases a new frontier in motion-based multi-object tracking, addressing camera motion challenges efficiently. This development could have far-reaching implications for real-time applications requiring fast and accurate object tracking in video footage.