Confidence Trigger Detection: Accelerating Real-time Tracking-by-detection Systems (1902.00615v6)
Abstract: Real-time object tracking necessitates a delicate balance between speed and accuracy, a challenge exacerbated by the computational demands of deep learning methods. In this paper, we propose Confidence-Triggered Detection (CTD), an innovative approach that strategically bypasses object detection for frames closely resembling intermediate states, leveraging tracker confidence scores. CTD not only enhances tracking speed but also preserves accuracy, surpassing existing tracking algorithms. Through extensive evaluation across various tracker confidence thresholds, we identify an optimal trade-off between tracking speed and accuracy, providing crucial insights for parameter fine-tuning and enhancing CTD's practicality in real-world scenarios. Our experiments across diverse detection models underscore the robustness and versatility of the CTD framework, demonstrating its potential to enable real-time tracking in resource-constrained environments.
- Simple Online and Realtime Tracking. 2016.
- Towards automated visual surveillance using gait for identity recognition and tracking across multiple non-intersecting cameras. Multimedia Tools and Applications, 2016.
- Online multiperson tracking-by-detection from a single, uncalibrated camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011.
- Guobin Chang. Robust Kalman filtering based on Mahalanobis distance as outlier judging criterion. Journal of Geodesy, 2014.
- Weakly and semi-supervised deep level set network for automated skin lesion segmentation. In Innovation in Medicine and Healthcare: Proceedings of 8th KES-InMed 2020, pages 145–155. Springer, 2020.
- Deep multi-magnification similarity learning for histopathological image classification. IEEE Journal of Biomedical and Health Informatics, 27(3):1535–1545, 2023.
- Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2014.
- Software-hardware co-design of heterogeneous smartnic system for recommendation models inference and training. In Proceedings of the 37th International Conference on Supercomputing, pages 336–347, 2023.
- Deep learning for visual understanding: A review. Neurocomputing, 2016.
- Online multi-person tracking with two-stage data association and online appearance model learning. IET Computer Vision, 11, 2016.
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. 2017.
- On the design of quantum graph convolutional neural network in the nisq-era and beyond. In 2022 IEEE 40th International Conference on Computer Design (ICCD), pages 290–297, 2022.
- Toward user-specific tracking by detection of human shapes in multi-cameras. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2015.
- Analyzing entropy features in time-series data for pattern recognition in neurological conditions. Artificial Intelligence in Medicine, page 102821, 2024.
- Alexnet. Advances In Neural Information Processing Systems, 2012.
- Real-Time Digital Signal Processing: Fundamentals, Implementations and Applications. 2013.
- Adaptive ensembles of fine-tuned transformers for llm-generated text detection, 2024.
- MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv:1504.01942 [cs], 2015. arXiv: 1504.01942.
- Learning by Tracking: Siamese CNN for Robust Target Association. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2016.
- T3d: Towards 3d medical image understanding through vision-language pre-training. arXiv preprint arXiv:2312.01529, 2023.
- SSD: Single shot multibox detector. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9905 LNCS:21–37, 2016.
- Deep Learning for Emotion Recognition on Small Datasets using Transfer Learning. Proceedings of the 2015 ACM on International Conference on Multimodal Interaction - ICMI ’15, pages 443–449, 2015.
- Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.
- Karl Pearson. On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is Such that it Can be Reasonably Supposed to have Arisen from Random Sampling, pages 11–28. Springer New York, New York, NY, 1992.
- YOLOv3: An Incremental Improvement. 2018.
- Combining incremental conductance and firefly algorithm for tracking the global mpp of pv arrays. Journal of Renewable and Sustainable Energy, 9(2), 2017.
- Very Deep Convolutional Networks for Large-Scale Image Recognition. International Conference on Learning Representations (ICRL), 2015.
- Online multi-object tracking with gmphd filter and occlusion group management. IEEE Access, 7:165103–165121, 2019.
- Simple online and realtime tracking with a deep association metric. In 2017 IEEE International Conference on Image Processing (ICIP), pages 3645–3649. IEEE, 2017.
- Simple online and realtime tracking with a deep association metric. Proceedings - International Conference on Image Processing, ICIP, 2017-Septe:3645–3649, 2018.
- Learning to track: Online multi-object tracking by decision making. In Proceedings of the IEEE International Conference on Computer Vision, 2015.
- Parameter-efficient fine-tuning for pre-trained vision models: A survey. arXiv preprint arXiv:2402.02242, 2024.
- Online multi-object tracking using selective deep appearance matching. pages 206–212, 2018.
- Contextual image masking modeling via synergized contrasting without view augmentation for faster and better visual pretraining. In The Eleventh International Conference on Learning Representations, 2023a.
- Accuracy-constrained efficiency optimization and gpu profiling of cnn inference for detecting drainage crossing locations. In Proceedings of the SC’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, pages 1780–1788, 2023b.