Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Gait Recognition from Highly Compressed Videos (2404.12183v1)

Published 18 Apr 2024 in cs.CV

Abstract: Surveillance footage represents a valuable resource and opportunities for conducting gait analysis. However, the typical low quality and high noise levels in such footage can severely impact the accuracy of pose estimation algorithms, which are foundational for reliable gait analysis. Existing literature suggests a direct correlation between the efficacy of pose estimation and the subsequent gait analysis results. A common mitigation strategy involves fine-tuning pose estimation models on noisy data to improve robustness. However, this approach may degrade the downstream model's performance on the original high-quality data, leading to a trade-off that is undesirable in practice. We propose a processing pipeline that incorporates a task-targeted artifact correction model specifically designed to pre-process and enhance surveillance footage before pose estimation. Our artifact correction model is optimized to work alongside a state-of-the-art pose estimation network, HRNet, without requiring repeated fine-tuning of the pose estimation model. Furthermore, we propose a simple and robust method for obtaining low quality videos that are annotated with poses in an automatic manner with the purpose of training the artifact correction model. We systematically evaluate the performance of our artifact correction model against a range of noisy surveillance data and demonstrate that our approach not only achieves improved pose estimation on low-quality surveillance footage, but also preserves the integrity of the pose estimation on high resolution footage. Our experiments show a clear enhancement in gait analysis performance, supporting the viability of the proposed method as a superior alternative to direct fine-tuning strategies. Our contributions pave the way for more reliable gait analysis using surveillance data in real-world applications, regardless of data quality.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (24)
  1. 2d human pose estimation: New benchmark and state of the art analysis. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.
  2. Gaitpt: Skeletons are all you need for gait recognition. arXiv preprint arXiv:2308.10623, 2023.
  3. The paradox of motion: Evidence for spurious correlations in skeleton-based gait recognition models. arXiv preprint arXiv:2402.08320, 2024.
  4. Gaitset: Cross-view gait recognition through utilizing gait as a deep set. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–1, 2021.
  5. A. Cosma and E. Radoi. Learning gait representations with noisy multi-task learning. Sensors, 22(18), 2022.
  6. A. Cosma and E. Radoi. Psymo: A dataset for estimating self-reported psychological traits from gait. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), January 2024.
  7. Analyzing and mitigating jpeg compression defects in deep learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 2357–2367, 2021.
  8. Gaitpart: Temporal part-based model for gait recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 14225–14233, 2020.
  9. Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
  10. R. M. French. Catastrophic forgetting in connectionist networks. Trends in cognitive sciences, 3(4):128–135, 1999.
  11. Gpgait: Generalized pose-based gait recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 19595–19604, October 2023.
  12. Deep generative adversarial compression artifact removal. In Proceedings of the IEEE international conference on computer vision, pages 4826–4835, 2017.
  13. Towards flexible blind jpeg artifacts removal. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 4997–5006, 2021.
  14. Distributed learning and inference with compressed images. IEEE transactions on image processing, 30:3069–3083, 2021.
  15. Microsoft COCO: common objects in context. CoRR, abs/1405.0312, 2014.
  16. Microsoft coco: Common objects in context, 2015.
  17. K. N. A. K. Nihal and A. R. M. Shanavas. A novel approach for compressing surveillance system videos. International Journal of Advanced Engineering, Management and Science, 2(2), 2 2016.
  18. Impact of video compression on the performance of object detection systems for surveillance applications, 2022.
  19. I. E. Richardson. The H.264 Advanced Video Compression Standard. Wiley Publishing, 2nd edition, 2010.
  20. Deep high-resolution representation learning for human pose estimation. In CVPR, 2019.
  21. Towards a deeper understanding of skeleton-based gait recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1569–1577, 2022.
  22. ViTPose: Simple vision transformer baselines for human pose estimation. In Advances in Neural Information Processing Systems, 2022.
  23. A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In 18th International Conference on Pattern Recognition (ICPR’06), volume 4, pages 441–444, 2006.
  24. Pose2seg: Detection free human instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 889–898, 2019.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets