Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Advancements in Repetitive Action Counting: Joint-Based PoseRAC Model With Improved Performance (2308.08632v2)

Published 15 Aug 2023 in cs.CV

Abstract: Repetitive counting (RepCount) is critical in various applications, such as fitness tracking and rehabilitation. Previous methods have relied on the estimation of red-green-and-blue (RGB) frames and body pose landmarks to identify the number of action repetitions, but these methods suffer from a number of issues, including the inability to stably handle changes in camera viewpoints, over-counting, under-counting, difficulty in distinguishing between sub-actions, inaccuracy in recognizing salient poses, etc. In this paper, based on the work done by [1], we integrate joint angles with body pose landmarks to address these challenges and achieve better results than the state-of-the-art RepCount methods, with a Mean Absolute Error (MAE) of 0.211 and an Off-By-One (OBO) counting accuracy of 0.599 on the RepCount data set [2]. Comprehensive experimental results demonstrate the effectiveness and robustness of our method.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. Poserac: Pose saliency transformer for repetitive action counting. arXiv preprint arXiv:2303.08450, 2023.
  2. Transrac: Encoding multi-scale temporal correlation with transformers for repetitive action counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19013–19022, 2022.
  3. Generic temporal segmentation of cyclic human motion. Pattern Recognition, 41(1):6–21, 2008.
  4. Real-time human-computer interaction using eye gazes. Manufacturing Letters, 35:883–894, 2023.
  5. Durable nanocomposite face masks with high particulate filtration and rapid inactivation of coronaviruses. Scientific reports, 11(1):24318, 2021.
  6. Extraction and analysis of multiple periodic motions in video sequences. IEEE transactions on pattern analysis and machine intelligence, 29(7):1244–1261, 2007.
  7. Filtration performances of non-medical materials as candidates for manufacturing facemasks and respirators. International journal of hygiene and environmental health, 229:113582, 2020.
  8. Design of a robotic rehabilitation system for mild cognitive impairment based on computer vision. Journal of Engineering and Science in Medical Diagnostics and Therapy, 3(2):021108, 2020.
  9. Consumer-based wearable activity trackers increase physical activity participation: systematic review and meta-analysis. JMIR mHealth and uHealth, 7(4):e11819, 2019.
  10. Bill Foran. High-performance sports conditioning. Human kinetics, 2001.
  11. Factors influencing the filtration performance of homemade face masks. Journal of Occupational and Environmental Hygiene, 18(3):128–138, 2021.
  12. Milift: Efficient smartwatch-based workout tracking using automatic segmentation. IEEE Transactions on Mobile Computing, 17(7):1609–1622, 2017.
  13. Real-time multi-modal human–robot collaboration using gestures and speech. Journal of Manufacturing Science and Engineering, 144(10):101007, 2022.
  14. Counting out time: Class agnostic video repetition counting in the wild. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10387–10396, 2020.
  15. Context-aware and scale-insensitive temporal repetition counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 670–678, 2020.
  16. Blazepose: On-device real-time body pose tracking. arXiv preprint arXiv:2006.10204, 2020.
  17. Fine-grained activity classification in assembly based on multi-visual modalities. Journal of Intelligent Manufacturing, pages 1–19, 2023.
  18. Video swin transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 3202–3211, 2022.
  19. Towards perspective-free object counting with deep learning. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part VII 14, pages 615–629. Springer, 2016.
  20. G Sreenu and Saleem Durai. Intelligent video surveillance: a review through deep learning techniques for crowd analysis. Journal of Big Data, 6(1):1–27, 2019.
  21. Improving action segmentation via graph-based temporal reasoning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 14024–14034, 2020.
Citations (3)

Summary

We haven't generated a summary for this paper yet.