Advancements in Repetitive Action Counting: Joint-Based PoseRAC Model With Improved Performance (2308.08632v2)
Abstract: Repetitive action counting (RepCount) is critical in applications such as fitness tracking and rehabilitation. Previous methods have estimated the number of action repetitions from red-green-blue (RGB) frames and body pose landmarks, but they suffer from several issues: unstable handling of changes in camera viewpoint, over-counting, under-counting, difficulty in distinguishing between sub-actions, and inaccurate recognition of salient poses. In this paper, building on the work of [1], we integrate joint angles with body pose landmarks to address these challenges and achieve better results than state-of-the-art RepCount methods, with a Mean Absolute Error (MAE) of 0.211 and an Off-By-One (OBO) counting accuracy of 0.599 on the RepCount dataset [2]. Comprehensive experimental results demonstrate the effectiveness and robustness of our method.
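To make the quantities named in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a joint angle is the angle at a middle landmark formed by two neighbouring landmarks (for example shoulder, elbow, wrist), that MAE is the absolute count error normalised by the ground-truth count and averaged over videos, and that OBO is the fraction of videos whose predicted count is within one of the ground truth, as these metrics are commonly defined in the repetition-counting literature. The function names `joint_angle`, `mae`, and `obo` are hypothetical.

```python
import math


def joint_angle(a, b, c):
    """Angle in degrees at landmark b, formed by the segments b->a and b->c.

    a, b, c are coordinate tuples of equal length (2D or 3D).
    Hypothetical helper; the paper's exact angle definition may differ.
    """
    ba = [p - q for p, q in zip(a, b)]
    bc = [p - q for p, q in zip(c, b)]
    dot = sum(x * y for x, y in zip(ba, bc))
    norm = math.hypot(*ba) * math.hypot(*bc)
    if norm == 0:
        return 0.0  # degenerate case: two landmarks coincide
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_theta = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_theta))


def mae(pred_counts, true_counts):
    """Normalised mean absolute error (assumed definition).

    Each video contributes |pred - true| / true; true counts must be > 0.
    """
    errors = [abs(p - t) / t for p, t in zip(pred_counts, true_counts)]
    return sum(errors) / len(errors)


def obo(pred_counts, true_counts):
    """Off-by-one accuracy (assumed definition): share of videos with |pred - true| <= 1."""
    hits = [abs(p - t) <= 1 for p, t in zip(pred_counts, true_counts)]
    return sum(hits) / len(hits)


if __name__ == "__main__":
    # Elbow angle for a right-angle arm pose (made-up 2D landmark coordinates).
    print(joint_angle((1.0, 0.0), (0.0, 0.0), (0.0, 1.0)))
    # Made-up predicted and ground-truth counts for three videos.
    print(mae([10, 4, 8], [10, 5, 5]), obo([10, 4, 8], [10, 5, 5]))
```

Under these assumed definitions a lower MAE and a higher OBO are better, which is the sense in which the reported 0.211 and 0.599 should be read.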
- PoseRAC: Pose saliency transformer for repetitive action counting. arXiv preprint arXiv:2303.08450, 2023.
- TransRAC: Encoding multi-scale temporal correlation with transformers for repetitive action counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19013–19022, 2022.
- Generic temporal segmentation of cyclic human motion. Pattern Recognition, 41(1):6–21, 2008.
- Real-time human-computer interaction using eye gazes. Manufacturing Letters, 35:883–894, 2023.
- Durable nanocomposite face masks with high particulate filtration and rapid inactivation of coronaviruses. Scientific Reports, 11(1):24318, 2021.
- Extraction and analysis of multiple periodic motions in video sequences. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(7):1244–1261, 2007.
- Filtration performances of non-medical materials as candidates for manufacturing facemasks and respirators. International Journal of Hygiene and Environmental Health, 229:113582, 2020.
- Design of a robotic rehabilitation system for mild cognitive impairment based on computer vision. Journal of Engineering and Science in Medical Diagnostics and Therapy, 3(2):021108, 2020.
- Consumer-based wearable activity trackers increase physical activity participation: systematic review and meta-analysis. JMIR mHealth and uHealth, 7(4):e11819, 2019.
- Bill Foran. High-performance sports conditioning. Human Kinetics, 2001.
- Factors influencing the filtration performance of homemade face masks. Journal of Occupational and Environmental Hygiene, 18(3):128–138, 2021.
- MiLift: Efficient smartwatch-based workout tracking using automatic segmentation. IEEE Transactions on Mobile Computing, 17(7):1609–1622, 2017.
- Real-time multi-modal human–robot collaboration using gestures and speech. Journal of Manufacturing Science and Engineering, 144(10):101007, 2022.
- Counting out time: Class agnostic video repetition counting in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10387–10396, 2020.
- Context-aware and scale-insensitive temporal repetition counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 670–678, 2020.
- BlazePose: On-device real-time body pose tracking. arXiv preprint arXiv:2006.10204, 2020.
- Fine-grained activity classification in assembly based on multi-visual modalities. Journal of Intelligent Manufacturing, pages 1–19, 2023.
- Video Swin Transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3202–3211, 2022.
- Towards perspective-free object counting with deep learning. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part VII 14, pages 615–629. Springer, 2016.
- G Sreenu and Saleem Durai. Intelligent video surveillance: a review through deep learning techniques for crowd analysis. Journal of Big Data, 6(1):1–27, 2019.
- Improving action segmentation via graph-based temporal reasoning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14024–14034, 2020.