
AutoSoccerPose: Automated 3D Posture Analysis of Soccer Shot Movements (2405.12070v1)

Published 20 May 2024 in cs.CV and cs.AI

Abstract: Image understanding is a foundational task in computer vision, with recent applications emerging in soccer posture analysis. However, existing publicly available datasets lack comprehensive information, notably posture sequences and 2D pose annotations. Moreover, current analysis models often rely on interpretable linear models (e.g., PCA and regression), limiting their capacity to capture non-linear spatiotemporal relationships in complex and diverse scenarios. To address these gaps, we introduce the 3D Shot Posture (3DSP) dataset, built from soccer broadcast videos, which is, to our knowledge, the most extensive sports image dataset with 2D pose annotations. Additionally, we present the 3DSP-GRAE (Graph Recurrent AutoEncoder) model, a non-linear approach for embedding pose sequences. Furthermore, we propose AutoSoccerPose, a pipeline aimed at semi-automating 2D and 3D pose estimation and posture analysis. While achieving full automation proved challenging, we provide a foundational baseline that extends the pipeline's utility beyond the scope of annotated data. We validate AutoSoccerPose on the SoccerNet and 3DSP datasets, and present posture analysis results based on 3DSP. The dataset, code, and models are available at: https://github.com/calvinyeungck/3D-Shot-Posture-Dataset.
