Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data (2402.13172v4)

Published 20 Feb 2024 in cs.CV

Abstract: Accurate 3D kinematics estimation of human body is crucial in various applications for human health and mobility, such as rehabilitation, injury prevention, and diagnosis, as it helps to understand the biomechanical loading experienced during movement. Conventional marker-based motion capture is expensive in terms of financial investment, time, and the expertise required. Moreover, due to the scarcity of datasets with accurate annotations, existing markerless motion capture methods suffer from challenges including unreliable 2D keypoint detection, limited anatomic accuracy, and low generalization capability. In this work, we propose a novel biomechanics-aware network that directly outputs 3D kinematics from two input views with consideration of biomechanical prior and spatio-temporal information. To train the model, we create synthetic dataset ODAH with accurate kinematics annotations generated by aligning the body mesh from the SMPL-X model and a full-body OpenSim skeletal model. Our extensive experiments demonstrate that the proposed approach, only trained on synthetic data, outperforms previous state-of-the-art methods when evaluated across multiple datasets, revealing a promising direction for enhancing video-based human motion capture

Definition Search Book Streamline Icon: https://streamlinehq.com
References (37)
  1. Towards single camera human 3d-kinematics. Sensors, 23(1):341, 2022.
  2. Openpose: Realtime multi-person 2d pose estimation using part affinity fields, 2019.
  3. Anatomy-aware 3d human pose estimation with bone-based pose decomposition. IEEE Transactions on Circuits and Systems for Video Technology, 32(1):198–209, 2021.
  4. Muscles in action. In ICCV, 2023.
  5. Learned vertex descent: a new direction for 3d human model fitting. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part II, pages 146–165. Springer, 2022.
  6. Opensim: open-source software to create and analyze dynamic simulations of movement. IEEE transactions on biomedical engineering, 54(11):1940–1950, 2007.
  7. Comparison of marker-less and marker-based motion capture for baseball pitching kinematics. Sports Biomechanics, pages 1–10, 2022.
  8. Movi: A large multi-purpose human motion and video dataset. Plos one, 16(6):e0253157, 2021.
  9. John C Gower. Generalized procrustes analysis. Psychometrika, 40:33–51, 1975.
  10. Yolo by ultralytics.
  11. Osso: Obtaining skeletal shape from outside. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20492–20501, 2022.
  12. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  13. Lifting transformer for 3d human pose estimation in video. arXiv preprint arXiv:2103.14304, 2, 2021.
  14. Exploiting temporal contexts with strided transformer for 3d human pose estimation. IEEE Transactions on Multimedia, 2022.
  15. Mhformer: Multi-hypothesis transformer for 3d human pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13147–13156, 2022.
  16. Attention mechanism exploits temporal contexts: Real-time 3d human pose reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5064–5073, 2020.
  17. AMASS: Archive of motion capture as surface shapes. In International Conference on Computer Vision, pages 5442–5451, Oct. 2019.
  18. The evolution of methods for the capture of human movement leading to markerless motion capture for biomechanical applications. Journal of neuroengineering and rehabilitation, 3(1):1–11, 2006.
  19. Stacked hourglass networks for human pose estimation. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14, pages 483–499. Springer, 2016.
  20. Deep learning-based estimation of whole-body kinematics from multi-view images. Computer Vision and Image Understanding, page 103780, 2023.
  21. Pose2sim: An open-source python package for multiview markerless kinematics. Journal of Open Source Software, 2022.
  22. Expressive body capture: 3D hands, face, and body from a single image. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 10975–10985, 2019.
  23. 3d human pose estimation in video with temporal convolutions and semi-supervised training. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 7753–7762, 2019.
  24. Full-body musculoskeletal model for muscle-driven simulation of human gait. IEEE transactions on biomedical engineering, 63(10):2068–2079, 2016.
  25. Opensim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement. PLoS computational biology, 14(7):e1006223, 2018.
  26. Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International journal of computer vision, 87(1-2):4, 2010.
  27. Markerless motion capture estimates of lower extremity kinematics and kinetics are comparable to marker-based across 8 movements. Journal of Biomechanics, 157:111751, 2023.
  28. Weakly supervised 3d hand pose estimation via biomechanical constraints. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVII 16, pages 211–228. Springer, 2020.
  29. Opencap: 3d human movement dynamics from smartphone videos. bioRxiv, pages 2022–07, 2022.
  30. Eline Van der Kruk and Marco M Reijne. Accuracy of human motion capture systems for sport applications; state-of-the-art review. European journal of sport science, 18(6):806–819, 2018.
  31. Graph-based 3d multi-person pose estimation using multi-view images. In Proceedings of the IEEE/CVF international conference on computer vision, pages 11148–11157, 2021.
  32. Deep kinematics analysis for monocular 3d human pose estimation. In Proceedings of the IEEE/CVF Conference on computer vision and Pattern recognition, pages 899–908, 2020.
  33. In vivo measurement of 3-d skeletal kinematics from sequences of biplane radiographs: Application to knee kinematics. IEEE Transactions on Medical Imaging, 20(6):514–525, 2001.
  34. Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5728–5739, 2022.
  35. Direct multi-view multi-person 3d pose estimation. Advances in Neural Information Processing Systems, 34:13153–13164, 2021.
  36. Mixste: Seq2seq mixed spatio-temporal encoder for 3d human pose estimation in video. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13232–13242, 2022.
  37. 3d human pose estimation with spatial and temporal transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11656–11665, 2021.

Summary

We haven't generated a summary for this paper yet.