Papers
Topics
Authors
Recent
Search
2000 character limit reached

MS-MANO: Enabling Hand Pose Tracking with Biomechanical Constraints

Published 16 Apr 2024 in cs.CV and cs.RO | (2404.10227v1)

Abstract: This work proposes a novel learning framework for visual hand dynamics analysis that takes into account the physiological aspects of hand motion. The existing models, which are simplified joint-actuated systems, often produce unnatural motions. To address this, we integrate a musculoskeletal system with a learnable parametric hand model, MANO, to create a new model, MS-MANO. This model emulates the dynamics of muscles and tendons to drive the skeletal system, imposing physiologically realistic constraints on the resulting torque trajectories. We further propose a simulation-in-the-loop pose refinement framework, BioPR, that refines the initial estimated pose through a multi-layer perceptron (MLP) network. Our evaluation of the accuracy of MS-MANO and the efficacy of the BioPR is conducted in two separate parts. The accuracy of MS-MANO is compared with MyoSuite, while the efficacy of BioPR is benchmarked against two large-scale public datasets and two recent state-of-the-art methods. The results demonstrate that our approach consistently improves the baseline methods both quantitatively and qualitatively.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (53)
  1. Pushing the envelope for rgb-based dense 3d hand pose estimation via neural rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1067–1076, 2019.
  2. Myosuite–a contact-rich simulation suite for musculoskeletal motor control. arXiv preprint arXiv:2205.13600, 2022.
  3. Weakly-supervised 3d hand pose estimation from monocular rgb images. In Proceedings of the European Conference on Computer Vision (ECCV), 2018.
  4. Exploiting spatial-temporal relationships for 3d pose estimation via graph convolutional networks. In Proceedings of the IEEE/CVF international conference on computer vision, pages 2272–2281, 2019.
  5. Dexycb: A benchmark for capturing hand grasping of objects. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9044–9053, 2021.
  6. gsdf: Geometry-driven signed distance functions for 3d hand-object reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12890–12900, 2023.
  7. Beyond static features for temporally consistent 3d human pose and shape from a video. In Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
  8. Arctic: A dataset for dexterous bimanual hand-object manipulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12943–12954, 2023.
  9. Demonstrating rfuniverse: A multiphysics simulation platform for embodied ai.
  10. Deformer: Dynamic fusion transformer for robust hand pose estimation. arXiv preprint arXiv:2303.04991, 2023.
  11. Flexible muscle-based locomotion for bipedal creatures. ACM Transactions on Graphics (TOG), 32(6):1–11, 2013.
  12. Leveraging photometric consistency over time for sparsely supervised hand-object reconstruction. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 571–580, 2020.
  13. Archibald Vivian Hill. The heat of shortening and the dynamic constants of muscle. Proceedings of the Royal Society of London. Series B-Biological Sciences, 1938.
  14. Synthesis of biologically realistic human motion using joint torque actuation. ACM Transactions On Graphics (TOG), 38(4):1–12, 2019.
  15. Vibe: Video inference for human body pose and shape estimation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5253–5263, 2020a.
  16. Vibe: Video inference for human body pose and shape estimation. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020b.
  17. Creating and retargetting motion by the musculoskeletal human body model. The visual computer, 16:254–270, 2000.
  18. Finger muscle attachments for an opensim upper-extremity model. PloS one, 10(4):e0121712, 2015.
  19. Dexterous manipulation and control with volumetric muscles. ACM Transactions on Graphics (TOG), 37(4):1–13, 2018.
  20. Scalable muscle-actuated human simulation and control. ACM Transactions On Graphics (TOG), 38(4):1–13, 2019.
  21. Heads up! biomechanical modeling and neuromuscular control of the neck. In ACM SIGGRAPH 2006 Papers, pages 1188–1198. 2006.
  22. Comprehensive biomechanical modeling and simulation of the upper body. ACM Transactions on Graphics (TOG), 28(4):1–17, 2009.
  23. Locomotion control for many-muscle humanoids. ACM Transactions on Graphics (TOG), 33(6):1–11, 2014.
  24. Mesh graphormer. In ICCV, 2021.
  25. Semi-supervised 3d hand-object poses estimation with interactions in time. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14687–14697, 2021.
  26. SMPL: A skinned multi-person linear model. ACM TOG, 34(6):248:1–248:16, 2015.
  27. Marching cubes: A high resolution 3d surface construction algorithm. ACM TOG, 21(4):163–169, 1987.
  28. Handtailor: Towards high-precision monocular 3d hand recovery. 2021.
  29. Neuromotion: Open-source simulator with neuromechanical and deep network models to generate surface emg signals during voluntary movement. 2023.
  30. Spatial dependency of glenohumeral joint stability during dynamic unimanual and bimanual pushing and pulling. Journal of biomechanical engineering, 141(5):051006, 2019.
  31. Animating human lower limbs using contact-invariant optimization. ACM Transactions on Graphics (TOG), 32(6):1–8, 2013.
  32. Handoccnet: Occlusion-robust 3d hand mesh estimation network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1496–1505, 2022.
  33. Expressive body capture: 3d hands, face, and body from a single image. In CVPR, 2019.
  34. Stable-baselines3: Reliable reinforcement learning implementations. Journal of Machine Learning Research, 22(268):1–8, 2021.
  35. Embodied hands: Modeling and capturing hands and bodies together. ACM Transactions on Graphics, (Proc. SIGGRAPH Asia), 2017.
  36. Benchmarking of dynamic simulation predictions in two software platforms using an upper limb musculoskeletal model. Computer methods in biomechanics and biomedical engineering, 18(13):1445–1458, 2015.
  37. Bash: Biomechanical animated skinned human for visualization of kinematics and muscle activity. In VISIGRAPP (1: GRAPP), pages 25–36, 2021.
  38. Proximal policy optimization algorithms. CoRR, abs/1707.06347, 2017.
  39. Opensim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement. PLoS computational biology, 14(7):e1006223, 2018.
  40. Realistic biomechanical simulation and control of human swimming. ACM Transactions on Graphics (TOG), 34(1):1–15, 2014.
  41. Musculotendon simulation for hand animation. In ACM SIGGRAPH 2008 papers, pages 1–8. 2008.
  42. Pose-ndf: Modeling human pose manifolds with neural distance fields. In European Conference on Computer Vision, pages 572–589. Springer, 2022.
  43. Helping hand: an anatomically accurate inverse dynamics solution for unconstrained hand motion. In Proceedings of the 2005 ACM SIGGRAPH/Eurographics symposium on Computer animation, pages 319–328, 2005.
  44. Optimizing locomotion controllers using biologically-based actuators and objectives. ACM Transactions on Graphics (TOG), 31(4):1–11, 2012.
  45. Seqhand: Rgb-sequence-based 3d hand pose and shape estimation. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII 16, pages 122–139. Springer, 2020a.
  46. Bihand: Recovering hand mesh with multi-stage bisected hourglass networks. In BMVC British Machine Vision Conference, 2020b.
  47. Cpf: Learning a contact potential field to model the hand-object interaction. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11097–11106, 2021.
  48. Artiboost: Boosting articulated 3d hand-object pose estimation via online exploration and synthesis. In CVPR IEEE Conference on Computer Vision and Pattern Recognition, pages 2750–2760, 2022a.
  49. Oakink: A large-scale knowledge repository for understanding hand-object interaction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20953–20962, 2022b.
  50. Learning a contact potential field for modeling the hand-object interaction. IEEE transactions on pattern analysis and machine intelligence, 2024.
  51. H2o: A benchmark for visual human-human object handover analysis. In ICCV IEEE/CVF International Conference on Computer Vision, pages 15762–15771, 2021.
  52. Rcare world: A human-centric simulation world for caregiving robots. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 33–40. IEEE, 2022.
  53. Toch: Spatio-temporal object-to-hand correspondence for motion refinement. In European Conference on Computer Vision, pages 1–19. Springer, 2022.
Citations (1)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.