Monocular hand pose estimation under complex grasps and occlusions
Develop a monocular 3D hand pose estimation method that accurately reconstructs finger joint articulation, including strongly flexed and overlapping digits during grasp and transport phases of the Box and Block Test, overcoming the articulation errors and missing-finger predictions observed in current methods such as SAM 3D Body and WiLoR when faced with self-occlusion and complex hand-object interactions.
References
It produces body pose estimates that were more consistent with observed motion than those of SMPLer-X and PromptHMR , and while it shares the same limitation in hand pose as WiLoR , this limitation remains an open challenge for monocular hand pose estimation, to our knowledge.
— Enhancing Box and Block Test with Computer Vision for Post-Stroke Upper Extremity Motor Evaluation
(2603.29101 - Robinson et al., 31 Mar 2026) in Results — Comparison of 3D Pose Estimation Methods, Model selection