Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning from Synthetic Depth (1808.09208v1)

Published 28 Aug 2018 in cs.CV

Abstract: Articulated hand pose and shape estimation is an important problem for vision-based applications such as augmented reality and animation. In contrast to the existing methods which optimize only for joint positions, we propose a fully supervised deep network which learns to jointly estimate a full 3D hand mesh representation and pose from a single depth image. To this end, a CNN architecture is employed to estimate parametric representations i.e. hand pose, bone scales and complex shape parameters. Then, a novel hand pose and shape layer, embedded inside our deep framework, produces 3D joint positions and hand mesh. Lack of sufficient training data with varying hand shapes limits the generalized performance of learning based methods. Also, manually annotating real data is suboptimal. Therefore, we present SynHand5M: a million-scale synthetic dataset with accurate joint annotations, segmentation masks and mesh files of depth maps. Among model based learning (hybrid) methods, we show improved results on our dataset and two of the public benchmarks i.e. NYU and ICVL. Also, by employing a joint training strategy with real and synthetic data, we recover 3D hand mesh and pose from real images in 3.7ms.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jameel Malik (8 papers)
  2. Ahmed Elhayek (8 papers)
  3. Fabrizio Nunnari (11 papers)
  4. Kiran Varanasi (9 papers)
  5. Kiarash Tamaddon (1 paper)
  6. Alexis Heloir (2 papers)
  7. Didier Stricker (144 papers)
Citations (78)

Summary

We haven't generated a summary for this paper yet.