Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis (2010.01158v1)

Published 2 Oct 2020 in cs.CV and cs.MM

Abstract: Estimating the 3D hand pose from a monocular RGB image is important but challenging. A solution is training on large-scale RGB hand images with accurate 3D hand keypoint annotations. However, it is too expensive in practice. Instead, we have developed a learning-based approach to synthesize realistic, diverse, and 3D pose-preserving hand images under the guidance of 3D pose information. We propose a 3D-aware multi-modal guided hand generative network (MM-Hand), together with a novel geometry-based curriculum learning strategy. Our extensive experimental results demonstrate that the 3D-annotated images generated by MM-Hand qualitatively and quantitatively outperform existing options. Moreover, the augmented data can consistently improve the quantitative performance of the state-of-the-art 3D hand pose estimators on two benchmark datasets. The code will be available at https://github.com/ScottHoang/mm-hand.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Zhenyu Wu (112 papers)
  2. Duc Hoang (12 papers)
  3. Shih-Yao Lin (7 papers)
  4. Yusheng Xie (22 papers)
  5. Liangjian Chen (10 papers)
  6. Yen-Yu Lin (38 papers)
  7. Zhangyang Wang (375 papers)
  8. Wei Fan (160 papers)
Citations (12)

Summary

We haven't generated a summary for this paper yet.