Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

4D Human Body Capture from Egocentric Video via 3D Scene Grounding (2011.13341v2)

Published 26 Nov 2020 in cs.CV

Abstract: We introduce a novel task of reconstructing a time series of second-person 3D human body meshes from monocular egocentric videos. The unique viewpoint and rapid embodied camera motion of egocentric videos raise additional technical barriers for human body capture. To address those challenges, we propose a simple yet effective optimization-based approach that leverages 2D observations of the entire video sequence and human-scene interaction constraint to estimate second-person human poses, shapes, and global motion that are grounded on the 3D environment captured from the egocentric view. We conduct detailed ablation studies to validate our design choice. Moreover, we compare our method with the previous state-of-the-art method on human motion capture from monocular video, and show that our method estimates more accurate human-body poses and shapes under the challenging egocentric setting. In addition, we demonstrate that our approach produces more realistic human-scene interaction.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Miao Liu (98 papers)
  2. Dexin Yang (1 paper)
  3. Yan Zhang (954 papers)
  4. Zhaopeng Cui (64 papers)
  5. James M. Rehg (91 papers)
  6. Siyu Tang (86 papers)
Citations (36)

Summary

We haven't generated a summary for this paper yet.