
Contact and Human Dynamics from Monocular Video (2007.11678v2)

Published 22 Jul 2020 in cs.CV

Abstract: Existing deep models predict 2D and 3D kinematic poses from video that are approximately accurate, but contain visible errors that violate physical constraints, such as feet penetrating the ground and bodies leaning at extreme angles. In this paper, we present a physics-based method for inferring 3D human motion from video sequences that takes initial 2D and 3D pose estimates as input. We first estimate ground contact timings with a novel prediction network which is trained without hand-labeled data. A physics-based trajectory optimization then solves for a physically plausible motion, based on the inputs. We show this process produces motions that are significantly more realistic than those from purely kinematic methods, substantially improving quantitative measures of both kinematic and dynamic plausibility. We demonstrate our method on character animation and pose estimation tasks on dynamic motions of dancing and sports with complex contact patterns.
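
To make the kind of constraint violation mentioned in the abstract concrete, here is a minimal illustrative sketch. It is not the paper's method: the paper describes a learned contact predictor followed by a physics-based trajectory optimization, whereas this toy is a purely kinematic post-process. It assumes per-frame contact labels are already available, that the ground is a flat plane, and that the third coordinate is "up". The function name `enforce_foot_contact` and all parameters are hypothetical.

```python
import numpy as np

def enforce_foot_contact(foot_pos, in_contact, ground_height=0.0):
    """Toy cleanup of one foot trajectory (NOT the paper's optimization).

    foot_pos:   (T, 3) array of foot positions; index 2 is assumed to be "up".
    in_contact: (T,) boolean array, True where the foot is labeled as on the ground.
    """
    fixed = np.asarray(foot_pos, dtype=float).copy()
    contact = np.asarray(in_contact, dtype=bool)

    # Remove ground penetration at every frame.
    fixed[:, 2] = np.maximum(fixed[:, 2], ground_height)

    # For each contiguous run of contact frames, pin the foot to a single
    # point on the ground so it neither floats nor slides.
    num_frames = len(fixed)
    t = 0
    while t < num_frames:
        if contact[t]:
            end = t
            while end < num_frames and contact[end]:
                end += 1
            fixed[t:end, :2] = fixed[t:end, :2].mean(axis=0)
            fixed[t:end, 2] = ground_height
            t = end
        else:
            t += 1
    return fixed

# Example: the first two frames are in contact, the third is in the air.
noisy = np.array([[0.0, 0.0, -0.02],
                  [0.1, 0.0, 0.01],
                  [0.2, 0.0, 0.30]])
cleaned = enforce_foot_contact(noisy, [True, True, False])
```

A fix like this only edits the feet and ignores the rest of the body, which is why it cannot address errors such as implausible leaning; the abstract's motivation for optimizing the whole motion under physical constraints follows from that limitation.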

Authors (6)
  1. Davis Rempe (20 papers)
  2. Leonidas J. Guibas (75 papers)
  3. Aaron Hertzmann (35 papers)
  4. Bryan Russell (36 papers)
  5. Ruben Villegas (20 papers)
  6. Jimei Yang (58 papers)
Citations (94)
