Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Egocentric 3D Pose Estimation with Third Person Views (2201.02017v3)

Published 6 Jan 2022 in cs.CV

Abstract: In this paper, we propose a novel approach to enhance the 3D body pose estimation of a person computed from videos captured from a single wearable camera. The key idea is to leverage high-level features linking first- and third-views in a joint embedding space. To learn such embedding space we introduce First2Third-Pose, a new paired synchronized dataset of nearly 2,000 videos depicting human activities captured from both first- and third-view perspectives. We explicitly consider spatial- and motion-domain features, combined using a semi-Siamese architecture trained in a self-supervised fashion. Experimental results demonstrate that the joint multi-view embedded space learned with our dataset is useful to extract discriminatory features from arbitrary single-view egocentric videos, without needing domain adaptation nor knowledge of camera parameters. We achieve significant improvement of egocentric 3D body pose estimation performance on two unconstrained datasets, over three supervised state-of-the-art approaches. Our dataset and code will be available for research purposes.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Ameya Dhamanaskar (1 paper)
  2. Mariella Dimiccoli (38 papers)
  3. Enric Corona (14 papers)
  4. Albert Pumarola (31 papers)
  5. Francesc Moreno-Noguer (68 papers)
Citations (10)
Github Logo Streamline Icon: https://streamlinehq.com