HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction (2203.01577v4)

Published 3 Mar 2022 in cs.CV

Abstract: We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze research on category-level human-object interaction. HOI4D consists of 2.4M RGB-D egocentric video frames over 4000 sequences collected by 4 participants interacting with 800 different object instances from 16 categories over 610 different indoor rooms. Frame-wise annotations for panoptic segmentation, motion segmentation, 3D hand pose, category-level object pose and hand action are provided, together with reconstructed object meshes and scene point clouds. With HOI4D, we establish three benchmarking tasks to promote category-level HOI from 4D visual signals: semantic segmentation of 4D dynamic point cloud sequences, category-level object pose tracking, and egocentric action segmentation with diverse interaction targets. In-depth analysis shows that HOI4D poses great challenges to existing methods and opens up great research opportunities.

Authors (10)
  1. Yunze Liu (17 papers)
  2. Yun Liu (213 papers)
  3. Che Jiang (8 papers)
  4. Kangbo Lyu (2 papers)
  5. Weikang Wan (9 papers)
  6. Hao Shen (100 papers)
  7. Boqiang Liang (1 paper)
  8. Zhoujie Fu (5 papers)
  9. He Wang (294 papers)
  10. Li Yi (111 papers)
Citations (118)
