Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding (2309.02423v1)

Published 5 Sep 2023 in cs.CV

Abstract: With the surge in attention to Egocentric Hand-Object Interaction (Ego-HOI), large-scale datasets such as Ego4D and EPIC-KITCHENS have been proposed. However, most current research is built on resources derived from third-person video action recognition. This inherent domain gap between first- and third-person action videos, which have not been adequately addressed before, makes current Ego-HOI suboptimal. This paper rethinks and proposes a new framework as an infrastructure to advance Ego-HOI recognition by Probing, Curation and Adaption (EgoPCA). We contribute comprehensive pre-train sets, balanced test sets and a new baseline, which are complete with a training-finetuning strategy. With our new framework, we not only achieve state-of-the-art performance on Ego-HOI benchmarks but also build several new and effective mechanisms and settings to advance further research. We believe our data and the findings will pave a new way for Ego-HOI understanding. Code and data are available at https://mvig-rhos.com/ego_pca

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Yue Xu (79 papers)
  2. Yong-Lu Li (47 papers)
  3. Zhemin Huang (4 papers)
  4. Michael Xu Liu (1 paper)
  5. Cewu Lu (203 papers)
  6. Yu-Wing Tai (123 papers)
  7. Chi-Keung Tang (81 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.