Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The Overlooked Classifier in Human-Object Interaction Recognition (2112.06392v2)

Published 13 Dec 2021 in cs.CV

Abstract: Human-Object Interaction (HOI) recognition is challenging due to two factors: (1) significant imbalance across classes and (2) requiring multiple labels per image. This paper shows that these two challenges can be effectively addressed by improving the classifier with the backbone architecture untouched. Firstly, we encode the semantic correlation among classes into the classification head by initializing the weights with language embeddings of HOIs. As a result, the performance is boosted significantly, especially for the few-shot subset. Secondly, we propose a new loss named LSE-Sign to enhance multi-label learning on a long-tailed dataset. Our simple yet effective method enables detection-free HOI classification, outperforming the state-of-the-arts that require object detection and human pose by a clear margin. Moreover, we transfer the classification model to instance-level HOI detection by connecting it with an off-the-shelf object detector. We achieve state-of-the-art without additional fine-tuning.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Ying Jin (57 papers)
  2. Yinpeng Chen (55 papers)
  3. Lijuan Wang (133 papers)
  4. Jianfeng Wang (149 papers)
  5. Pei Yu (45 papers)
  6. Lin Liang (11 papers)
  7. Jenq-Neng Hwang (103 papers)
  8. Zicheng Liu (153 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.