Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Contextual Attention for Human-Object Interaction Detection (1910.07721v1)

Published 17 Oct 2019 in cs.CV

Abstract: Human-object interaction detection is an important and relatively new class of visual relationship detection tasks, essential for deeper scene understanding. Most existing approaches decompose the problem into object localization and interaction recognition. Despite showing progress, these approaches only rely on the appearances of humans and objects and overlook the available context information, crucial for capturing subtle interactions between them. We propose a contextual attention framework for human-object interaction detection. Our approach leverages context by learning contextually-aware appearance features for human and object instances. The proposed attention module then adaptively selects relevant instance-centric context information to highlight image regions likely to contain human-object interactions. Experiments are performed on three benchmarks: V-COCO, HICO-DET and HCVRD. Our approach outperforms the state-of-the-art on all datasets. On the V-COCO dataset, our method achieves a relative gain of 4.4% in terms of role mean average precision ($mAP_{role}$), compared to the existing best approach.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Tiancai Wang (48 papers)
  2. Rao Muhammad Anwer (67 papers)
  3. Muhammad Haris Khan (68 papers)
  4. Fahad Shahbaz Khan (225 papers)
  5. Yanwei Pang (67 papers)
  6. Ling Shao (244 papers)
  7. Jorma Laaksonen (37 papers)
Citations (115)

Summary

We haven't generated a summary for this paper yet.