
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition (1812.05637v3)

Published 13 Dec 2018 in cs.CV

Abstract: Video action recognition, a critical problem in video understanding, has been gaining increasing attention. To identify actions induced by complex object-object interactions, we need to consider not only spatial relations among objects in a single frame, but also temporal relations among different or the same objects across multiple frames. However, existing approaches that model video representations and non-local features are either incapable of explicitly modeling relations at the object-object level or unable to handle streaming videos. In this paper, we propose a novel dynamic hidden graph module to model complex object-object interactions in videos, of which two instantiations are considered: a visual graph that captures appearance/motion changes among objects and a location graph that captures relative spatiotemporal position changes among objects. Additionally, the proposed graph module allows us to process streaming videos, setting it apart from existing methods. Experimental results on benchmark datasets, Something-Something and ActivityNet, show the competitive performance of our method.
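The abstract describes two graph instantiations (a visual graph and a location graph) and frame-by-frame processing of streaming video. The following is only a minimal illustrative sketch of that general idea, not the authors' implementation. Every function name, the dot-product and center-distance affinities, the equal-weight fusion of the two graphs, and the `alpha` mixing rule are assumptions introduced here. It also assumes each frame provides the same objects in the same order.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def visual_adjacency(features):
    """Assumed visual graph: row-normalized dot-product similarity of object features."""
    return [softmax([sum(a * b for a, b in zip(fi, fj)) for fj in features])
            for fi in features]

def box_center(box):
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)

def location_adjacency(boxes):
    """Assumed location graph: closer box centers receive larger edge weights."""
    centers = [box_center(b) for b in boxes]
    return [softmax([-math.dist(ci, cj) for cj in centers]) for ci in centers]

def update_hidden(hidden, features, adjacency, alpha=0.5):
    """One streaming step: blend each node's old state with messages from the current frame."""
    new_hidden = []
    for i, h in enumerate(hidden):
        message = [sum(adjacency[i][j] * features[j][d] for j in range(len(features)))
                   for d in range(len(h))]
        new_hidden.append([(1 - alpha) * hv + alpha * mv for hv, mv in zip(h, message)])
    return new_hidden

def process_stream(frames, alpha=0.5):
    """Consume (features, boxes) pairs one frame at a time, keeping only the hidden state."""
    hidden = None
    for features, boxes in frames:
        if hidden is None:
            hidden = [list(f) for f in features]  # initialize from the first frame
        vis = visual_adjacency(features)
        loc = location_adjacency(boxes)
        # Hypothetical fusion: average the two graphs with equal weight.
        adjacency = [[0.5 * (v + l) for v, l in zip(vr, lr)] for vr, lr in zip(vis, loc)]
        hidden = update_hidden(hidden, features, adjacency, alpha)
    return hidden

# Two frames, two objects, 2-D features, boxes given as (x1, y1, x2, y2).
frames = [
    ([[1.0, 0.0], [0.0, 1.0]], [(0, 0, 2, 2), (4, 4, 6, 6)]),
    ([[1.0, 0.0], [0.0, 1.0]], [(0, 0, 2, 2), (1, 1, 3, 3)]),
]
final_hidden = process_stream(frames)
```

Because only the current hidden state is carried between frames, a loop of this shape never needs the full clip in memory, which is the property that makes streaming input possible in this sketch.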

Authors (5)
  1. Hao Huang (155 papers)
  2. Luowei Zhou (31 papers)
  3. Wei Zhang (1489 papers)
  4. Jason J. Corso (71 papers)
  5. Chenliang Xu (114 papers)
Citations (3)
