Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$ Videos (2002.03266v1)

Published 9 Feb 2020 in cs.CV

Abstract: The recent development of commodity 360${\circ}$ cameras have enabled a single video to capture an entire scene, which endows promising potentials in surveillance scenarios. However, research in omnidirectional video analysis has lagged behind the hardware advances. In this work, we address the important problem of action recognition in top-view 360${\circ}$ videos. Due to the wide filed-of-view, 360${\circ}$ videos usually capture multiple people performing actions at the same time. Furthermore, the appearance of people are deformed. The proposed framework first transforms omnidirectional videos into panoramic videos, then it extracts spatial-temporal features using region-based 3D CNNs for action recognition. We propose a weakly-supervised method based on multi-instance multi-label learning, which trains the model to recognize and localize multiple actions in a video using only video-level action labels as supervision. We perform experiments to quantitatively validate the efficacy of the proposed method and qualitatively demonstrate action localization results. To enable research in this direction, we introduce 360Action, the first omnidirectional video dataset for multi-person action recognition.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Junnan Li (56 papers)
  2. Jianquan Liu (8 papers)
  3. Yongkang Wong (38 papers)
  4. Shoji Nishimura (2 papers)
  5. Mohan Kankanhalli (117 papers)
Citations (11)

Summary

We haven't generated a summary for this paper yet.