Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection (1811.07460v1)

Published 19 Nov 2018 in cs.CV

Abstract: This paper proposes a segregated temporal assembly recurrent (STAR) network for weakly-supervised multiple action detection. The model learns from untrimmed videos with only supervision of video-level labels and makes prediction of intervals of multiple actions. Specifically, we first assemble video clips according to class labels by an attention mechanism that learns class-variable attention weights and thus helps the noise relieving from background or other actions. Secondly, we build temporal relationship between actions by feeding the assembled features into an enhanced recurrent neural network. Finally, we transform the output of recurrent neural network into the corresponding action distribution. In order to generate more precise temporal proposals, we design a score term called segregated temporal gradient-weighted class activation mapping (ST-GradCAM) fused with attention weights. Experiments on THUMOS'14 and ActivityNet1.3 datasets show that our approach outperforms the state-of-the-art weakly-supervised method, and performs at par with the fully-supervised counterparts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Yunlu Xu (18 papers)
  2. Chengwei Zhang (19 papers)
  3. Zhanzhan Cheng (28 papers)
  4. Jianwen Xie (52 papers)
  5. Yi Niu (38 papers)
  6. Shiliang Pu (106 papers)
  7. Fei Wu (317 papers)
Citations (78)

Summary

We haven't generated a summary for this paper yet.