
Proposal Relation Network for Temporal Action Detection (2106.11812v1)

Published 20 Jun 2021 in cs.CV

Abstract: This technical report presents our solution for the temporal action detection task in the ActivityNet Challenge 2021. The task is to locate and identify actions of interest in long untrimmed videos. The main challenge is that the temporal duration of actions varies dramatically, and the target actions are typically embedded in a background of irrelevant activities. Our solution builds on BMN and consists of three steps: 1) action classification and feature encoding with SlowFast, CSN, and ViViT; 2) proposal generation, where we improve BMN by embedding the proposed Proposal Relation Network (PRN) to generate high-quality proposals; 3) action detection, where we obtain detection results by assigning the corresponding classification results to the proposals. Finally, we ensemble the results under different settings and achieve 44.7% on the test set, which improves on the champion result of ActivityNet 2020 by 1.9% in terms of average mAP.
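
The third step, assigning classification results to proposals, can be illustrated with a minimal sketch. The abstract does not say how the scores are combined, so everything here is an assumption made for illustration: the names `Proposal`, `Detection`, and `assign_labels` are hypothetical, the detection score is taken to be the product of the proposal confidence and the video-level class score, and only the `top_k` classes per video are kept.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Proposal:
    start: float  # segment start, in seconds
    end: float    # segment end, in seconds
    score: float  # class-agnostic proposal confidence


@dataclass
class Detection:
    start: float
    end: float
    label: str
    score: float


def assign_labels(
    proposals: List[Proposal],
    class_scores: Dict[str, float],
    top_k: int = 2,
) -> List[Detection]:
    """Pair every proposal with the top-k video-level classes.

    Assumption: the fused score is proposal score * class score.
    This is an illustrative choice, not the report's stated method.
    """
    # Keep the k most likely video-level classes.
    top = sorted(class_scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    detections = [
        Detection(p.start, p.end, label, p.score * cls_score)
        for p in proposals
        for label, cls_score in top
    ]
    # Rank detections by fused score, highest first.
    detections.sort(key=lambda d: d.score, reverse=True)
    return detections


if __name__ == "__main__":
    props = [Proposal(0.0, 10.0, 0.9), Proposal(5.0, 8.0, 0.5)]
    scores = {"long_jump": 0.8, "high_jump": 0.1, "running": 0.05}
    for det in assign_labels(props, scores):
        print(det)
```

With two proposals and `top_k=2`, the sketch yields four detections, ranked so that a confident proposal paired with the most likely class comes first.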

Authors (9)
  1. Xiang Wang (279 papers)
  2. Zhiwu Qing (29 papers)
  3. Ziyuan Huang (43 papers)
  4. Yutong Feng (33 papers)
  5. Shiwei Zhang (179 papers)
  6. Jianwen Jiang (25 papers)
  7. Mingqian Tang (23 papers)
  8. Changxin Gao (76 papers)
  9. Nong Sang (86 papers)
Citations (23)
