Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling (2106.11811v1)

Published 20 Jun 2021 in cs.CV

Abstract: Weakly-Supervised Temporal Action Localization (WS-TAL) task aims to recognize and localize temporal starts and ends of action instances in an untrimmed video with only video-level label supervision. Due to lack of negative samples of background category, it is difficult for the network to separate foreground and background, resulting in poor detection performance. In this report, we present our 2021 HACS Challenge - Weakly-supervised Learning Track solution that based on BaSNet to address above problem. Specifically, we first adopt pre-trained CSN, Slowfast, TDN, and ViViT as feature extractors to get feature sequences. Then our proposed Local-Global Background Modeling Network (LGBM-Net) is trained to localize instances by using only video-level labels based on Multi-Instance Learning (MIL). Finally, we ensemble multiple models to get the final detection results and reach 22.45% mAP on the test set

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Xiang Wang (279 papers)
  2. Zhiwu Qing (29 papers)
  3. Ziyuan Huang (43 papers)
  4. Yutong Feng (33 papers)
  5. Shiwei Zhang (179 papers)
  6. Jianwen Jiang (25 papers)
  7. Mingqian Tang (23 papers)
  8. Yuanjie Shao (16 papers)
  9. Nong Sang (86 papers)
Citations (4)