Learning from Temporal Gradient for Semi-supervised Action Recognition (2111.13241v3)

Published 25 Nov 2021 in cs.CV

Abstract: Semi-supervised video action recognition can enable deep neural networks to achieve remarkable performance even with very limited labeled data. However, existing methods are mainly adapted from image-based methods (e.g., FixMatch). Because they do not specifically exploit the temporal dynamics and inherent multimodal attributes of video, their results can be suboptimal. To better leverage the temporal information encoded in videos, we introduce temporal gradient as an additional modality for more attentive feature extraction. Specifically, our method explicitly distills fine-grained motion representations from temporal gradient (TG) and imposes consistency across modalities (i.e., RGB and TG). This significantly improves semi-supervised action recognition without additional computation or parameters during inference. Our method achieves state-of-the-art performance on three video action recognition benchmarks (i.e., Kinetics-400, UCF-101, and HMDB-51) under several typical semi-supervised settings (i.e., different ratios of labeled data).
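
As a rough illustration of the temporal gradient (TG) modality named in the abstract, the sketch below approximates TG as the difference between RGB frames a fixed number of steps apart. This is an assumption made for illustration: the abstract does not define the computation, and the function name `temporal_gradient`, the `stride` parameter, and the `(T, H, W, C)` clip layout are all hypothetical. It does not cover the distillation or cross-modal consistency objectives.

```python
import numpy as np


def temporal_gradient(clip: np.ndarray, stride: int = 1) -> np.ndarray:
    """Approximate the temporal gradient of a clip by frame differencing.

    Assumption: `clip` has shape (T, H, W, C), i.e. T frames of size H x W
    with C channels. Returns an array of shape (T - stride, H, W, C).
    """
    if clip.ndim != 4:
        raise ValueError("expected a clip of shape (T, H, W, C)")
    if not 1 <= stride < clip.shape[0]:
        raise ValueError("stride must be at least 1 and less than T")

    # Cast to float first so that differences of uint8 frames can be negative
    # instead of wrapping around.
    frames = clip.astype(np.float32)

    # Each output frame is the frame `stride` steps later minus the current one.
    return frames[stride:] - frames[:-stride]


if __name__ == "__main__":
    # Toy clip: 8 random RGB frames of size 4 x 4.
    rng = np.random.default_rng(0)
    clip = rng.integers(0, 256, size=(8, 4, 4, 3), dtype=np.uint8)
    tg = temporal_gradient(clip)
    print(tg.shape, tg.dtype)
```

In a setup like the one the abstract describes, such a TG clip would be used alongside the RGB clip during training only, which is consistent with the claim that inference needs no additional computation or parameters.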

Authors (8)
  1. Junfei Xiao
  2. Longlong Jing
  3. Lin Zhang
  4. Ju He
  5. Qi She
  6. Zongwei Zhou
  7. Alan Yuille
  8. Yingwei Li
Citations (42)
