Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification (1608.08851v2)

Published 31 Aug 2016 in cs.CV

Abstract: The video and action classification have extremely evolved by deep neural networks specially with two stream CNN using RGB and optical flow as inputs and they present outstanding performance in terms of video analysis. One of the shortcoming of these methods is handling motion information extraction which is done out side of the CNNs and relatively time consuming also on GPUs. So proposing end-to-end methods which are exploring to learn motion representation, like 3D-CNN can achieve faster and accurate performance. We present some novel deep CNNs using 3D architecture to model actions and motion representation in an efficient way to be accurate and also as fast as real-time. Our new networks learn distinctive models to combine deep motion features into appearance model via learning optical flow features inside the network.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Ali Diba (17 papers)
  2. Ali Mohammad Pazandeh (2 papers)
  3. Luc Van Gool (570 papers)
Citations (75)

Summary

We haven't generated a summary for this paper yet.