
Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset (1809.00241v2)

Published 1 Sep 2018 in cs.CV, cs.LG, and cs.MM

Abstract: Moments capture a huge part of our lives. Accurately recognizing these moments is challenging because they can be interpreted in diverse and complex ways. Action recognition is the task of classifying the action or activity present in a given video. In this work, we perform experiments on the Moments in Time dataset to accurately recognize activities occurring in 3-second clips. We use state-of-the-art techniques for visual, auditory, and spatio-temporal localization and develop a method to accurately classify the activities in the Moments in Time dataset. Our novel approach of using visual-based textual features and fusion techniques performs well, achieving an overall Top-5 accuracy of 89.23% on the 20 classes, a significant improvement over the baseline TRN model.
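
For readers unfamiliar with the metric, Top-5 accuracy counts a clip as correct when its true class is among the five highest-scoring predictions. The following is a minimal sketch of that computation, not the authors' evaluation code; the function name `top_k_accuracy`, the array shapes, and the toy scores are assumptions made for illustration.

```python
import numpy as np

def top_k_accuracy(scores, labels, k=5):
    """Fraction of clips whose true class is among the k highest-scoring classes.

    scores: array of shape (num_clips, num_classes), higher means more likely.
    labels: array of shape (num_clips,) holding integer class indices.
    Both names and shapes are illustrative assumptions, not taken from the paper.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    # Indices of the k highest-scoring classes per clip; order within the k is irrelevant.
    top_k = np.argpartition(-scores, k - 1, axis=1)[:, :k]
    # A clip is a hit if its true label appears anywhere among its top-k indices.
    hits = (top_k == labels[:, None]).any(axis=1)
    return float(hits.mean())

# Toy example with 3 clips and 20 classes (made-up scores, not real results):
# every clip scores class i with value i, so the top-5 classes are 15..19.
toy_scores = np.tile(np.arange(20.0), (3, 1))
toy_labels = np.array([19, 15, 0])
print(top_k_accuracy(toy_scores, toy_labels, k=5))
```

With a 20-class subset, as in the abstract, a uniformly random guesser would reach a Top-5 accuracy of 5/20 = 25%, which gives a rough floor for reading the reported figure.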

Authors (6)
  1. Ankit Shah (47 papers)
  2. Harini Kesavamoorthy (2 papers)
  3. Poorva Rane (2 papers)
  4. Pramati Kalwad (1 paper)
  5. Alexander Hauptmann (46 papers)
  6. Florian Metze (79 papers)