Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multimodal Sparse Coding for Event Detection (1605.05212v1)

Published 17 May 2016 in cs.LG and cs.CV

Abstract: Unsupervised feature learning methods have proven effective for classification tasks based on a single modality. We present multimodal sparse coding for learning feature representations shared across multiple modalities. The shared representations are applied to multimedia event detection (MED) and evaluated in comparison to unimodal counterparts, as well as other feature learning methods such as GMM supervectors and sparse RBM. We report the cross-validated classification accuracy and mean average precision of the MED system trained on features learned from our unimodal and multimodal settings for a subset of the TRECVID MED 2014 dataset.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Youngjune Gwon (20 papers)
  2. William Campbell (7 papers)
  3. Kevin Brady (2 papers)
  4. Douglas Sturim (2 papers)
  5. Miriam Cha (13 papers)
  6. H. T. Kung (34 papers)
Citations (6)