Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unsupervised Discriminative Learning of Sounds for Audio Event Classification (2105.09279v2)

Published 19 May 2021 in cs.SD, cs.CV, and eess.AS

Abstract: Recent progress in network-based audio event classification has shown the benefit of pre-training models on visual data such as ImageNet. While this process allows knowledge transfer across different domains, training a model on large-scale visual datasets is time consuming. On several audio event classification benchmarks, we show a fast and effective alternative that pre-trains the model unsupervised, only on audio data and yet delivers on-par performance with ImageNet pre-training. Furthermore, we show that our discriminative audio learning can be used to transfer knowledge across audio datasets and optionally include ImageNet pre-training.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Sascha Hornauer (11 papers)
  2. Ke Li (723 papers)
  3. Stella X. Yu (65 papers)
  4. Shabnam Ghaffarzadegan (10 papers)
  5. Liu Ren (57 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.