Unsupervised Discriminative Learning of Sounds for Audio Event Classification (2105.09279v2)

Published 19 May 2021 in cs.SD, cs.CV, and eess.AS

Abstract: Recent progress in network-based audio event classification has shown the benefit of pre-training models on visual data such as ImageNet. While this process allows knowledge transfer across different domains, training a model on large-scale visual datasets is time consuming. On several audio event classification benchmarks, we show a fast and effective alternative that pre-trains the model unsupervised, only on audio data and yet delivers on-par performance with ImageNet pre-training. Furthermore, we show that our discriminative audio learning can be used to transfer knowledge across audio datasets and optionally include ImageNet pre-training.

Authors (5)

Sascha Hornauer (11 papers)
Ke Li (723 papers)
Stella X. Yu (65 papers)
Shabnam Ghaffarzadegan (10 papers)
Liu Ren (57 papers)

Citations (5)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Unsupervised Discriminative Learning of Sounds for Audio Event Classification (2105.09279v2)

Summary

Related Papers