2000 character limit reached
General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline (1807.09902v3)
Published 26 Jul 2018 in cs.SD, cs.LG, eess.AS, and stat.ML
Abstract: This paper describes Task 2 of the DCASE 2018 Challenge, titled "General-purpose audio tagging of Freesound content with AudioSet labels". This task was hosted on the Kaggle platform as "Freesound General-Purpose Audio Tagging Challenge". The goal of the task is to build an audio tagging system that can recognize the category of an audio clip from a subset of 41 diverse categories drawn from the AudioSet Ontology. We present the task, the dataset prepared for the competition, and a baseline system.