Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Foreground-Background Ambient Sound Scene Separation (2005.07006v2)

Published 11 May 2020 in eess.AS, cs.LG, cs.SD, and eess.SP

Abstract: Ambient sound scenes typically comprise multiple short events occurring on top of a somewhat stationary background. We consider the task of separating these events from the background, which we call foreground-background ambient sound scene separation. We propose a deep learning-based separation framework with a suitable feature normaliza-tion scheme and an optional auxiliary network capturing the background statistics, and we investigate its ability to handle the great variety of sound classes encountered in ambient sound scenes, which have often not been seen in training. To do so, we create single-channel foreground-background mixtures using isolated sounds from the DESED and Audioset datasets, and we conduct extensive experiments with mixtures of seen or unseen sound classes at various signal-to-noise ratios. Our experimental findings demonstrate the generalization ability of the proposed approach.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Michel Olvera (7 papers)
  2. Emmanuel Vincent (44 papers)
  3. Romain Serizel (53 papers)
  4. Gilles Gasso (30 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.