Multi-Speaker Localization Using Convolutional Neural Network Trained with Noise
Published 12 Dec 2017 in cs.SD, eess.AS, and stat.ML | (1712.04276v1)
Abstract: The problem of multi-speaker localization is formulated as a multi-class multi-label classification problem, which is solved using a convolutional neural network (CNN) based source localization method. Utilizing the common assumption of disjoint speaker activities, we propose a novel method to train the CNN using synthesized noise signals. The proposed localization method is evaluated for two speakers and compared to a well-known steered response power method.
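To make the multi-class multi-label formulation concrete, here is a minimal sketch of how two simultaneous speaker directions could be encoded as a multi-hot target and decoded from per-class probabilities. The 5-degree grid over 0 to 180 degrees, the helper names, the binary cross-entropy loss, and the pick-the-N-largest decoding rule are all illustrative assumptions; none of them is taken from the abstract, and the sketch does not reproduce the paper's CNN or its training on synthesized noise.

```python
import math

# Assumed discretization: a uniform 5-degree DOA grid over 0..180 degrees.
RESOLUTION_DEG = 5
MAX_DEG = 180
N_CLASSES = MAX_DEG // RESOLUTION_DEG + 1  # 37 direction classes


def doa_to_class(angle_deg):
    """Map a direction of arrival in degrees to the nearest grid class index."""
    idx = round(angle_deg / RESOLUTION_DEG)
    return min(max(idx, 0), N_CLASSES - 1)


def multi_hot(angles_deg):
    """Multi-label target: one active entry per speaker direction."""
    target = [0.0] * N_CLASSES
    for angle in angles_deg:
        target[doa_to_class(angle)] = 1.0
    return target


def binary_cross_entropy(probs, target, eps=1e-12):
    """Mean per-class binary cross-entropy, a common multi-label loss."""
    total = 0.0
    for p, t in zip(probs, target):
        total += t * math.log(p + eps) + (1.0 - t) * math.log(1.0 - p + eps)
    return -total / len(target)


def pick_speakers(probs, n_speakers=2):
    """Hypothetical decoding rule: keep the n_speakers most probable classes."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return sorted(ranked[:n_speakers])


# Two speakers at 45 and 120 degrees.
target = multi_hot([45, 120])

# Stand-in for a network output: per-class probabilities (not a real model).
probs = [0.01] * N_CLASSES
probs[doa_to_class(45)] = 0.9
probs[doa_to_class(120)] = 0.8

estimated_classes = pick_speakers(probs, n_speakers=2)
estimated_angles = [c * RESOLUTION_DEG for c in estimated_classes]
loss = binary_cross_entropy(probs, target)
```

Because each class has its own independent probability, more than one direction can be active at once, which is what separates this multi-label setup from a single-label softmax classifier.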