Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech (2110.05866v1)

Published 12 Oct 2021 in cs.SD, cs.CL, and eess.AS

Abstract: Most of the deep learning-based speech enhancement models are learned in a supervised manner, which implies that pairs of noisy and clean speech are required during training. Consequently, several noisy speeches recorded in daily life cannot be used to train the model. Although certain unsupervised learning frameworks have also been proposed to solve the pair constraint, they still require clean speech or noise for training. Therefore, in this paper, we propose MetricGAN-U, which stands for MetricGAN-unsupervised, to further release the constraint from conventional unsupervised learning. In MetricGAN-U, only noisy speech is required to train the model by optimizing non-intrusive speech quality metrics. The experimental results verified that MetricGAN-U outperforms baselines in both objective and subjective metrics.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Szu-Wei Fu (46 papers)
  2. Cheng Yu (62 papers)
  3. Kuo-Hsuan Hung (22 papers)
  4. Mirco Ravanelli (72 papers)
  5. Yu Tsao (200 papers)
Citations (41)

Summary

We haven't generated a summary for this paper yet.