
Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge (1509.06103v1)

Published 21 Sep 2015 in cs.SD and cs.CL

Abstract: This paper presents a contribution to the third 'CHiME' speech separation and recognition challenge, covering both front-end signal processing and back-end speech recognition. In the front end, a multi-channel Wiener filter (MWF) is designed to reduce background noise. Unlike the traditional MWF, the parameter that controls the tradeoff between noise reduction and target signal distortion is optimized according to the desired noise reduction level. In the back end, several techniques are used to improve Automatic Speech Recognition (ASR) performance in noise: Deep Neural Network (DNN), Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) models using a medium vocabulary, lattice rescoring with a big-vocabulary language model finite state transducer, and the ROVER scheme. Experimental results show that the proposed system, combining the front end and back end, is effective in improving ASR performance.
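
The tradeoff mentioned in the abstract can be illustrated with the textbook speech-distortion-weighted form of the MWF, w = (R_s + mu R_n)^{-1} R_s e_ref. The sketch below is only an illustration under stated assumptions: the function names, the covariance inputs `R_s` and `R_n`, and the fixed `mu` are hypothetical, and the paper's own rule for choosing the parameter from the desired noise reduction level is not reproduced here.

```python
import numpy as np

def sdw_mwf_weights(R_s, R_n, mu=1.0, ref=0):
    """Textbook speech-distortion-weighted MWF for one frequency bin.

    R_s, R_n: (M, M) speech and noise spatial covariance matrices (assumed given).
    mu: tradeoff parameter; larger values favor noise reduction over low
        speech distortion (mu = 1 gives the standard MWF).
    ref: index of the reference microphone.
    The filter is applied to a multi-channel frame y as w^H y.
    """
    e_ref = np.zeros(R_s.shape[0], dtype=complex)
    e_ref[ref] = 1.0
    # Solve (R_s + mu * R_n) w = R_s e_ref instead of forming an explicit inverse.
    return np.linalg.solve(R_s + mu * R_n, R_s @ e_ref)

def output_noise_power(w, R_n):
    """Residual noise power w^H R_n w at the filter output."""
    return float(np.real(np.vdot(w, R_n @ w)))
```

Sweeping `mu` and comparing `output_noise_power` for each value is one simple way to see the tradeoff: in this formulation, increasing `mu` lowers the residual noise at the cost of more distortion of the target signal.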

Authors (8)
  1. Xiaofei Wang (139 papers)
  2. Chao Wu (137 papers)
  3. Pengyuan Zhang (57 papers)
  4. Ziteng Wang (51 papers)
  5. Yong Liu (724 papers)
  6. Xu Li (126 papers)
  7. Qiang Fu (159 papers)
  8. Yonghong Yan (38 papers)
Citations (6)
