
USTC-NELSLIP System Description for DIHARD-III Challenge (2103.10661v1)

Published 19 Mar 2021 in cs.SD, cs.LG, and eess.AS

Abstract: This paper describes our submission to the Third DIHARD Speech Diarization Challenge. Beyond the traditional clustering-based system, the innovation of our system lies in combining several front-end techniques to solve the diarization problem, including speech separation and target-speaker-based voice activity detection (TS-VAD), together with iterative data purification. We also adopted audio domain classification to design domain-dependent processing. Finally, we applied post-processing for system fusion and selection. Our best system achieved diarization error rates (DERs) of 11.30% in track 1 and 16.78% in track 2 on the evaluation set.

Authors (9)
  1. Yuxuan Wang (239 papers)
  2. Maokui He (8 papers)
  3. Shutong Niu (13 papers)
  4. Lei Sun (138 papers)
  5. Tian Gao (57 papers)
  6. Xin Fang (77 papers)
  7. Jia Pan (127 papers)
  8. Jun Du (130 papers)
  9. Chin-Hui Lee (52 papers)
Citations (27)
