2000 character limit reached
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge (2109.02002v2)
Published 5 Sep 2021 in eess.AS and cs.SD
Abstract: This report describes the submission of the DKU-DukeECE-Lenovo team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021 track 4. Our system including a voice activity detection (VAD) model, a speaker embedding model, two clustering-based speaker diarization systems with different similarity measurements, two different overlapped speech detection (OSD) models, and a target-speaker voice activity detection (TS-VAD) model. Our final submission, consisting of 5 independent systems, achieves a DER of 5.07% on the challenge test set.
- Weiqing Wang (54 papers)
- Danwei Cai (14 papers)
- Qingjian Lin (8 papers)
- Lin Yang (212 papers)
- Junjie Wang (164 papers)
- Jin Wang (356 papers)
- Ming Li (789 papers)