
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge (2109.02002v2)

Published 5 Sep 2021 in eess.AS and cs.SD

Abstract: This report describes the submission of the DKU-DukeECE-Lenovo team to Track 4 of the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021. Our system includes a voice activity detection (VAD) model, a speaker embedding model, two clustering-based speaker diarization systems with different similarity measurements, two different overlapped speech detection (OSD) models, and a target-speaker voice activity detection (TS-VAD) model. Our final submission, consisting of 5 independent systems, achieves a diarization error rate (DER) of 5.07% on the challenge test set.

Authors (7)
  1. Weiqing Wang (54 papers)
  2. Danwei Cai (14 papers)
  3. Qingjian Lin (8 papers)
  4. Lin Yang (212 papers)
  5. Junjie Wang (164 papers)
  6. Jin Wang (356 papers)
  7. Ming Li (789 papers)
Citations (26)
