
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023 (2308.07595v2)

Published 15 Aug 2023 in eess.AS

Abstract: This paper describes the DKU-MSXF submission to track 4 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23). Our system pipeline consists of voice activity detection, clustering-based diarization, overlapped speech detection, and target-speaker voice activity detection (TSVAD), where the output of each stage is fused from 3 sub-models. Finally, we fuse the different clustering-based and TSVAD-based diarization systems using DOVER-Lap and achieve a 4.30% diarization error rate (DER), which ranks first on track 4 of the challenge leaderboard.
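
For readers unfamiliar with the metric, the following is a minimal sketch of how a frame-level DER can be computed. It is not the challenge's official scoring tool: it assumes hypothesis speaker labels have already been mapped to reference labels, applies no forgiveness collar, and represents each frame as the set of active speakers so that overlapped speech is counted. The function name `frame_level_der` and the toy labels are hypothetical.

```python
from typing import List, Set


def frame_level_der(reference: List[Set[str]], hypothesis: List[Set[str]]) -> float:
    """Simplified frame-level DER.

    Assumptions (not from the paper): speaker labels are already aligned
    between reference and hypothesis, and no collar is applied.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same number of frames")

    missed = false_alarm = confusion = total_ref = 0
    for ref, hyp in zip(reference, hypothesis):
        n_ref, n_hyp = len(ref), len(hyp)
        n_correct = len(ref & hyp)                   # speakers found in both
        missed += max(0, n_ref - n_hyp)              # reference speech not detected
        false_alarm += max(0, n_hyp - n_ref)         # speech detected where none exists
        confusion += min(n_ref, n_hyp) - n_correct   # speech attributed to the wrong speaker
        total_ref += n_ref

    if total_ref == 0:
        raise ValueError("reference contains no speech")
    return (missed + false_alarm + confusion) / total_ref


# Toy example: five frames, one with overlapped speech.
ref = [{"A"}, {"A"}, {"A", "B"}, {"B"}, set()]
hyp = [{"A"}, {"B"}, {"A"}, {"B"}, {"B"}]
der = frame_level_der(ref, hyp)  # one confusion, one miss, one false alarm over 5 reference speaker-frames
```

In this toy case the three error frames over five reference speaker-frames give a DER of 0.6; a reported DER of 4.30% corresponds to a value of 0.043 on this scale.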

Authors (7)
  1. Ming Cheng (69 papers)
  2. Weiqing Wang (55 papers)
  3. Xiaoyi Qin (27 papers)
  4. Yuke Lin (12 papers)
  5. Ning Jiang (177 papers)
  6. Guoqing Zhao (20 papers)
  7. Ming Li (789 papers)
Citations (10)
