Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification (2310.04760v1)

Published 7 Oct 2023 in eess.AS and cs.SD

Abstract: Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose a novel pseudo-labeling method named Multi-objective Progressive Clustering (MoPC), specifically designed for semi-supervised domain adaptation. Firstly, we utilize limited labeled data from the target domain to derive domain-specific descriptors based on multiple distinct objectives, namely within-graph denoising, intra-class denoising and inter-class denoising. Then, the Infomap algorithm is adopted for embedding clustering, and the descriptors are leveraged to further refine the target domain's pseudo-labels. Moreover, to further improve the quality of pseudo labels, we introduce the subcenter-purification and progressive-merging strategy for label denoising. Our proposed MoPC method achieves 4.95% EER and ranked the 1${st}$ place on the evaluation set of VoxSRC 2023 track 3. We also conduct additional experiments on the FFSVC dataset and yield promising results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Ze Li (41 papers)
  2. Yuke Lin (12 papers)
  3. Ning Jiang (177 papers)
  4. Xiaoyi Qin (27 papers)
  5. Guoqing Zhao (20 papers)
  6. Haiying Wu (4 papers)
  7. Ming Li (787 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.