Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition (2212.03090v1)

Published 6 Dec 2022 in cs.SD and eess.AS

Abstract: Very deep models for speaker recognition (SR) have demonstrated remarkable performance improvement in recent research. However, it is impractical to deploy these models for on-device applications with constrained computational resources. On the other hand, light-weight models are highly desired in practice despite their sub-optimal performance. This research aims to improve light-weight SR models through large-scale label-free knowledge distillation (KD). Existing KD approaches for SR typically require speaker labels to learn task-specific knowledge, due to the inefficiency of the conventional distillation loss. To address this inefficiency and achieve label-free KD, we propose to employ the contrastive loss from self-supervised learning for distillation. Extensive experiments are conducted on a collection of public speech datasets from diverse sources. Results on light-weight SR models show that the proposed approach of label-free KD with contrastive loss consistently outperforms both conventional distillation methods and self-supervised learning methods by a significant margin.
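
The abstract does not spell out the exact loss formulation. The sketch below shows one common way an InfoNCE-style contrastive distillation objective can be written, assuming a frozen teacher and a trainable light-weight student that produce utterance-level embeddings; the function name, temperature value, and use of in-batch negatives are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of label-free contrastive distillation (InfoNCE-style).
# Assumes a frozen teacher and a trainable light-weight student; no speaker labels used.
import torch
import torch.nn.functional as F

def contrastive_distillation_loss(student_emb, teacher_emb, temperature=0.07):
    """Each student embedding is pulled toward the teacher embedding of the
    same utterance, with the other utterances in the batch acting as negatives."""
    # L2-normalise both sets so dot products become cosine similarities.
    s = F.normalize(student_emb, dim=-1)                  # (B, D)
    t = F.normalize(teacher_emb, dim=-1)                  # (B, D)
    logits = s @ t.t() / temperature                      # (B, B) similarity matrix
    targets = torch.arange(s.size(0), device=s.device)    # diagonal entries are positives
    return F.cross_entropy(logits, targets)

# Illustrative usage: only the student receives gradients.
# with torch.no_grad():
#     teacher_emb = teacher(wav_batch)
# loss = contrastive_distillation_loss(student(wav_batch), teacher_emb)
```

Because the positives are defined purely by pairing teacher and student views of the same utterance, this objective needs no speaker labels, which is the property the paper exploits for large-scale label-free distillation.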

Authors (5)
  1. Zhiyuan Peng (33 papers)
  2. Xuanji He (4 papers)
  3. Ke Ding (30 papers)
  4. Tan Lee (70 papers)
  5. Guanglu Wan (24 papers)
Citations (4)
