2000 character limit reached
UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023 (2308.12526v1)
Published 24 Aug 2023 in eess.AS, cs.LG, and cs.SD
Abstract: This report describes the UNISOUND submission for Track1 and Track2 of VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC 2023). We submit the same system on Track 1 and Track 2, which is trained with only VoxCeleb2-dev. Large-scale ResNet and RepVGG architectures are developed for the challenge. We propose a consistency-aware score calibration method, which leverages the stability of audio voiceprints in similarity score by a Consistency Measure Factor (CMF). CMF brings a huge performance boost in this challenge. Our final system is a fusion of six models and achieves the first place in Track 1 and second place in Track 2 of VoxSRC 2023. The minDCF of our submission is 0.0855 and the EER is 1.5880%.
- Yu Zheng (196 papers)
- Yajun Zhang (11 papers)
- Chuanying Niu (1 paper)
- Yibin Zhan (2 papers)
- Yanhua Long (21 papers)
- Dongxing Xu (5 papers)