Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023 (2308.12526v1)

Published 24 Aug 2023 in eess.AS, cs.LG, and cs.SD

Abstract: This report describes the UNISOUND submission for Track1 and Track2 of VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC 2023). We submit the same system on Track 1 and Track 2, which is trained with only VoxCeleb2-dev. Large-scale ResNet and RepVGG architectures are developed for the challenge. We propose a consistency-aware score calibration method, which leverages the stability of audio voiceprints in similarity score by a Consistency Measure Factor (CMF). CMF brings a huge performance boost in this challenge. Our final system is a fusion of six models and achieves the first place in Track 1 and second place in Track 2 of VoxSRC 2023. The minDCF of our submission is 0.0855 and the EER is 1.5880%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yu Zheng (196 papers)
  2. Yajun Zhang (11 papers)
  3. Chuanying Niu (1 paper)
  4. Yibin Zhan (2 papers)
  5. Yanhua Long (21 papers)
  6. Dongxing Xu (5 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.