THUEE system description for NIST 2020 SRE CTS challenge (2210.06111v1)
Published 12 Oct 2022 in cs.SD, cs.AI, eess.AS, and eess.SP
Abstract: This paper describes the THUEE team's system for the NIST 2020 Speaker Recognition Evaluation (SRE) conversational telephone speech (CTS) challenge. We developed ResNet74, ResNet152, and RepVGG-B2 subsystems as speaker embedding extractors. For training, we used a loss function that combines AM-Softmax and AAM-Softmax, which we call CM-Softmax, and adopted a two-stage training strategy to further improve system performance. Our final submission fuses all individual systems. This approach ranked 1st in the challenge.
- Yu Zheng (196 papers)
- Jinghan Peng (3 papers)
- Miao Zhao (14 papers)
- Yufeng Ma (7 papers)
- Min Liu (236 papers)
- Xinyue Ma (9 papers)
- Tianyu Liang (11 papers)
- Tianlong Kong (3 papers)
- Liang He (202 papers)
- Minqiang Xu (17 papers)
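
The abstract names a loss that combines AM-Softmax (an additive margin on the cosine) and AAM-Softmax (an additive margin on the angle) but does not give its formula. The sketch below shows one common way such margins are combined, where the target-class logit becomes `scale * (cos(theta + m1) - m2)`. This form, the function names, and the default values of `m1`, `m2`, and `scale` are all assumptions for illustration, not the authors' confirmed CM-Softmax definition or settings.

```python
import numpy as np

def combined_margin_logits(cosines, labels, m1=0.2, m2=0.1, scale=32.0):
    """Apply an angular margin m1 and a cosine margin m2 to the target class.

    cosines: (batch, classes) cosine similarities between embeddings and
             class weight vectors. labels: (batch,) integer class indices.
    All default values are illustrative assumptions.
    """
    cosines = np.clip(np.asarray(cosines, dtype=float), -1.0, 1.0)
    rows = np.arange(cosines.shape[0])
    theta = np.arccos(cosines[rows, labels])          # target-class angle
    logits = cosines.copy()
    # AAM-style margin inside the cosine, AM-style margin outside it.
    # Note: the case theta + m1 > pi is not specially handled here.
    logits[rows, labels] = np.cos(theta + m1) - m2
    return scale * logits

def cross_entropy(logits, labels):
    """Mean softmax cross-entropy, computed in a numerically stable way."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(labels)), labels].mean())

if __name__ == "__main__":
    cos = np.array([[0.8, 0.1, -0.3], [0.2, 0.7, 0.0]])
    y = np.array([0, 1])
    print(cross_entropy(combined_margin_logits(cos, y), y))
```

Setting `m1=0` reduces this sketch to an AM-Softmax-style logit and setting `m2=0` reduces it to an AAM-Softmax-style logit, which is why the two margins are often described as complementary.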