
SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification (2109.08839v1)

Published 18 Sep 2021 in cs.SD, cs.CL, cs.CV, cs.LG, and eess.AS

Abstract: Recently, the x-vector has been a successful and popular approach for speaker verification; it employs a time delay neural network (TDNN) and statistics pooling to extract speaker-characterizing embeddings from variable-length utterances. Improving upon the x-vector has been an active research area, and numerous neural networks have been elaborately designed based on it, e.g., extended TDNN (E-TDNN), factorized TDNN (F-TDNN), and densely connected TDNN (D-TDNN). In this work, we try to identify the optimal architectures from a TDNN-based search space by employing neural architecture search (NAS); we name the approach SpeechNAS. Leveraging recent advances in speaker recognition, such as high-order statistics pooling, the multi-branch mechanism, D-TDNN, and the angular additive margin softmax (AAM) loss with minimum hyper-spherical energy (MHE), SpeechNAS automatically discovers five network architectures, SpeechNAS-1 to SpeechNAS-5, with various numbers of parameters and GFLOPs on the large-scale text-independent speaker recognition dataset VoxCeleb1. Our best derived neural network achieves an equal error rate (EER) of 1.02% on the standard test set of VoxCeleb1, which surpasses previous TDNN-based state-of-the-art approaches by a large margin. Code and trained weights are available at https://github.com/wentaozhu/speechnas.git
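
The abstract notes that statistics pooling is what turns a variable-length utterance into a fixed-size speaker embedding. As a rough illustration only, the sketch below pools frame-level features by concatenating the per-dimension mean and standard deviation over time. It is not taken from the SpeechNAS code: the function name `statistics_pooling` and the toy inputs are hypothetical, and it covers only first- and second-order statistics, not the high-order statistics pooling the abstract mentions.

```python
import math
from typing import List, Sequence


def statistics_pooling(frames: Sequence[Sequence[float]]) -> List[float]:
    """Pool a (T x D) sequence of frame-level features into a 2*D vector.

    Concatenates the per-dimension mean and population standard deviation
    over the time axis, so the output size does not depend on T.
    (Hypothetical helper for illustration; not the authors' implementation.)
    """
    if not frames:
        raise ValueError("need at least one frame")
    num_frames = len(frames)
    dim = len(frames[0])
    # First-order statistic: mean of each feature dimension over time.
    means = [sum(f[d] for f in frames) / num_frames for d in range(dim)]
    # Second-order statistic: population standard deviation over time.
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / num_frames)
        for d in range(dim)
    ]
    return means + stds


# Two toy "utterances" with different numbers of frames (T=2 and T=4),
# both with D=2 features per frame.
short_utt = [[1.0, 2.0], [3.0, 6.0]]
long_utt = [[0.0, 1.0], [2.0, 1.0], [4.0, 1.0], [6.0, 1.0]]

# Both pooled vectors have length 2*D = 4 regardless of utterance length.
print(len(statistics_pooling(short_utt)), len(statistics_pooling(long_utt)))
```

Because the pooled vector always has length 2*D, the layers after pooling can be ordinary fixed-size layers, regardless of how many frames the input utterance contains.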

Authors (9)
  1. Wentao Zhu (73 papers)
  2. Tianlong Kong (3 papers)
  3. Shun Lu (6 papers)
  4. Jixiang Li (7 papers)
  5. Dawei Zhang (35 papers)
  6. Feng Deng (5 papers)
  7. Xiaorui Wang (30 papers)
  8. Sen Yang (191 papers)
  9. Ji Liu (285 papers)
Citations (6)
