Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

End-to-end DNN Based Speaker Recognition Inspired by i-vector and PLDA (1710.02369v2)

Published 6 Oct 2017 in eess.AS and cs.SD

Abstract: Recently several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be competitive for text-dependent tasks as well as for text-independent tasks with short utterances. However, for text-independent tasks with longer utterances, end-to-end systems are still outperformed by standard i-vector + PLDA systems. In this work, we develop an end-to-end speaker verification system that is initialized to mimic an i-vector + PLDA baseline. The system is then further trained in an end-to-end manner but regularized so that it does not deviate too far from the initial system. In this way we mitigate overfitting which normally limits the performance of end-to-end systems. The proposed system outperforms the i-vector + PLDA baseline on both long and short duration utterances.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Johan Rohdin (20 papers)
  2. Anna Silnova (22 papers)
  3. Mireia Diez (17 papers)
  4. Oldrich Plchot (80 papers)
  5. Lukas Burget (164 papers)
  6. Pavel Matejka (4 papers)
Citations (53)

Summary

We haven't generated a summary for this paper yet.