
LRPD: Large Replay Parallel Dataset

Published 29 Sep 2023 in eess.AS and cs.SD | (2309.17298v1)

Abstract: Recent research in voice anti-spoofing (VAS) shows that deep neural networks (DNNs) outperform classic approaches such as GMMs in presentation attack detection. However, DNNs require a lot of data to converge and still lack generalization ability. To foster progress in neural network systems, we introduce the Large Replay Parallel Dataset (LRPD), aimed at the detection of replay attacks. LRPD contains more than 1M utterances collected by 19 recording devices in 17 different environments. We also provide an example training pipeline in PyTorch [1] and a baseline system that achieves 0.28% Equal Error Rate (EER) on the evaluation subset of LRPD and 11.91% EER on the publicly available ASVspoof 2017 [2] eval set. These results show that a model trained on LRPD performs consistently under fully unknown conditions. Our dataset is free for research purposes and hosted on GDrive. Baseline code and pre-trained models are available on GitHub.
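For readers unfamiliar with the metric quoted in the abstract, EER is the operating point at which the false acceptance rate (replayed audio accepted as genuine) equals the false rejection rate (genuine audio rejected). The following is a minimal sketch of one common way to estimate it from detector scores. It is not the authors' baseline code: the function name `equal_error_rate`, the score convention (higher means more likely bona fide), and the toy scores are all assumptions made for illustration.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Estimate the EER by sweeping every observed score as a threshold.

    Assumption: higher score = more likely bona fide (genuine) speech.
    Returns (eer, threshold) at the point where FAR and FRR are closest.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best = None
    for t in thresholds:
        # False rejection: a bona fide utterance scored below the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance: a replayed utterance scored at or above it.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # Report the midpoint of FAR and FRR where they are closest.
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy, made-up scores: one bona fide utterance (0.35) scores lower
# than one replayed utterance (0.4), so the classes overlap.
bonafide = [0.9, 0.8, 0.7, 0.35]
spoof = [0.1, 0.2, 0.3, 0.4]
eer, threshold = equal_error_rate(bonafide, spoof)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

On finite score sets FAR and FRR rarely cross exactly, so this sketch reports their midpoint at the closest threshold; evaluation toolkits often interpolate instead, which can give slightly different values.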

References (16)
  1. “PyTorch: An imperative style, high-performance deep learning library,” Advances in Neural Information Processing Systems, vol. 32, pp. 8026–8037, 2019.
  2. “The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection,” in Proc. Interspeech 2017, 2017, pp. 2–6.
  3. “ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge,” in Sixteenth Annual Conference of the International Speech Communication Association, 2015.
  4. “ASVspoof 2019: Future horizons in spoofed and fake audio detection,” arXiv preprint arXiv:1904.05441, 2019.
  5. “On the vulnerability of speaker verification to realistic voice spoofing,” in 2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS). IEEE, 2015, pp. 1–6.
  6. “On the use of convolutional neural networks for speech presentation attack detection,” in 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA). IEEE, 2018, pp. 1–8.
  7. “Phonespoof: A new dataset for spoofing attack detection in telephone channel,” in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019, pp. 2572–2576.
  8. “CSTR VCTK Corpus: English multi-speaker corpus for CSTR voice cloning toolkit (version 0.92),” 2019.
  9. “LibriSpeech: an ASR corpus based on public domain audio books,” in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015, pp. 5206–5210.
  10. “Common voice: A massively-multilingual speech corpus,” arXiv preprint arXiv:1912.06670, 2019.
  11. “Google crowdsourced speech corpora and related open-source resources for low-resource languages and dialects: An overview,” arXiv preprint arXiv:2010.06778, 2020.
  12. “CN-Celeb: multi-genre speaker recognition,” arXiv preprint arXiv:2012.12468, 2020.
  13. “RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification,” arXiv preprint arXiv:1904.08104, 2019.
  14. “MUSAN: A music, speech, and noise corpus,” arXiv preprint arXiv:1510.08484, 2015.
  15. “DCASE 2017 challenge setup: Tasks, datasets and baseline system,” in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), November 2017, pp. 85–92.
  16. “DEMAND: a collection of multi-channel recordings of acoustic noise in diverse environments,” June 2013, Supported by Inria under the Associate Team Program VERSAMUS.