Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge (2203.01573v1)

Published 3 Mar 2022 in eess.AS and cs.SD

Abstract: This paper describes our submitted systems to the 2022 ADD challenge withing the tracks 1 and 2. Our approach is based on the combination of a pre-trained wav2vec2 feature extractor and a downstream classifier to detect spoofed audio. This method exploits the contextualized speech representations at the different transformer layers to fully capture discriminative information. Furthermore, the classification model is adapted to the application scenario using different data augmentation techniques. We evaluate our system for audio synthesis detection in both the ASVspoof 2021 and the 2022 ADD challenges, showing its robustness and good performance in realistic challenging environments such as telephonic and audio codec systems, noisy audio, and partial deepfakes.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Juan M. Martín-Doñas (5 papers)
  2. Aitor Álvarez (3 papers)
Citations (84)

Summary

We haven't generated a summary for this paper yet.