Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Audio Deepfake Detection: A Survey (2308.14970v1)

Published 29 Aug 2023 in cs.SD and eess.AS

Abstract: Audio deepfake detection is an emerging active topic. A growing number of literatures have aimed to study deepfake detection algorithms and achieved effective performance, the problem of which is far from being solved. Although there are some review literatures, there has been no comprehensive survey that provides researchers with a systematic overview of these developments with a unified evaluation. Accordingly, in this survey paper, we first highlight the key differences across various types of deepfake audio, then outline and analyse competitions, datasets, features, classifications, and evaluation of state-of-the-art approaches. For each aspect, the basic techniques, advanced developments and major challenges are discussed. In addition, we perform a unified comparison of representative features and classifiers on ASVspoof 2021, ADD 2023 and In-the-Wild datasets for audio deepfake detection, respectively. The survey shows that future research should address the lack of large scale datasets in the wild, poor generalization of existing detection methods to unknown fake attacks, as well as interpretability of detection results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jiangyan Yi (77 papers)
  2. Chenglong Wang (80 papers)
  3. Jianhua Tao (139 papers)
  4. Xiaohui Zhang (105 papers)
  5. Chu Yuan Zhang (9 papers)
  6. Yan Zhao (120 papers)
Citations (28)

Summary

We haven't generated a summary for this paper yet.