Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics (1909.12962v4)

Published 27 Sep 2019 in cs.CR, cs.CV, and eess.IV

Abstract: AI-synthesized face-swapping videos, commonly known as DeepFakes, is an emerging problem threatening the trustworthiness of online information. The need to develop and evaluate DeepFake detection algorithms calls for large-scale datasets. However, current DeepFake datasets suffer from low visual quality and do not resemble DeepFake videos circulated on the Internet. We present a new large-scale challenging DeepFake video dataset, Celeb-DF, which contains 5,639 high-quality DeepFake videos of celebrities generated using improved synthesis process. We conduct a comprehensive evaluation of DeepFake detection methods and datasets to demonstrate the escalated level of challenges posed by Celeb-DF.

Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics

The paper "Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics" by Yuezun Li et al. addresses a crucial issue in the domain of digital forensics: the detection of DeepFake videos. DeepFakes have posed significant threats to the veracity of online content, necessitating advanced detection mechanisms supported by robust datasets. Current datasets like UADFV, DF-TIMIT, and FaceForensics++, among others, suffer from low visual quality artifacts that undermine their efficacy in real-world scenarios. This paper offers a novel dataset, Celeb-DF, designed to overcome these limitations by providing a substantial number of high-quality DeepFake videos generated through an enhanced synthesis process.

The Celeb-DF dataset encompasses 5,639 DeepFake videos created from 590 real YouTube videos of 59 celebrities, showcasing a substantial range of demographic diversity. The dataset mitigates common DeepFake visual artifacts—such as low resolution and color mismatches—and improves face synthesis to more closely resemble the authentic footage distributed online. By utilizing an improved synthesis method that includes modifications such as better resolution faces (256x256 pixels), color correction through training data augmentation and post-processing, improved mask generation, and temporal smoothness, the resultant DeepFake videos in Celeb-DF exhibit significantly fewer noticeable artifacts.

A thorough quantitative assessment is carried out using the Mask-SSIM score, which verifies that Celeb-DF videos possess superior visual quality compared to their counterparts in existing datasets. The paper also offers an extensive evaluation of state-of-the-art DeepFake detection methods across Celeb-DF and other datasets. Methods like Two-stream, MesoNet, Xception, and DSP-FWA are evaluated, with DSP-FWA demonstrating the highest average AUC performance among all methods surveyed.

A crucial aspect addressed is the challenge these high-quality DeepFakes present to detection techniques. Many current methods achieve high accuracy on earlier, artifact-prone datasets but struggle with Celeb-DF, underscoring the need for more sophisticated detection techniques that generalize better to real-world conditions. Specifically, the evaluation shows that the second-generation datasets, which include Celeb-DF, present a higher challenge level, leading to a sharper decline in detection performance.

The implications of this work for the field of AI and digital forensics are substantial. The introduction of Celeb-DF sets a new benchmark for evaluating DeepFake detection algorithms' robustness and effectiveness. The dataset’s quality compels future detection approaches to address challenges closer to what is encountered in practice, fostering the development of more resilient methods.

Ultimately, while Celeb-DF represents a significant step forward, future efforts should aim to further enhance dataset breadth and synthesis quality. This should include optimizing the synthesis process to reduce computational demands and incorporating anti-forensic techniques to preemptively counter measures that may be used to evade detection methods. The work presented in this paper establishes a foundational platform that is poised to significantly advance the state of DeepFake forensics research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yuezun Li (37 papers)
  2. Xin Yang (314 papers)
  3. Pu Sun (9 papers)
  4. Honggang Qi (34 papers)
  5. Siwei Lyu (125 papers)
Citations (904)