DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis (2403.18471v1)
Abstract: The rapid progress in deep learning has given rise to hyper-realistic facial forgery methods, leading to concerns related to misinformation and security risks. Existing face forgery datasets have limitations in generating high-quality facial images and addressing the challenges posed by evolving generative techniques. To combat this, we present DiffusionFace, the first diffusion-based face forgery dataset, covering various forgery categories, including unconditional and Text Guide facial image generation, Img2Img, Inpaint, and Diffusion-based facial exchange algorithms. Our DiffusionFace dataset stands out with its extensive collection of 11 diffusion models and the high-quality of the generated images, providing essential metadata and a real-world internet-sourced forgery facial image dataset for evaluation. Additionally, we provide an in-depth analysis of the data and introduce practical evaluation protocols to rigorously assess discriminative models' effectiveness in detecting counterfeit facial images, aiming to enhance security in facial image authentication processes. The dataset is available for download at \url{https://github.com/Rapisurazurite/DiffFace}.
- Faceswap. https://github.com/MarekKowalski/FaceSwap/.
- Fakeapp. https://www.fakeapp.com/, 2018.
- faceswap-gan. https://github.com/shaoanlu/faceswap-GAN, 2019.
- Parents and children: Distinguishing multimodal deepfakes from natural images. arXiv preprint arXiv:2304.00500, 2023.
- Cifake: Image classification and explainable identification of ai-generated synthetic images. arXiv preprint arXiv:2303.14126, 2023.
- End-to-end reconstruction-classification learning for face forgery detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4113–4122, 2022.
- Discovering transferable forensic features for cnn-generated images detection. In European Conference on Computer Vision, pages 671–689. Springer, 2022.
- Perception prioritized training of diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11472–11481, 2022.
- On the detection of digital face manipulation. In CVPR, pages 5781–5790, 2020.
- Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34:8780–8794, 2021.
- The deepfake detection challenge (dfdc) preview dataset. arXiv preprint arXiv:1910.08854, 2019.
- Generative adversarial nets. Advances in neural information processing systems, 27, 2014.
- Forgerynet: A versatile benchmark for comprehensive forgery analysis. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4360–4369, 2021.
- Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
- Deeperforensics-1.0: A large-scale dataset for real-world face forgery detection. In CVPR, 2020.
- Davis E King. Dlib-ml: A machine learning toolkit. The Journal of Machine Learning Research, 10:1755–1758, 2009.
- Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
- Deepfakes: a new threat to face recognition? assessment and detection. arXiv preprint arXiv:1812.08685, 2018.
- Convolutional deep belief networks on cifar-10. Unpublished manuscript, 40(7):1–9, 2010.
- Celeb-df: A large-scale challenging dataset for deepfake forensics. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 3207–3216, 2020.
- Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740–755. Springer, 2014.
- Pseudo numerical methods for diffusion models on manifolds. In International Conference on Learning Representations, 2022.
- Deep learning face attributes in the wild. In Proceedings of International Conference on Computer Vision (ICCV), 2015.
- Global texture enhancement for fake face detection in the wild. In CVPR, 2020.
- Generalizing face forgery detection with high-frequency features. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16317–16326, 2021.
- Sdedit: Guided image synthesis and editing with stochastic differential equations. arXiv preprint arXiv:2108.01073, 2021.
- Towards universal fake image detectors that generalize across generative models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 24480–24489, 2023.
- Sdd-fiqa: unsupervised face image quality assessment with similarity distribution distance. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 7670–7679, 2021.
- Deepfacelab: A simple, flexible and extensible face swapping framework. arXiv preprint arXiv:2005.05535, 2020.
- Thinking in frequency: Face forgery detection by mining frequency-aware clues. In ECCV, 2020.
- Towards the detection of diffusion model deepfakes. arXiv preprint arXiv:2210.14571, 2022.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10684–10695, 2022.
- Faceforensics++: Learning to detect manipulated facial images. In Proceedings of the IEEE/CVF international conference on computer vision, pages 1–11, 2019.
- De-fake: Detection and attribution of fake images generated by text-to-image diffusion models. arXiv preprint arXiv:2210.06998, 2022.
- Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. PMLR, 2015.
- Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020.
- An information theoretic approach for attention-driven face forgery detection. In European Conference on Computer Vision, pages 111–127. Springer, 2022a.
- Dual contrastive learning for general face forgery detection. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 2316–2324, 2022b.
- Cnn-generated images are surprisingly easy to spot… for now. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8695–8704, 2020.
- Dire for diffusion-generated image detection. arXiv preprint arXiv:2303.09295, 2023.
- Tedigan: Text-guided diverse face image generation and manipulation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
- Exposing deep fakes using inconsistent head poses. In ICASSP, 2019.
- Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365, 2015.
- Multi-attentional deepfake detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2185–2194, 2021.
- Diffswap: High-fidelity and controllable face swapping via 3d-aware masked diffusion. CVPR, 2023.
- Genimage: A million-scale benchmark for detecting ai-generated image. arXiv preprint arXiv:2306.08571, 2023.
- Wilddeepfake: A challenging real-world dataset for deepfake detection. In Proceedings of the 28th ACM international conference on multimedia, pages 2382–2390, 2020.
- Zhongxi Chen (4 papers)
- Ke Sun (136 papers)
- Ziyin Zhou (8 papers)
- Xianming Lin (11 papers)
- Xiaoshuai Sun (91 papers)
- Liujuan Cao (73 papers)
- Rongrong Ji (315 papers)