Frequency Masking for Universal Deepfake Detection (2401.06506v3)
Abstract: We study universal deepfake detection. Our goal is to detect synthetic images from a range of generative AI approaches, particularly from emerging ones which are unseen during training of the deepfake detector. Universal deepfake detection requires outstanding generalization capability. Motivated by recently proposed masked image modeling which has demonstrated excellent generalization in self-supervised pre-training, we make the first attempt to explore masked image modeling for universal deepfake detection. We study spatial and frequency domain masking in training deepfake detectors. Based on empirical analysis, we propose a novel deepfake detector via frequency masking. Our focus on frequency domain is different from the majority, which primarily target spatial domain detection. Our comparative analyses reveal substantial performance gains over existing methods. Code and models are publicly available.
- “Towards universal fake image detectors that generalize across generative models,” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 24480–24489.
- “Are GAN generated images easy to detect? A critical analysis of the state-of-the-art,” in 2021 IEEE International Conference on Multimedia and Expo, ICME 2021, Shenzhen, China, July 5-9, 2021. 2021, pp. 1–6, IEEE.
- “What makes fake images detectable? understanding properties that generalize,” in Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXVI. 2020, vol. 12371 of Lecture Notes in Computer Science, pp. 103–120, Springer.
- “A survey on generative modeling with limited data, few shots, and zero shot,” ArXiv, vol. abs/2307.14397, 2023.
- “The creation and detection of deepfakes,” ACM Computing Surveys (CSUR), vol. 54, pp. 1 – 41, 2020.
- “High-resolution image synthesis with latent diffusion models,” 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10674–10685, 2021.
- “Cnn-generated images are surprisingly easy to spot… for now,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020. 2020, pp. 8692–8701, Computer Vision Foundation / IEEE.
- “Discovering transferable forensic features for cnn-generated images detection,” in Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XV. 2022, vol. 13675 of Lecture Notes in Computer Science, pp. 671–689, Springer.
- “OST: improving generalization of deepfake detection via one-shot test-time training,” in Neural Information Processing Systems, 2022.
- “Masked autoencoders are scalable vision learners,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. 2022, pp. 15979–15988, IEEE.
- “Rethinking out-of-distribution (ood) detection: Masked image modeling is all you need,” 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11578–11589, 2023.
- “Masked frequency modeling for self-supervised visual pre-training,” in The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. 2023, OpenReview.net.
- “Masked generative adversarial networks are data-efficient generation learners,” in Neural Information Processing Systems, 2022.
- “Intriguing properties of synthetic images: from generative adversarial networks to diffusion models,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Workshops, Vancouver, BC, Canada, June 17-24, 2023. 2023, pp. 973–982, IEEE.
- “On the detection of synthetic images generated by diffusion models,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023, vol. abs/2211.00680.
- “Learning transferable visual models from natural language supervision,” in International Conference on Machine Learning, 2021.
- Chandler Timm Doloriel (1 paper)
- Ngai-Man Cheung (80 papers)