Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection (2403.01786v1)

Published 4 Mar 2024 in cs.CV, cs.IT, and math.IT

Abstract: Deepfake technology has given rise to a spectrum of novel and compelling applications. Unfortunately, the widespread proliferation of high-fidelity fake videos has led to pervasive confusion and deception, shattering our faith that seeing is believing. One aspect that has been overlooked so far is that current deepfake detection approaches may easily fall into the trap of overfitting, focusing only on forgery clues within one or a few local regions. Moreover, existing works heavily rely on neural networks to extract forgery features, lacking theoretical constraints guaranteeing that sufficient forgery clues are extracted and superfluous features are eliminated. These deficiencies culminate in unsatisfactory accuracy and limited generalizability in real-life scenarios. In this paper, we try to tackle these challenges through three designs: (1) We present a novel framework to capture broader forgery clues by extracting multiple non-overlapping local representations and fusing them into a global semantic-rich feature. (2) Based on the information bottleneck theory, we derive Local Information Loss to guarantee the orthogonality of local representations while preserving comprehensive task-relevant information. (3) Further, to fuse the local representations and remove task-irrelevant information, we arrive at a Global Information Loss through the theoretical analysis of mutual information. Empirically, our method achieves state-of-the-art performance on five benchmark datasets.Our code is available at \url{https://github.com/QingyuLiu/Exposing-the-Deception}, hoping to inspire researchers.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (61)
  1. Mesonet: a compact facial video forgery detection network. In 2018 IEEE international workshop on information forensics and security (WIFS), 1–7. IEEE.
  2. Transferring Audio Deepfake Detection Capability across Languages. In Proceedings of the ACM Web Conference 2023, 2033–2044.
  3. AUNet: Learning Relations Between Action Units for Face Forgery Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 24709–24719.
  4. End-to-end reconstruction-classification learning for face forgery detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4113–4122.
  5. Self-supervised learning of adversarial example: Towards good generalizations for deepfake detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 18710–18719.
  6. Ost: Improving generalization of deepfake detection via one-shot test-time training. Advances in Neural Information Processing Systems, 35: 24597–24610.
  7. Zero-shot multi-view indoor localization via graph location networks. In Proceedings of the 28th ACM International Conference on Multimedia, 3431–3440.
  8. Not made for each other-audio-visual dissonance-based deepfake detection and localization. In Proceedings of the 28th ACM international conference on multimedia, 439–447.
  9. Retinaface: Single-shot multi-level face localisation in the wild. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5203–5212.
  10. The deepfake detection challenge (dfdc) dataset. arXiv preprint arXiv:2006.07397.
  11. Explaining deepfake detection by analysing image matching. In European Conference on Computer Vision, 18–35. Springer.
  12. Protecting celebrities from deepfake with identity consistency transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9468–9478.
  13. Leveraging frequency analysis for deep fake image recognition. In International conference on machine learning, 3247–3258. PMLR.
  14. Deepfakeucl: Deepfake detection via unsupervised contrastive learning. In 2021 international joint conference on neural networks (IJCNN), 1–8. IEEE.
  15. Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XII, 596–613. Springer.
  16. Region-aware temporal inconsistency learning for deepfake video detection. In Proceedings of the 31th International Joint Conference on Artificial Intelligence, volume 1.
  17. Lips don’t lie: A generalisable and robust approach to face forgery detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5039–5049.
  18. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778.
  19. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.
  20. Finfer: Frame inference-based deepfake detection for high-visual-quality videos. In Proceedings of the AAAI conference on artificial intelligence, volume 36, 951–959.
  21. Learning Patch-Channel Correspondence for Interpretable Face Forgery Detection. IEEE Transactions on Image Processing, 32: 1668–1680.
  22. Implicit Identity Driven Deepfake Face Swapping Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4490–4499.
  23. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  24. Wavelet-enhanced Weakly Supervised Local Feature Learning for Face Forgery Detection. In Proceedings of the 30th ACM International Conference on Multimedia, 1299–1308.
  25. Faceshifter: Towards high fidelity and occlusion aware face swapping. arXiv preprint arXiv:1912.13457.
  26. Face x-ray for more general face forgery detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5001–5010.
  27. Exposing deepfake videos by detecting face warping artifacts. arXiv preprint arXiv:1811.00656.
  28. Celeb-df: A large-scale challenging dataset for deepfake forensics. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 3207–3216.
  29. Auxiliary tasks in multi-task learning. arXiv preprint arXiv:1805.06334.
  30. Spatial-phase shallow learning: rethinking face forgery detection in frequency domain. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 772–781.
  31. Rethinking smart contract fuzzing: Fuzzing with invocation ordering and important branch revisiting. IEEE Transactions on Information Forensics and Security, 18: 1237–1251.
  32. Copy Motion From One to Another: Fake Motion Video Generation. arXiv preprint arXiv:2205.01373.
  33. Sgdr: Stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983.
  34. Generalizing face forgery detection with high-frequency features. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 16317–16326.
  35. Two-branch recurrent network for isolating deepfakes in videos. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VII 16, 667–684. Springer.
  36. McGill, W. 1954. Multivariate information transmission. Transactions of the IRE Professional Group on Information Theory, 4(4): 93–111.
  37. Fsgan: Subject agnostic face swapping and reenactment. In Proceedings of the IEEE/CVF international conference on computer vision, 7184–7193.
  38. DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant Forgery Clues. In Proceedings of the 31st ACM International Conference on Multimedia, 8035–8046.
  39. On variational bounds of mutual information. In International Conference on Machine Learning, 5171–5180. PMLR.
  40. Thinking in frequency: Face forgery detection by mining frequency-aware clues. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII, 86–103. Springer.
  41. Faceforensics++: Learning to detect manipulated facial images. In Proceedings of the IEEE/CVF international conference on computer vision, 1–11.
  42. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision, 618–626.
  43. Detecting deepfakes with self-blended images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 18720–18729.
  44. Locate and Verify: A Two-Stream Network for Improved Deepfake Detection. In Proceedings of the 31st ACM International Conference on Multimedia, 7131–7142.
  45. Relation Triplet Construction for Cross-modal Text-to-Video Retrieval. In Proceedings of the 31st ACM International Conference on Multimedia, 4759–4767.
  46. Dual contrastive learning for general face forgery detection. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, 2316–2324.
  47. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, 6105–6114. PMLR.
  48. Farewell to mutual information: Variational distillation for cross-modal person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1522–1531.
  49. The information bottleneck method. arXiv preprint physics/0004057.
  50. LiSiam: Localization invariance Siamese network for deepfake detection. IEEE Transactions on Information Forensics and Security, 17: 2425–2436.
  51. AltFreezing for More General Video Face Forgery Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4129–4138.
  52. Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 16221–16231.
  53. Generalizing Face Forgery Detection via Uncertainty Learning. In Proceedings of the 31st ACM International Conference on Multimedia, 1759–1767.
  54. Wyner, A. D. 1978. A definition of conditional mutual information for arbitrary ensembles. Information and Control, 38(1): 51–59.
  55. Bootstrapping multi-view representations for fake news detection. In Proceedings of the AAAI conference on Artificial Intelligence, volume 37, 5384–5392.
  56. Detecting Deepfake Videos with Temporal Dropout 3DCNN. In IJCAI, 1288–1294.
  57. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412.
  58. Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks. In 2024 IEEE Symposium on Security and Privacy (SP), 53–53. IEEE Computer Society.
  59. Multi-attentional deepfake detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2185–2194.
  60. Learning self-consistency for deepfake detection. In Proceedings of the IEEE/CVF international conference on computer vision, 15023–15033.
  61. Exploring temporal coherence for more general video face forgery detection. In Proceedings of the IEEE/CVF international conference on computer vision, 15044–15054.
Citations (19)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com