Detecting GAN Generated Fake Images using Co-occurrence Matrices: A Technical Evaluation and Discussion
The paper "Detecting GAN generated Fake Images using Co-occurrence Matrices" addresses the increasingly relevant problem of identifying images synthesized by Generative Adversarial Networks (GANs). GANs have rapidly advanced the capability to produce hyper-realistic images, making it difficult to distinguish authentic from manipulated digital content. This capability underpins applications such as DeepFakes and image-to-image translation, which serve both as innovative tools and as potential sources of disinformation. The authors propose a novel detection methodology that combines co-occurrence matrices with deep learning and reportedly achieves high accuracy in distinguishing GAN-generated images from real ones.
Methodology
The method leverages co-occurrence matrices to analyze pixel-level statistics in GAN-generated images. Co-occurrence matrices are extracted from each of the RGB color channels in the pixel domain and serve as input to a deep convolutional neural network (CNN). This approach is rooted in the principles of steganalysis, which historically aims to detect data hidden within digital media by analyzing image statistics. Unlike traditional methods that extract features from image residuals or filtered images, this process operates directly on pixel co-occurrence matrices.
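As an illustration of the feature-extraction step, the sketch below computes a co-occurrence matrix for one image channel and stacks one matrix per RGB channel into a three-channel tensor of the kind the paper feeds to its CNN. This is a minimal NumPy sketch: the function names and the choice of a single horizontal pixel offset are illustrative assumptions, not the paper's exact configuration.

```python
import numpy as np

def cooccurrence_matrix(channel, levels=256):
    """Count how often intensity i is immediately followed by intensity j
    along the horizontal direction (spatial offset (0, 1))."""
    a = channel[:, :-1].ravel()   # left pixel of each horizontal pair
    b = channel[:, 1:].ravel()    # right pixel of each horizontal pair
    m = np.zeros((levels, levels), dtype=np.int64)
    np.add.at(m, (a, b), 1)       # accumulate one count per pixel pair
    return m

def rgb_cooccurrence_tensor(image):
    """Stack one co-occurrence matrix per color channel: shape (levels, levels, 3)."""
    return np.stack(
        [cooccurrence_matrix(image[..., c]) for c in range(3)], axis=-1
    )
```

The resulting 256x256x3 tensor has the shape of an ordinary color image, which is what allows a standard CNN architecture to consume it directly.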
Experimental Results
The authors evaluate their method on diverse GAN-generated datasets drawn from unpaired image-to-image translation (CycleGAN) and facial attribute manipulation (StarGAN), involving more than 56,000 images. Notably, the proposed technique achieves exceptionally high classification accuracy, greater than 99% on both datasets. The approach also demonstrates robust generalization: a model trained on one dataset performs well when tested on the other, maintaining strong classification performance.
Cross-Dataset Generalizability
The generalizability tests reveal the method's efficacy in broader settings, maintaining high accuracy across distinct datasets. Specifically, the model trained on the CycleGAN dataset achieved 99.45% accuracy when tested on the StarGAN dataset, suggesting that training captured salient features applicable across diverse GAN frameworks. In the reverse direction, however, accuracy dropped to 93.42%, indicating potential challenges when a StarGAN-trained model encounters the greater variation inherent to the CycleGAN data.
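The cross-dataset protocol described above amounts to scoring a classifier trained on one dataset against the labels of another. A minimal sketch of that evaluation step is below; the toy mean-intensity classifier is purely hypothetical (the paper's classifier is a trained CNN) and stands in only so the evaluation loop is runnable.

```python
import numpy as np

def evaluate(classify, images, labels):
    """Accuracy of a binary real/fake classifier on a held-out dataset."""
    preds = np.array([classify(img) for img in images])
    return float(np.mean(preds == np.asarray(labels)))

# Hypothetical stand-in classifier: flags bright images as "fake" (label 1).
# In the paper's setting this would be the CNN trained on the other dataset.
toy_classify = lambda img: int(img.mean() > 127)

held_out_images = [np.full((4, 4), 200, dtype=np.uint8),
                   np.full((4, 4), 50, dtype=np.uint8)]
held_out_labels = [1, 0]
acc = evaluate(toy_classify, held_out_images, held_out_labels)  # → 1.0
```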
Comparative Analysis
The paper's findings are compared against other contemporary methods, including deep learning-based approaches and traditional features used in image forensics. The methodology outperformed state-of-the-art techniques on uncompressed images from the CycleGAN dataset. However, its efficacy decreased somewhat on JPEG-compressed images, a commonplace scenario for images shared on social platforms.
Implications and Future Directions
The paper illustrates the potential of co-occurrence matrices combined with CNNs as a potent tool against GAN-based image manipulation. The method has practical implications for fields where digital image integrity is crucial, such as journalism, social media moderation, and digital forensics. At a theoretical level, the paper is a compelling example of cross-disciplinary integration, merging ideas from image forensics, machine learning, and computer vision.
Looking ahead, the research lays groundwork for improving robustness on compressed image datasets and for extending the model to localize manipulated regions within images. Further work may explore other families of GANs and consider real-world constraints such as image post-processing and occlusions.
In conclusion, this paper contributes significantly to the detection of GAN-generated imagery, offering compelling evidence of the utility of co-occurrence matrices in conjunction with deep learning. Through rigorous experimentation and comprehensive evaluation, it underlines a promising direction for enhancing the robustness of image forensics against sophisticated generative models.