Aligned and Non-Aligned Double JPEG Detection Using Convolutional Neural Networks (1708.00930v1)

Published 2 Aug 2017 in cs.CR and cs.MM

Abstract: Due to the wide diffusion of JPEG coding standard, the image forensic community has devoted significant attention to the development of double JPEG (DJPEG) compression detectors through the years. The ability of detecting whether an image has been compressed twice provides paramount information toward image authenticity assessment. Given the trend recently gained by convolutional neural networks (CNN) in many computer vision tasks, in this paper we propose to use CNNs for aligned and non-aligned double JPEG compression detection. In particular, we explore the capability of CNNs to capture DJPEG artifacts directly from images. Results show that the proposed CNN-based detectors achieve good performance even with small size images (i.e., 64x64), outperforming state-of-the-art solutions, especially in the non-aligned case. Besides, good results are also achieved in the commonly-recognized challenging case in which the first quality factor is larger than the second one.

Authors (8)

Mauro Barni (56 papers)
Luca Bondi (15 papers)
Nicolò Bonettini (7 papers)
Paolo Bestagini (61 papers)
Andrea Costanzo (4 papers)
Marco Maggini (36 papers)
Benedetta Tondi (43 papers)
Stefano Tubaro (55 papers)

Citations (176)

View on Semantic Scholar

Summary

Analysis of Aligned and Non-Aligned Double JPEG Detection Using CNNs

The paper "Aligned and Non-Aligned Double JPEG Detection Using Convolutional Neural Networks" addresses the challenging task of detecting double JPEG (DJPEG) compression in images, both when the compression grids are aligned and when they are not. Given the ubiquity of the JPEG format, DJPEG detection plays a crucial role in image forensics, offering insights into potential image tampering and authenticity.

Detectors and Architecture

The authors propose utilizing Convolutional Neural Networks (CNNs) for DJPEG detection, exploring different approaches to harness the capabilities of CNNs. The paper investigates three distinct CNN architectures for this purpose:

Pixel Domain CNN: This architecture processes images directly in the pixel domain, leveraging a slightly deeper variation of the LeNet model to ascertain the presence of DJPEG artifacts based purely on pixel-level information. Mean subtraction is applied as a preprocessing step.
Noise Domain CNN: Building on previous work in multimedia forensics, this method employs a denoising pre-processing step followed by CNN analysis. This approach aims to expose DJPEG artifacts by removing extraneous content, enhancing the network's capability to detect subtle compression traces.
DCT Histogram CNN: Unlike the previous approaches, this method capitalizes on the DCT domain, directly computing DCT histograms within the CNN as the input feature. By embedding DCT histogram computation into the network, the method aims to capture DJPEG artifacts directly correlated to quantized DCT coefficients.

Experimental Results and Performance

The paper reports the performance of the proposed CNN-based detectors across various scenarios, comparing them against well-established techniques such as Wang et al.'s CNN approach and other model-based methods. Noteworthy results from the research include:

Aligned DJPEG Detection: The CNN approach using DCT histograms demonstrated superior performance compared to existing methods, especially for larger image patches thanks to its focus on DCT domain features.
Non-Aligned DJPEG Detection: The CNN working in the noise domain outperformed state-of-the-art techniques, marking significant improvement on smaller image patches and challenging compression scenarios where the second quality factor (QF2) is less than the first (QF1).
Robustness and Generalization: Both the pixel and noise domain CNN models exhibited a degree of robustness to variations in compression quality factors, maintaining accuracy across several untrained QF1, QF2 pairs.

Implications and Future Directions

The findings of this paper have important implications for the field of image forensics. Practical applications include improved accuracy in identifying manipulated or tampered regions in digital images—crucial for forensic analysis and authenticity validation. Furthermore, the proposed CNN methods exhibit promising adaptability to both commonly aligned and challenging non-aligned double compression scenarios, enabling their use in diverse forensic applications.

This research opens avenues for further investigations into the fusion of CNN outputs from both aligned and non-aligned detection scenarios. Future work may explore enhancing robustness against complex preprocessing attacks or anti-forensic operations. The authors briefly touch on the potential for adversarial training to improve model resistance, indicating promising directions for continued research into advanced forensic CNN architectures.

Overall, this paper contributes a nuanced understanding of DJPEG detection through convolutional models, providing experienced researchers with innovative strategies and detailed insights into the intricacies of image compression artifacts.

Related Papers

Find Related Papers