Large-scale JPEG steganalysis using hybrid deep-learning framework (1611.03233v3)

Published 10 Nov 2016 in cs.MM

Abstract: Adoption of deep learning in image steganalysis is still in its initial stage. In this paper we propose a generic hybrid deep-learning framework for JPEG steganalysis incorporating the domain knowledge behind rich steganalytic models. Our proposed framework involves two main stages. The first stage is hand-crafted, corresponding to the convolution phase and the quantization & truncation phase of the rich models. The second stage is a compound deep neural network containing multiple deep subnets in which the model parameters are learned in the training procedure. We provided experimental evidences and theoretical reflections to argue that the introduction of threshold quantizers, though disable the gradient-descent-based learning of the bottom convolution phase, is indeed cost-effective. We have conducted extensive experiments on a large-scale dataset extracted from ImageNet. The primary dataset used in our experiments contains 500,000 cover images, while our largest dataset contains five million cover images. Our experiments show that the integration of quantization and truncation into deep-learning steganalyzers do boost the detection performance by a clear margin. Furthermore, we demonstrate that our framework is insensitive to JPEG blocking artifact alterations, and the learned model can be easily transferred to a different attacking target and even a different dataset. These properties are of critical importance in practical applications.

Citations (182)

View on Semantic Scholar

Summary

The paper proposes a two-stage hybrid framework integrating hand-crafted steganalytic features with a compound deep neural network for large-scale JPEG steganalysis.
The framework achieves substantial performance gains over existing methods on large datasets, with quantization and truncation significantly boosting detection accuracy.
This hybrid approach is robust to JPEG artifacts and demonstrates improved detection without essential reliance on full gradient-descent learning, paving the way for future research like adversarial ML.

Overview of the Hybrid Deep-Learning Framework for JPEG Steganalysis

This paper presents a hybrid deep-learning framework designed for JPEG image steganalysis, which seeks to address the early adoption challenges of deep learning in this domain. The framework integrates deep neural networks with traditional steganalytic techniques to detect concealed information within JPEG images.

This innovative approach is divided into two stages. The first is hand-crafted and incorporates convolution and quantization phases inspired by rich steganalytic models. The second stage involves a compound deep neural network consisting of multiple deep-learning subnets with trained parameters.

The authors provide compelling evidence to support the introduction of threshold quantizers, noting that although these inhibit gradient-descent-based learning in the initial convolution phase, they enhance cost-effectiveness and detection performance. The framework was tested on datasets derived from ImageNet, confirming its capability to improve detection accuracy even when faced with JPEG blocking artifact alterations and diverse datasets.

Key Numerical Results and Findings

The paper reports multiple experiments involving datasets with varying numbers of cover images, scaling up to five million. The framework achieved substantial performance gains over existing steganalytic models such as DCTR, PHARM, GFR, and SCA-GFR, particularly with larger datasets. Ensemble prediction further enhanced detection accuracy by approximately 1%.

Quantization and truncation within the deep-learning framework significantly boosted detection performance. The incorporation of threshold quantizers proved advantageous, providing a more diverse representation of the steganalytic model.

Implications and Future Directions

This framework bridges the gap between hand-crafted steganalytic features and deep learning, allowing for improved detection accuracy without essential reliance on gradient-descent-learning for every model component. It demonstrates the potential for more effective steganalytic detection with hybrid techniques, challenging the necessity of full backpropagation learning.

The insensitivity of the framework to JPEG blocking artifact alterations addresses practical applications, ensuring its robustness in real-world scenarios where image preprocessing may vary.

Future research could explore the integration of adversarial machine learning to make the model more resilient against evolving steganographic techniques. Additionally, adapting the framework for broader applications in multimedia forensics could provide valuable insights and methodologies applicable across various domains.

This paper represents a significant step towards incorporating domain knowledge into deep-learning models for specific technical applications, setting a precedent for future advancements in steganalysis and potentially other fields requiring high detection fidelity. The careful design of the hybrid framework highlights the importance of balancing handcrafted features with deep learning adaptability, a strategy that could be effectively employed in other research areas to exploit complementary strengths.

Large-scale JPEG steganalysis using hybrid deep-learning framework (1611.03233v3)

Summary

Overview of the Hybrid Deep-Learning Framework for JPEG Steganalysis

Key Numerical Results and Findings

Implications and Future Directions

Related Papers