Image Splicing Localization Using A Multi-Task Fully Convolutional Network (MFCN) (1709.02016v1)

Published 6 Sep 2017 in cs.CV

Abstract: In this work, we propose a technique that utilizes a fully convolutional network (FCN) to localize image splicing attacks. We first evaluated a single-task FCN (SFCN) trained only on the surface label. Although the SFCN is shown to provide superior performance over existing methods, it still provides a coarse localization output in certain cases. Therefore, we propose the use of a multi-task FCN (MFCN) that utilizes two output branches for multi-task learning. One branch is used to learn the surface label, while the other branch is used to learn the edge or boundary of the spliced region. We trained the networks using the CASIA v2.0 dataset, and tested the trained models on the CASIA v1.0, Columbia Uncompressed, Carvalho, and the DARPA/NIST Nimble Challenge 2016 SCI datasets. Experiments show that the SFCN and MFCN outperform existing splicing localization algorithms, and that the MFCN can achieve finer localization than the SFCN.

Citations (306)

View on Semantic Scholar

Summary

The paper introduces a multi-task FCN that uses dual-output branches to enhance pixel-level splicing detection accuracy.
The method achieves superior performance with improved F1 scores and MCC metrics compared to traditional splicing localization techniques.
The study validates the MFCN's robustness against post-processing effects like JPEG compression, Gaussian blurring, and noise.

Image Splicing Localization Using a Multi-Task Fully Convolutional Network (MFCN)

This paper offers a detailed exploration of a novel approach for image splicing localization through the use of a Multi-Task Fully Convolutional Network (MFCN). The researchers presented an advanced method to address the critical issue of image forgery detection, specifically focusing on image splicing, a type of manipulation where regions from different images are combined.

Methodology Overview

The foundation of the proposed method lies in the application of a fully convolutional network (FCN). The paper initially evaluates a Single-Task FCN (SFCN) specifically trained on the surface label to classify each pixel as either spliced or authentic. Although the SFCN demonstrates significant improvements over existing algorithms, some limitations exist in terms of localization granularity.

To address these limitations, the authors propose the use of a Multi-Task FCN (MFCN). This architecture entails two output branches: one for learning the surface label and another for the edge or boundary of the spliced region. By adopting a multi-task learning approach, the MFCN can achieve finer localization than the SFCN alone. The network is trained using a comprehensive dataset (CASIA v2.0) and tested on multiple benchmarks such as CASIA v1.0, Columbia Uncompressed, Carvalho, and the DARPA/NIST Nimble Challenge 2016 SCI datasets.

Key Results and Implications

Empirical results affirm that both SFCN and MFCN outperform a wide range of existing splicing localization algorithms. The multi-task approach of MFCN, particularly with the edge-enhanced inference method, achieved superior performance in terms of pixel-level localization metrics, specifically $F_1$ scores and Matthews Correlation Coefficient (MCC).

Performance Metrics:

The MFCN displayed enhanced localization capabilities over the SFCN and other conventional methods.
The edge-enhanced inference technique within the MFCN provided the most robust results, achieving finer boundary identification and noise resilience.

The paper also examines the resilience of the proposed methods against post-processing operations such as JPEG compression, Gaussian blurring, and additive noise. Notably, the MFCN's performance was minimally impacted by such manipulations, maintaining its advantage over traditional algorithms, which often show substantial performance degradation.

Future Directions

The introduction of MFCN with dual output branches emphasizes a promising direction for enhancing image forgery detection's spatial precision. The application of this method can be extended to other forms of image manipulations beyond splicing. Future research might explore extending multi-task learning frameworks to incorporate even more nuanced classification tasks or integrate with cross-domain datasets to build generalizable models.

The advancements presented in this paper highlight the potential in leveraging deep learning architectures like FCNs for complex image analysis problems, inviting further investigation into optimizing such networks for dynamic and varied real-world applications in digital forensics and image integrity assurance.

PDF Markdown