- The paper introduces a method to embed digital watermarks into DNN parameters, safeguarding intellectual property while maintaining high model accuracy.
- It leverages a parameter regularizer during training, fine-tuning, and distillation stages to achieve robust watermark integration, validated on datasets like CIFAR-10 and Caltech-101.
- Experimental results demonstrate that the random embedding method effectively preserves model performance, even with significant network pruning, highlighting its practical utility.
Embedding Watermarks into Deep Neural Networks
The paper "Embedding Watermarks into Deep Neural Networks" presents a novel approach to embedding watermarks within the parameters of deep neural networks (DNNs). The objective is to safeguard the intellectual property rights of trained models, which are increasingly shared within the machine learning community. The research proposes a framework for embedding watermarks without degrading model performance, whether the embedding happens during training from scratch, during fine-tuning, or during model distillation.
Problem Formulation and Proposed Framework
The authors motivate the problem by explaining why DNNs need watermarking at all, contrasting them with traditional digital content such as images or audio. They establish five requirements for effective watermarking: fidelity, robustness, capacity, security, and efficiency. The central idea is to embed a digital watermark, represented as a binary vector, into the network parameters during one of three stages: training from scratch, fine-tuning a pre-trained model, or distilling a model without access to the original training labels.
The paper proposes a parameter regularizer that implants the watermark while the network trains on its primary task, so the watermark coexists with the model's main function without compromising it. The approach exploits the inherent over-parameterization of DNNs: the regularizer steers the parameters toward local minima that encode the watermark yet retain task performance. Three types of embedding parameters are compared (direct, diff, and random), with experimental results favoring the random method for its minimal impact on both the parameter distribution and task performance.
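The regularizer can be sketched in a few lines of numpy. This is a toy stand-in, not the paper's exact configuration: the sizes `T` and `M`, the training loop, and the decision to descend the regularizer alone (rather than adding it, scaled by a coefficient, to a real task loss) are all illustrative assumptions. The "random" scheme draws a secret matrix `X` and penalizes the binary cross-entropy between `sigmoid(X @ w)` and the watermark bits:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions): T watermark bits embedded into a
# flattened host-parameter vector of length M.
T, M = 256, 576
X = rng.standard_normal((T, M))               # secret embedding matrix ("random" scheme)
b = rng.integers(0, 2, size=T).astype(float)  # the T-bit watermark

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def embedding_regularizer(w):
    """Binary cross-entropy between sigmoid(Xw) and the watermark bits.
    During real training this term would be scaled and added to the task loss."""
    y = sigmoid(X @ w)
    eps = 1e-12
    return -np.mean(b * np.log(y + eps) + (1 - b) * np.log(1 - y + eps))

# Gradient descent on the regularizer alone, to show the watermark
# being carved into w (a stand-in for SGD on task loss + regularizer).
w = np.zeros(M)
for _ in range(300):
    w -= 0.5 * (X.T @ (sigmoid(X @ w) - b) / T)

# Extraction: threshold sigmoid(Xw) at 0.5, i.e. take the sign of Xw.
extracted = (X @ w > 0).astype(float)
print("bit error rate:", np.mean(extracted != b))
```

Because the loss is convex in `w` and `T < M`, gradient descent drives the extraction bit error rate to zero in this sketch; in the paper the same projection-and-threshold step recovers the watermark from a trained network's weights.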
Experimental Validation
In extensive experimental evaluations on datasets such as CIFAR-10 and Caltech-101, the proposed framework embedded 256-bit watermarks without measurable loss in task performance. The embedded watermarks were robust to common model modifications: they survived fine-tuning, and they remained detectable even after 65% of the network parameters were pruned.
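The pruning result can be reproduced in miniature. The sketch below (sizes, seed, and training loop are assumptions, as before) embeds a watermark, zeroes out the 65% smallest-magnitude weights to mimic magnitude-based pruning, and re-extracts the bits; because the watermark is spread across many parameters, typically most or all bits survive:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy robustness check (illustrative sizes, not the paper's setup).
T, M = 256, 576
X = rng.standard_normal((T, M))
b = rng.integers(0, 2, size=T).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Embed the watermark by descending the regularizer (stand-in for training).
w = np.zeros(M)
for _ in range(300):
    w -= 0.5 * (X.T @ (sigmoid(X @ w) - b) / T)

# Magnitude pruning: zero the 65% smallest-magnitude weights.
threshold = np.quantile(np.abs(w), 0.65)
w_pruned = np.where(np.abs(w) >= threshold, w, 0.0)

ber = np.mean((X @ w_pruned > 0) != (b > 0.5))
print("bit error rate after 65% pruning:", ber)
```

The intuition matches the paper's: pruning removes the small-magnitude weights, which contribute least to the projections `X @ w`, so the signs that carry the watermark bits are largely preserved.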
The paper also explores various embedding settings, analyzing the trade-off between watermark size and the capacity of the candidate host layers. When the number of watermark bits approaches or exceeds the number of host parameters, embedding degrades; the authors suggest that a more complex, multi-layer perceptron in the regularizer could raise this capacity.
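The capacity limit is easy to see in the toy model. Reusing the same illustrative embed-and-extract loop (all sizes are assumptions), embedding far fewer bits than host parameters succeeds cleanly, while embedding many more bits than parameters leaves a substantial residual bit error rate, since the linear projection cannot satisfy all the sign constraints:

```python
import numpy as np

def embed_and_extract(T, M, steps=500, lr=0.5, seed=2):
    """Embed T random bits into an M-dim host vector via the logistic
    regularizer, then return the extraction bit error rate."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((T, M))
    b = rng.integers(0, 2, size=T).astype(float)
    w = np.zeros(M)
    for _ in range(steps):
        y = 1.0 / (1.0 + np.exp(-(X @ w)))
        w -= lr * (X.T @ (y - b) / T)
    return float(np.mean((X @ w > 0) != (b > 0.5)))

print("BER, 32 bits in 64 params: ", embed_and_extract(32, 64))   # under capacity
print("BER, 512 bits in 64 params:", embed_and_extract(512, 64))  # over capacity
```

A richer (e.g. multi-layer) mapping in the regularizer, as the paper suggests, is one way to relax this linear-capacity constraint.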
Implications and Future Work
The integration of watermarking into DNNs has both theoretical and practical implications. Theoretically, it opens avenues for further research into enhancing robustness against sophisticated attacks such as network morphism or deliberate watermark overwriting. Practically, it facilitates safer sharing of neural network models by providing an additional layer of intellectual property protection. This is especially relevant as the commercialization and distribution of AI models grow.
Future research directions involve resisting deliberate watermark overwriting, refining embedding strategies under model compression, and hardening the embedded watermark against detection by steganalysis. Additionally, exploring complementary techniques such as neural-network fingerprinting could offer more comprehensive solutions for model protection.
The paper sets a foundational precedent for watermarking within machine learning models, empowering owners with tools for legal recourse in case of intellectual property infringement while keeping models fully functional and distributable.