Robust and Undetectable White-Box Watermarks for Deep Neural Networks

10/31/2019
by   Tianhao Wang, et al.
0

Training deep neural networks (DNN) is expensive in terms of computational power and the amount of necessary labeled training data. Thus, deep learning models constitute business value for data owners. Watermarking of deep neural networks can enable their tracing once released by a data owner. In this paper we define and formalize white-box watermarking algorithms for DNNs, where the data owner needs white-box access to the model to extract the watermark. White-box watermarking algorithms have the advantage that they do not impact the accuracy of the watermarked model. We demonstrate a new property inference attack using a DNN that can detect watermarking by any existing, manually designed algorithms regardless of training dataset and model architecture. We then propose the first white-box DNN watermarking algorithm that is undetectable by the property inference attack. We further extend the capacity and robustness of the watermark. Unlike prior watermarking schemes which restrict the content of watermark message to short binary strings, our new scheme largely increase the capacity and flexibility of the embedded watermark message. Experiments show that our new white-box watermarking algorithm does not impact accuracy, is undetectable and robust against moderate model transformation attacks.

READ FULL TEXT
research
12/28/2021

Fostering the Robustness of White-Box Deep Neural Network Watermarks by Neuron Alignment

The wide application of deep learning techniques is boosting the regulat...
research
10/27/2022

DICTION: DynamIC robusT whIte bOx watermarkiNg scheme

Deep neural network (DNN) watermarking is a suitable method for protecti...
research
08/23/2022

Robust DNN Watermarking via Fixed Embedding Weights with Optimized Distribution

Watermarking has been proposed as a way to protect the Intellectual Prop...
research
02/13/2018

Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring

Deep Neural Networks have recently gained lots of success after enabling...
research
07/15/2021

Subnet Replacement: Deployment-stage backdoor attack against deep neural networks in gray-box setting

We study the realistic potential of conducting backdoor attack against d...
research
03/09/2021

Robust Black-box Watermarking for Deep NeuralNetwork using Inverse Document Frequency

Deep learning techniques are one of the most significant elements of any...
research
07/16/2019

Graph Interpolating Activation Improves Both Natural and Robust Accuracies in Data-Efficient Deep Learning

Improving the accuracy and robustness of deep neural nets (DNNs) and ada...

Please sign up or login with your details

Forgot password? Click here to reset