Persistent and Unforgeable Watermarks for Deep Neural Networks

10/02/2019
by   Huiying Li, et al.
0

As deep learning classifiers continue to mature, model providers with sufficient data and computation resources are exploring approaches to monetize the development of increasingly powerful models. Licensing models is a promising approach, but requires a robust tool for owners to claim ownership of models, i.e. a watermark. Unfortunately, current watermarks are all vulnerable to piracy attacks, where attackers embed forged watermarks into a model to dispute ownership. We believe properties of persistence and piracy resistance are critical to watermarks, but are fundamentally at odds with the current way models are trained and tuned. In this work, we propose two new training techniques (out-of-bound values and null-embedding) that provide persistence and limit the training of certain inputs into trained models. We then introduce "wonder filters", a new primitive that embeds a persistent bit-sequence into a model, but only at initial training time. Wonder filters enable model owners to embed a bit-sequence generated from their private keys into a model at training time. Attackers cannot remove wonder filters via tuning, and cannot add their own filters to pretrained models. We provide analytical proofs of key properties, and experimentally validate them over a variety of tasks and models. Finally, we explore a number of adaptive counter-measures, and show our watermark remains robust.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2018

Digital Watermarking for Deep Neural Networks

Although deep neural networks have made tremendous progress in the area ...
research
03/05/2021

Don't Forget to Sign the Gradients!

Engineering a top-notch deep learning model is an expensive procedure th...
research
01/15/2017

Embedding Watermarks into Deep Neural Networks

Deep neural networks have recently achieved significant progress. Sharin...
research
09/30/2018

Master of Web Puppets: Abusing Web Browsers for Persistent and Stealthy Computation

The proliferation of web applications has essentially transformed modern...
research
11/06/2018

MixTrain: Scalable Training of Verifiably Robust Neural Networks

Making neural networks robust against adversarial inputs has resulted in...
research
09/01/2021

Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks

Deep neural networks have become the default choice for many application...

Please sign up or login with your details

Forgot password? Click here to reset