Backdoor Watermarking Deep Learning Classification Models With Deep Fidelity

08/01/2022
by   Guang Hua, et al.
0

Backdoor Watermarking is a promising paradigm to protect the copyright of deep neural network (DNN) models for classification tasks. In the existing works on this subject, researchers have intensively focused on watermarking robustness, while fidelity, which is concerned with the original functionality, has received less attention. In this paper, we show that the existing shared notion of the sole measurement of learning accuracy is insufficient to characterize backdoor fidelity. Meanwhile, we show that the analogous concept of embedding distortion in multimedia watermarking, interpreted as the total weight loss (TWL) in DNN backdoor watermarking, is also unsuitable to measure the fidelity. To solve this problem, we propose the concept of deep fidelity, which states that the backdoor watermarked DNN model should preserve both the feature representation and decision boundary of the unwatermarked host model. Accordingly, to realize deep fidelity, we propose two loss functions termed as penultimate feature loss (PFL) and softmax probability-distribution loss (SPL) to preserve feature representation, while the decision boundary is preserved by the proposed fix last layer (FixLL) treatment, inspired by the recent discovery that deep learning with a fixed classifier causes no loss of learning accuracy. With the above designs, both embedding from scratch and fine-tuning strategies are implemented to evaluate deep fidelity of backdoor embedding, whose advantages over the existing methods are verified via experiments using ResNet18 for MNIST and CIFAR-10 classifications, and wide residual network (i.e., WRN28_10) for CIFAR-100 task.

READ FULL TEXT

page 1

page 8

page 11

research
10/28/2019

IPGuard: Protecting the Intellectual Property of Deep Neural Networks via Fingerprinting the Classification Boundary

A deep neural network (DNN) classifier represents a model owner's intell...
research
03/15/2018

Large Margin Deep Networks for Classification

We present a formulation of deep learning that aims at producing a large...
research
02/11/2020

Population-Based Training for Loss Function Optimization

Metalearning of deep neural network (DNN) architectures and hyperparamet...
research
10/22/2017

Deep Triphone Embedding Improves Phoneme Recognition

In this paper, we present a novel Deep Triphone Embedding (DTE) represen...
research
08/10/2022

Customized Watermarking for Deep Neural Networks via Label Distribution Perturbation

With the increasing application value of machine learning, the intellect...
research
11/18/2020

Vector Embeddings with Subvector Permutation Invariance using a Triplet Enhanced Autoencoder

The use of deep neural network (DNN) autoencoders (AEs) has recently exp...
research
11/13/2018

Co-Representation Learning For Classification and Novel Class Detection via Deep Networks

Deep Neural Network (DNN) has been largely demonstrated to be effective ...

Please sign up or login with your details

Forgot password? Click here to reset