MixTrain: Scalable Training of Formally Robust Neural Networks

11/06/2018
by Shiqi Wang, et al.

There is an arms race to defend neural networks against adversarial examples. The two most promising defenses are adversarially robust training and verifiably robust training. Adversarially robust training scales well but cannot provide a provable guarantee that no attack exists. We present an Interval Attack that reveals fundamental problems with the threat model used by adversarially robust training. Verifiably robust training, by contrast, achieves sound guarantees, but it is computationally expensive and sacrifices accuracy, which prevents it from being applied in practice. In this paper, we propose two novel techniques for verifiably robust training, stochastic output approximation and dynamic mixed training, to address these challenges. They are based on two critical insights: (1) soundness is only needed on a subset of the training data; and (2) beyond a certain point in verifiably robust training, verifiable robustness and test accuracy conflict with each other. On both the MNIST and CIFAR datasets, we achieve similar test accuracy and estimated robust accuracy against PGD attacks with 14× less training time than state-of-the-art adversarially robust training techniques, and obtain up to 95.2% verified robust accuracy as a bonus. Moreover, to reach similar verified robust accuracy, we save up to 5× computation time and offer a 9.2% accuracy improvement compared to current state-of-the-art verifiably robust training techniques.
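Below is a minimal sketch, in PyTorch, of how the two ideas described above could be combined in a single training step: a verified-robust loss (here, a simple interval bound propagation through a small fully connected network) computed on a randomly sampled fraction of each batch, blended with the natural cross-entropy loss by a dynamically scheduled weight. The network, the interval-based robust loss, the sampling fraction sample_frac, and the linear alpha schedule are illustrative assumptions for exposition, not the paper's actual implementation.

# Hedged sketch of dynamic mixed training with a stochastically sampled
# verified-robust term. All names and hyperparameters here are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SmallMLP(nn.Module):
    def __init__(self, in_dim=784, hidden=128, classes=10):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden)
        self.fc2 = nn.Linear(hidden, classes)

    def forward(self, x):
        return self.fc2(F.relu(self.fc1(x)))

    def interval_logits(self, x, eps):
        # Propagate the L-infinity interval [x - eps, x + eps] through the
        # network; return element-wise lower/upper bounds on the logits.
        lo, hi = x - eps, x + eps
        for i, layer in enumerate([self.fc1, self.fc2]):
            center, radius = (hi + lo) / 2, (hi - lo) / 2
            center = layer(center)
            radius = radius @ layer.weight.abs().t()
            lo, hi = center - radius, center + radius
            if i == 0:                           # ReLU between the two layers
                lo, hi = lo.clamp(min=0), hi.clamp(min=0)
        return lo, hi

def verified_robust_loss(model, x, y, eps):
    # Cross-entropy on the worst-case logits: lower bound for the true class,
    # upper bounds for all other classes (a sound upper bound on the loss).
    lo, hi = model.interval_logits(x, eps)
    onehot = F.one_hot(y, hi.size(1)).bool()
    worst = torch.where(onehot, lo, hi)
    return F.cross_entropy(worst, y)

def mixed_training_step(model, opt, x, y, eps, alpha, sample_frac=0.25):
    # One step of mixed training: natural loss on the full batch plus a
    # verified-robust loss on a random sub-sample, blended by alpha.
    opt.zero_grad()
    natural = F.cross_entropy(model(x), y)
    k = max(1, int(sample_frac * x.size(0)))
    idx = torch.randperm(x.size(0))[:k]          # stochastic sub-sampling
    robust = verified_robust_loss(model, x[idx], y[idx], eps)
    loss = (1 - alpha) * natural + alpha * robust
    loss.backward()
    opt.step()
    return natural.item(), robust.item()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = SmallMLP()
    opt = torch.optim.SGD(model.parameters(), lr=0.1)
    x, y = torch.rand(64, 784), torch.randint(0, 10, (64,))
    for epoch in range(5):
        alpha = min(1.0, epoch / 4)              # illustrative ramp-up of the robust weight
        nat, rob = mixed_training_step(model, opt, x, y, eps=0.1, alpha=alpha)
        print(f"epoch {epoch}: natural={nat:.3f} robust={rob:.3f} alpha={alpha:.2f}")

In this sketch, alpha simply ramps up over a few steps; the paper's dynamic mixed training adjusts the balance between the two losses during training, and the subsampled robust term keeps the per-step cost of verification bounded.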

