A Bayes-Optimal View on Adversarial Examples

02/20/2020
by   Eitan Richardson, et al.
0

The ability to fool modern CNN classifiers with tiny perturbations of the input has lead to the development of a large number of candidate defenses and often conflicting explanations. In this paper, we argue for examining adversarial examples from the perspective of Bayes-Optimal classification. We construct realistic image datasets for which the Bayes-Optimal classifier can be efficiently computed and derive analytic conditions on the distributions so that the optimal classifier is either robust or vulnerable. By training different classifiers on these datasets (for which the "gold standard" optimal classifiers are known), we can disentangle the possible sources of vulnerability and avoid the accuracy-robustness tradeoff that may occur in commonly used datasets. Our results show that even when the optimal classifier is robust, standard CNN training consistently learns a vulnerable classifier. At the same time, for exactly the same training data, RBF SVMs consistently learn a robust classifier. The same trend is observed in experiments with real images.

READ FULL TEXT

page 2

page 3

page 6

page 7

page 13

page 15

page 16

page 17

research
01/02/2019

Adversarial Robustness May Be at Odds With Simplicity

Current techniques in machine learning are so far are unable to learn cl...
research
07/01/2021

The Interplay between Distribution Parameters and the Accuracy-Robustness Tradeoff in Classification

Adversarial training tends to result in models that are less accurate on...
research
02/10/2018

Critères de qualité d'un classifieur généraliste

This paper considers the problem of choosing a good classifier. For each...
research
06/03/2021

A Little Robustness Goes a Long Way: Leveraging Universal Features for Targeted Transfer Attacks

Adversarial examples for neural network image classifiers are known to b...
research
10/18/2020

Poisoned classifiers are not only backdoored, they are fundamentally broken

Under a commonly-studied "backdoor" poisoning attack against classificat...
research
12/03/2021

On the Existence of the Adversarial Bayes Classifier (Extended Version)

Adversarial robustness is a critical property in a variety of modern mac...
research
06/02/2018

Optimal Clustering under Uncertainty

Classical clustering algorithms typically either lack an underlying prob...

Please sign up or login with your details

Forgot password? Click here to reset