The Interplay between Distribution Parameters and the Accuracy-Robustness Tradeoff in Classification

07/01/2021
by   Alireza Mousavi Hosseini, et al.
0

Adversarial training tends to result in models that are less accurate on natural (unperturbed) examples compared to standard models. This can be attributed to either an algorithmic shortcoming or a fundamental property of the training data distribution, which admits different solutions for optimal standard and adversarial classifiers. In this work, we focus on the latter case under a binary Gaussian mixture classification problem. Unlike earlier work, we aim to derive the natural accuracy gap between the optimal Bayes and adversarial classifiers, and study the effect of different distributional parameters, namely separation between class centroids, class proportions, and the covariance matrix, on the derived gap. We show that under certain conditions, the natural error of the optimal adversarial classifier, as well as the gap, are locally minimized when classes are balanced, contradicting the performance of the Bayes classifier where perfect balance induces the worst accuracy. Moreover, we show that with an ℓ_∞ bounded perturbation and an adversarial budget of ϵ, this gap is Θ(ϵ^2) for the worst-case parameters, which for suitably small ϵ indicates the theoretical possibility of achieving robust classifiers with near-perfect accuracy, which is rarely reflected in practical algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2020

A Bayes-Optimal View on Adversarial Examples

The ability to fool modern CNN classifiers with tiny perturbations of th...
research
11/26/2011

Optimal exponential bounds on the accuracy of classification

We consider a standard binary classification problem. The performance of...
research
05/06/2020

Proper measure for adversarial robustness

This paper analyzes the problems of standard adversarial accuracy and st...
research
06/07/2021

Evaluating State-of-the-Art Classification Models Against Bayes Optimality

Evaluating the inherent difficulty of a given data-driven classification...
research
07/22/2020

Robust Machine Learning via Privacy/Rate-Distortion Theory

Robust machine learning formulations have emerged to address the prevale...
research
06/12/2023

How robust accuracy suffers from certified training with convex relaxations

Adversarial attacks pose significant threats to deploying state-of-the-a...
research
02/17/2023

Revisiting adversarial training for the worst-performing class

Despite progress in adversarial training (AT), there is a substantial ga...

Please sign up or login with your details

Forgot password? Click here to reset