Taming Adversarial Robustness via Abstaining

04/06/2021
by   Abed AlRahman Al Makdah, et al.
0

In this work, we consider a binary classification problem and cast it into a binary hypothesis testing framework, where the observations can be perturbed by an adversary. To improve the adversarial robustness of a classifier, we include an abstaining option, where the classifier abstains from taking a decision when it has low confidence about the prediction. We propose metrics to quantify the nominal performance of a classifier with abstaining option and its robustness against adversarial perturbations. We show that there exist a tradeoff between the two metrics regardless of what method is used to choose the abstaining region. Our results imply that the robustness of a classifier with abstaining can only be improved at the expense of its nominal performance. Further, we provide necessary conditions to design the abstaining region for a 1-dimensional binary classification problem. We validate our theoretical results on the MNIST dataset, where we numerically show that the tradeoff between performance and robustness also exist for the general multi-class classification problems.

READ FULL TEXT
research
04/15/2014

Multi-borders classification

The number of possible methods of generalizing binary classification to ...
research
09/12/2020

Multiclass Model for Agriculture development using Multivariate Statistical method

Mahalanobis taguchi system (MTS) is a multi-variate statistical method e...
research
10/29/2020

Robustifying Binary Classification to Adversarial Perturbation

Despite the enormous success of machine learning models in various appli...
research
10/05/2021

Tradeoffs in Streaming Binary Classification under Limited Inspection Resources

Institutions are increasingly relying on machine learning models to iden...
research
02/25/2020

General Framework for Binary Classification on Top Samples

Many binary classification problems minimize misclassification above (or...
research
10/29/2018

Adversarial Risk and Robustness: General Definitions and Implications for the Uniform Distribution

We study adversarial perturbations when the instances are uniformly dist...
research
05/10/2023

Pearson-Matthews correlation coefficients for binary and multinary classification and hypothesis testing

The Pearson-Matthews correlation coefficient (usually abbreviated MCC) i...

Please sign up or login with your details

Forgot password? Click here to reset