Attributing AUC-ROC to Analyze Binary Classifier Performance

05/24/2022
by Arya Tafvizi, et al.

Area Under the Receiver Operating Characteristic Curve (AUC-ROC) is a popular evaluation metric for binary classifiers. In this paper, we discuss techniques to segment the AUC-ROC along human-interpretable dimensions. AUC-ROC is not an additive (linear) function over the data samples, so segmenting the overall AUC-ROC is different from tabulating the AUC-ROC of individual data segments. To segment the overall AUC-ROC, we must first solve an attribution problem that assigns credit to individual examples. We observe that AUC-ROC, though non-linear over examples, is linear over pairs of examples. This observation leads to a simple, efficient attribution technique for examples (example attributions) and for pairs of examples (pair attributions). We automatically slice these attributions using decision trees, training the tree to predict the attributions; we use the notion of honest estimates along with a t-test to mitigate false discovery. Our experiments with the method show that an inferior model can outperform a superior model (trained to optimize a different training objective) on the inferior model's own training objective, a manifestation of Goodhart's Law. In contrast, AUC attributions enable a reasonable comparison, and example attributions can be used to slice this comparison. Pair attributions are used to categorize the pairs of items (one positively labeled and one negatively labeled) that the model has trouble separating. These categories identify the decision boundary of the classifier and the headroom to improve AUC.
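
The linearity over pairs mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard pairwise form of AUC-ROC (the fraction of positive-negative pairs ranked correctly, with ties counted as one half), and it assumes each pair's credit is split equally between its two examples; the abstract does not state the paper's splitting rule, so that choice, the function names, and the toy data are all hypothetical.

```python
def auc_pair_attributions(scores, labels):
    """Credit per (positive index, negative index) pair; the credits sum to AUC-ROC."""
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [j for j, y in enumerate(labels) if y == 0]
    n_pairs = len(pos) * len(neg)
    credit = {}
    for i in pos:
        for j in neg:
            if scores[i] > scores[j]:
                win = 1.0   # pair ranked correctly
            elif scores[i] == scores[j]:
                win = 0.5   # tie counts as half
            else:
                win = 0.0   # pair ranked incorrectly
            credit[(i, j)] = win / n_pairs
    return credit


def auc_example_attributions(scores, labels):
    """Assumed rule: split each pair's credit equally between its two examples."""
    attr = [0.0] * len(labels)
    for (i, j), c in auc_pair_attributions(scores, labels).items():
        attr[i] += 0.5 * c
        attr[j] += 0.5 * c
    return attr


# Toy data (made up for illustration only).
scores = [0.9, 0.8, 0.35, 0.6, 0.2]
labels = [1, 0, 1, 1, 0]
segments = ["a", "a", "b", "b", "b"]  # hypothetical slicing dimension

pair_attr = auc_pair_attributions(scores, labels)
example_attr = auc_example_attributions(scores, labels)
auc = sum(pair_attr.values())

# Because example attributions are additive, per-segment totals sum to the AUC.
by_segment = {}
for seg, a in zip(segments, example_attr):
    by_segment[seg] = by_segment.get(seg, 0.0) + a
```

Under these assumptions the per-segment totals in `by_segment` add up to the overall AUC-ROC. Separately computed per-segment AUC-ROC values would not have this property, because they ignore pairs that span two segments.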


