Optimizing ROC Curves with a Sort-Based Surrogate Loss Function for Binary Classification and Changepoint Detection

07/02/2021
by Jonathan Hillman, et al.

Receiver Operating Characteristic (ROC) curves are plots of true positive rate versus false positive rate. They are useful for evaluating binary classification models, but difficult to use for learning since the Area Under the Curve (AUC) is non-convex. ROC curves can also be used in other problems that have false positive and true positive rates, such as changepoint detection. We show that in this more general context, the ROC curve can have loops, points with highly sub-optimal error rates, and AUC greater than one. This observation motivates a new optimization objective: rather than maximizing the AUC, we would like a monotonic ROC curve with AUC=1 that avoids points with large values for Min(FP, FN). We propose a convex relaxation of this objective that results in a new surrogate loss function called the AUM, short for Area Under Min(FP, FN). Whereas previous loss functions are based on summing over all labeled examples or pairs, the AUM requires a sort and a sum over the sequence of points on the ROC curve. We show that AUM directional derivatives can be efficiently computed and used in a gradient descent learning algorithm. In our empirical study of supervised binary classification and changepoint detection problems, we show that our new AUM minimization learning algorithm results in improved AUC and comparable speed relative to previous baselines.
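
To make the "sort and sum" description concrete, here is a minimal sketch of how an Area Under Min(FP, FN) could be computed for binary classification. It rests on assumptions that the abstract does not spell out: a constant c is added to every predicted score, an example is predicted positive when its score plus c is greater than zero, FP and FN are raw counts rather than rates, and the area is taken over c. The function name `aum` and its arguments are hypothetical, chosen for illustration, and this is not presented as the authors' implementation.

```python
import numpy as np

def aum(scores, labels):
    """Sketch: area under min(FP, FN) as a function of an added constant c.

    Assumption: an example is predicted positive when score + c > 0,
    so its prediction flips at the threshold c = -score.
    labels are 1 for positive examples and 0 for negative examples.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    thresholds = -scores
    # The sort: order examples by the threshold at which they flip.
    order = np.argsort(thresholds, kind="stable")
    t = thresholds[order]
    is_pos = labels[order] == 1
    # Just above t[k], the first k+1 sorted examples are predicted positive.
    fp = np.cumsum(~is_pos)                 # negatives predicted positive
    fn = is_pos.sum() - np.cumsum(is_pos)   # positives still predicted negative
    # The sum: min(FP, FN) is constant between consecutive thresholds.
    widths = np.diff(t)
    return float(np.sum(widths * np.minimum(fp[:-1], fn[:-1])))

# Perfectly ranked scores: min(FP, FN) is zero everywhere.
print(aum([2.0, 1.0, -1.0, -2.0], [1, 1, 0, 0]))  # 0.0
# Fully reversed ranking: the area is positive.
print(aum([2.0, 1.0, -1.0, -2.0], [0, 0, 1, 1]))  # 6.0
```

Under these assumptions the cost is dominated by the sort, and tied scores contribute intervals of zero width, so they add nothing to the sum.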
