A Log-linear Gradient Descent Algorithm for Unbalanced Binary Classification using the All Pairs Squared Hinge Loss

02/21/2023
by Kyle R. Rust, et al.

Receiver Operating Characteristic (ROC) curves plot true positive rate against false positive rate and are used to evaluate binary classification algorithms. Because the Area Under the Curve (AUC) is a piecewise constant function of the predicted values, its gradient is zero almost everywhere, so learning algorithms instead optimize convex relaxations which involve a sum over all pairs of labeled positive and negative examples. Naive learning algorithms compute the gradient in quadratic time, which is too slow for learning with large batch sizes. We propose a new functional representation of the square loss and squared hinge loss, which results in algorithms that compute the gradient in either linear or log-linear time, and makes it possible to use gradient descent learning with large batch sizes. In our empirical study of supervised binary classification problems, we show that our new algorithm can achieve higher test AUC values on imbalanced data sets than previous algorithms, and can make use of larger batch sizes than were previously feasible.
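To make the quadratic versus log-linear contrast concrete, the following is a minimal sketch of one way an all-pairs squared hinge loss can be computed after a single sort. It is not taken from the paper's code. The function names, the `margin` parameter, and the 0/1 label encoding are assumptions made for illustration. The idea is that, for a positive prediction p, the loss against every negative whose margin-shifted value z exceeds p is a sum of (z - p)^2 terms, which expands into a quadratic a·p² + b·p + c whose coefficients can be accumulated with running sums.

```python
import numpy as np

def all_pairs_squared_hinge_naive(pred, labels, margin=1.0):
    """Quadratic-time reference: loop over every (positive, negative) pair."""
    pos = pred[labels == 1]
    neg = pred[labels == 0]
    # diff[j, k] = margin - (pos_j - neg_k); only positive parts are penalized.
    diff = margin - (pos[:, None] - neg[None, :])
    return float(np.sum(np.maximum(diff, 0.0) ** 2))

def all_pairs_squared_hinge_sorted(pred, labels, margin=1.0):
    """Log-linear sketch: one sort, then cumulative sums of coefficients."""
    is_pos = labels == 1
    # Shift negatives by the margin so a pair is penalized exactly when
    # the shifted negative value exceeds the positive value.
    key = np.where(is_pos, pred, pred + margin)
    order = np.argsort(-key, kind="stable")  # descending order of key
    key_sorted = key[order]
    pos_sorted = is_pos[order]
    # Only negatives contribute to the running coefficients.
    z = np.where(pos_sorted, 0.0, key_sorted)
    a = np.cumsum(~pos_sorted)   # number of negatives seen so far
    b = np.cumsum(-2.0 * z)      # linear coefficient
    c = np.cumsum(z ** 2)        # constant coefficient
    # Evaluate a*p^2 + b*p + c at each positive p; this equals the sum of
    # (z - p)^2 over negatives that come earlier in the descending order.
    per_example = a * key_sorted ** 2 + b * key_sorted + c
    return float(np.sum(per_example[pos_sorted]))

# Usage: both functions should agree up to floating-point error.
rng = np.random.default_rng(0)
pred = rng.normal(size=200)
labels = (rng.random(200) < 0.1).astype(int)  # imbalanced labels
print(all_pairs_squared_hinge_naive(pred, labels))
print(all_pairs_squared_hinge_sorted(pred, labels))
```

The sort dominates the cost, so the sketch runs in O(n log n) time for n examples, whereas the naive version materializes a matrix with one entry per positive-negative pair. A gradient could be accumulated with the same kind of running sums, which is left out here for brevity.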


