Learning with Multiclass AUC: Theory and Algorithms

07/28/2021
by   Zhiyong Yang, et al.
59

The Area under the ROC curve (AUC) is a well-known ranking metric for problems such as imbalanced learning and recommender systems. The vast majority of existing AUC-optimization-based machine learning methods only focus on binary-class cases, while leaving the multiclass cases unconsidered. In this paper, we start an early trial to consider the problem of learning multiclass scoring functions via optimizing multiclass AUC metrics. Our foundation is based on the M metric, which is a well-known multiclass extension of AUC. We first pay a revisit to this metric, showing that it could eliminate the imbalance issue from the minority class pairs. Motivated by this, we propose an empirical surrogate risk minimization framework to approximately optimize the M metric. Theoretically, we show that: (i) optimizing most of the popular differentiable surrogate losses suffices to reach the Bayes optimal scoring function asymptotically; (ii) the training framework enjoys an imbalance-aware generalization error bound, which pays more attention to the bottleneck samples of minority classes compared with the traditional O(√(1/N)) result. Practically, to deal with the low scalability of the computational operations, we propose acceleration methods for three popular surrogate loss functions, including the exponential loss, squared loss, and hinge loss, to speed up loss and gradient evaluations. Finally, experimental results on 11 real-world datasets demonstrate the effectiveness of our proposed framework.

READ FULL TEXT

page 12

page 14

page 16

page 17

research
04/16/2018

A Univariate Bound of Area Under ROC

Area under ROC (AUC) is an important metric for binary classification an...
research
08/03/2012

On the Consistency of AUC Pairwise Optimization

AUC (area under ROC curve) is an important evaluation criterion, which h...
research
08/25/2015

AUC Optimisation and Collaborative Filtering

In recommendation systems, one is interested in the ranking of the predi...
research
07/23/2022

Density-Aware Personalized Training for Risk Prediction in Imbalanced Medical Data

Medical events of interest, such as mortality, often happen at a low rat...
research
04/17/2023

Enhancing Personalized Ranking With Differentiable Group AUC Optimization

AUC is a common metric for evaluating the performance of a classifier. H...
research
10/19/2022

AUC-based Selective Classification

Selective classification (or classification with a reject option) pairs ...
research
02/07/2018

Directly and Efficiently Optimizing Prediction Error and AUC of Linear Classifiers

The predictive quality of machine learning models is typically measured ...

Please sign up or login with your details

Forgot password? Click here to reset