Quadruply Stochastic Gradients for Large Scale Nonlinear Semi-Supervised AUC Optimization

07/29/2019
by   Wanli Shi, et al.
0

Semi-supervised learning is pervasive in real-world applications, where only a few labeled data are available and large amounts of instances remain unlabeled. Since AUC is an important model evaluation metric in classification, directly optimizing AUC in semi-supervised learning scenario has drawn much attention in the machine learning community. Recently, it has been shown that one could find an unbiased solution for the semi-supervised AUC maximization problem without knowing the class prior distribution. However, this method is hardly scalable for nonlinear classification problems with kernels. To address this problem, in this paper, we propose a novel scalable quadruply stochastic gradient algorithm (QSG-S2AUC) for nonlinear semi-supervised AUC optimization. In each iteration of the stochastic optimization process, our method randomly samples a positive instance, a negative instance, an unlabeled instance and their random features to compute the gradient and then update the model by using this quadruply stochastic gradient to approach the optimal solution. More importantly, we prove that QSG-S2AUC can converge to the optimal solution in O(1/t), where t is the iteration number. Extensive experimental results on a variety of benchmark datasets show that QSG-S2AUC is far more efficient than the existing state-of-the-art algorithms for semi-supervised AUC maximization while retaining the similar generalization performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/24/2019

Quadruply Stochastic Gradient Method for Large Scale Nonlinear Semi-Supervised Ordinal Regression AUC Optimization

Semi-supervised ordinal regression (S^2OR) problems are ubiquitous in re...
research
07/26/2019

Scalable Semi-Supervised SVM via Triply Stochastic Gradients

Semi-supervised learning (SSL) plays an increasingly important role in t...
research
05/04/2017

Semi-Supervised AUC Optimization based on Positive-Unlabeled Learning

Maximizing the area under the receiver operating characteristic curve (A...
research
05/29/2018

MBA: Mini-Batch AUC Optimization

Area under the receiver operating characteristics curve (AUC) is an impo...
research
09/09/2021

Lexico-semantic and affective modelling of Spanish poetry: A semi-supervised learning approach

Text classification tasks have improved substantially during the last ye...
research
07/08/2022

Balanced Self-Paced Learning for AUC Maximization

Learning to improve AUC performance is an important topic in machine lea...
research
03/18/2018

A Robust AUC Maximization Framework with Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification

The positive-unlabeled (PU) classification is a common scenario in real-...

Please sign up or login with your details

Forgot password? Click here to reset