Evolutionary Multitasking AUC Optimization

01/04/2022
by   Chao Wang, et al.
5

Learning to optimize the area under the receiver operating characteristics curve (AUC) performance for imbalanced data has attracted much attention in recent years. Although there have been several methods of AUC optimization, scaling up AUC optimization is still an open issue due to its pairwise learning style. Maximizing AUC in the large-scale dataset can be considered as a non-convex and expensive problem. Inspired by the characteristic of pairwise learning, the cheap AUC optimization task with a small-scale dataset sampled from the large-scale dataset is constructed to promote the AUC accuracy of the original, large-scale, and expensive AUC optimization task. This paper develops an evolutionary multitasking framework (termed EMTAUC) to make full use of information among the constructed cheap and expensive tasks to obtain higher performance. In EMTAUC, one mission is to optimize AUC from the sampled dataset, and the other is to maximize AUC from the original dataset. Moreover, due to the cheap task containing limited knowledge, a strategy for dynamically adjusting the data structure of inexpensive tasks is proposed to introduce more knowledge into the multitasking AUC optimization environment. The performance of the proposed method is evaluated on a series of binary classification datasets. The experimental results demonstrate that EMTAUC is highly competitive to single task methods and online methods. Supplementary materials and source code implementation of EMTAUC can be accessed at https://github.com/xiaofangxd/EMTAUC.

READ FULL TEXT
research
12/27/2016

A Sparse Nonlinear Classifier Design Using AUC Optimization

AUC (Area under the ROC curve) is an important performance measure for a...
research
11/17/2015

AUC-maximized Deep Convolutional Neural Fields for Sequence Labeling

Deep Convolutional Neural Networks (DCNN) has shown excellent performanc...
research
05/04/2017

Semi-Supervised AUC Optimization based on Positive-Unlabeled Learning

Maximizing the area under the receiver operating characteristic curve (A...
research
09/23/2020

Online AUC Optimization for Sparse High-Dimensional Datasets

The Area Under the ROC Curve (AUC) is a widely used performance measure ...
research
07/08/2022

Balanced Self-Paced Learning for AUC Maximization

Learning to improve AUC performance is an important topic in machine lea...
research
05/01/2018

Deep Factorization Machines for Knowledge Tracing

This paper introduces our solution to the 2018 Duolingo Shared Task on S...
research
12/19/2017

Wikidata Vandalism Detection - The Loganberry Vandalism Detector at WSDM Cup 2017

Wikidata is the new, large-scale knowledge base of the Wikimedia Foundat...

Please sign up or login with your details

Forgot password? Click here to reset