SelectNet: Learning to Sample from the Wild for Imbalanced Data Training

05/23/2019
by   Yunru Liu, et al.
0

Supervised learning from training data with imbalanced class sizes, a commonly encountered scenario in real applications such as anomaly/fraud detection, has long been considered a significant challenge in machine learning. Motivated by recent progress in curriculum and self-paced learning, we propose to adopt a semi-supervised learning paradigm by training a deep neural network, referred to as SelectNet, to selectively add unlabelled data together with their predicted labels to the training dataset. Unlike existing techniques designed to tackle the difficulty in dealing with class imbalanced training data such as resampling, cost-sensitive learning, and margin-based learning, SelectNet provides an end-to-end approach for learning from important unlabelled data "in the wild" that most likely belong to the under-sampled classes in the training data, thus gradually mitigates the imbalance in the data used for training the classifier. We demonstrate the efficacy of SelectNet through extensive numerical experiments on standard datasets in computer vision.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2022

Learning to Adapt Classifier for Imbalanced Semi-supervised Learning

Pseudo-labeling has proven to be a promising semi-supervised learning (S...
research
08/20/2021

Semi-supervised learning for medical image classification using imbalanced training data

Medical image classification is often challenging for two reasons: a lac...
research
04/28/2018

Imbalanced Deep Learning by Minority Class Incremental Rectification

Model learning from class imbalanced training data is a long-standing an...
research
10/22/2021

Prototypical Classifier for Robust Class-Imbalanced Learning

Deep neural networks have been shown to be very powerful methods for man...
research
11/02/2017

Oversampling for Imbalanced Learning Based on K-Means and SMOTE

Learning from class-imbalanced data continues to be a common and challen...
research
02/09/2018

Deep Learning for Malicious Flow Detection

Cyber security has grown up to be a hot issue in recent years. How to id...
research
02/04/2020

Introduction to quasi-open set semi-supervised learning for big data analytics

State-of-the-art performance and low system complexity has made deep-lea...

Please sign up or login with your details

Forgot password? Click here to reset