G-SMOTE: A GMM-based synthetic minority oversampling technique for imbalanced learning

10/24/2018
by   Tianlun Zhang, et al.
0

Imbalanced Learning is an important learning algorithm for the classification models, which have enjoyed much popularity on many applications. Typically, imbalanced learning algorithms can be partitioned into two types, i.e., data level approaches and algorithm level approaches. In this paper, the focus is to develop a robust synthetic minority oversampling technique which falls the umbrella of data level approaches. On one hand, we proposed a method to generate synthetic samples in a high dimensional feature space, instead of a linear sampling space. On the other hand, in the proposed imbalanced learning framework, Gaussian Mixture Model is employed to distinguish the outliers from minority class instances and filter out the synthetic majority class instances. Last and more importantly, an adaptive optimization method is proposed to optimize these parameters in sampling process. By doing so, an effectiveness and efficiency imbalanced learning framework is developed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2021

GMOTE: Gaussian based minority oversampling technique for imbalanced classification adapting tail probability of outliers

Classification of imbalanced data is one of the common problems in the r...
research
07/26/2022

Distribution Learning Based on Evolutionary Algorithm Assisted Deep Neural Networks for Imbalanced Image Classification

To address the trade-off problem of quality-diversity for the generated ...
research
03/24/2021

A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios

Imbalance in the proportion of training samples belonging to different c...
research
03/22/2020

robROSE: A robust approach for dealing with imbalanced data in fraud detection

A major challenge when trying to detect fraud is that the fraudulent act...
research
12/15/2022

Interpretable ML for Imbalanced Data

Deep learning models are being increasingly applied to imbalanced data i...
research
04/07/2020

CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification

In this paper we propose two novel data-level algorithms for handling da...
research
05/05/2022

Automated Imbalanced Classification via Layered Learning

In this paper we address imbalanced binary classification (IBC) tasks. A...

Please sign up or login with your details

Forgot password? Click here to reset