A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios

03/24/2021
by   Ayush Tripathi, et al.
0

Imbalance in the proportion of training samples belonging to different classes often poses performance degradation of conventional classifiers. This is primarily due to the tendency of the classifier to be biased towards the majority classes in the imbalanced dataset. In this paper, we propose a novel three step technique to address imbalanced data. As a first step we significantly oversample the minority class distribution by employing the traditional Synthetic Minority OverSampling Technique (SMOTE) algorithm using the neighborhood of the minority class samples and in the next step we partition the generated samples using a Gaussian-Mixture Model based clustering algorithm. In the final step synthetic data samples are chosen based on the weight associated with the cluster, the weight itself being determined by the distribution of the majority class samples. Extensive experiments on several standard datasets from diverse domains shows the usefulness of the proposed technique in comparison with the original SMOTE and its state-of-the-art variants algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2018

Clustering and Learning from Imbalanced Data

A learning classifier must outperform a trivial solution, in case of imb...
research
10/24/2018

G-SMOTE: A GMM-based synthetic minority oversampling technique for imbalanced learning

Imbalanced Learning is an important learning algorithm for the classific...
research
09/28/2022

Class-Imbalanced Complementary-Label Learning via Weighted Loss

Complementary-label learning (CLL) is a common application in the scenar...
research
05/09/2021

GMOTE: Gaussian based minority oversampling technique for imbalanced classification adapting tail probability of outliers

Classification of imbalanced data is one of the common problems in the r...
research
06/05/2017

Progressive Boosting for Class Imbalance

Pattern recognition applications often suffer from skewed data distribut...
research
04/07/2020

CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification

In this paper we propose two novel data-level algorithms for handling da...
research
05/16/2023

BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions

Class imbalanced problems (CIP) are one of the potential challenges in d...

Please sign up or login with your details

Forgot password? Click here to reset