UGRWO-Sampling: A modified random walk under-sampling approach based on graphs to imbalanced data classification

02/10/2020
by   Saeideh Roshanfekr, et al.
0

In this paper, we propose a new RWO-Sampling (Random Walk Over-Sampling) based on graphs for imbalanced datasets. In this method, two figures based on under-sampling and over-sampling methods are introduced to keep the proximity information, which is robust to noises and outliers. After the construction of the first graph on minority class, RWO-Sampling will be implemented on selected samples, and the rest of them will remain unchanged. The second graph is constructed for the majority class, and the samples in a low-density area (outliers) are removed. In the proposed method, examples of the majority class in a high-density area are selected, and the rest of them are eliminated. Furthermore, utilizing RWO-sampling, the boundary of minority class is increased though, the outliers are not raised. This method is tested, and the number of evaluation measures is compared to previous methods on nine continuous attribute datasets with different over-sampling rates. The experimental results were an indicator of the high efficiency and flexibility of the proposed method for the classification of imbalanced data.

READ FULL TEXT
research
09/25/2021

Random Walk-steered Majority Undersampling

In this work, we propose Random Walk-steered Majority Undersampling (RWM...
research
07/29/2020

Evaluation of Sampling Methods for Scatterplots

Given a scatterplot with tens of thousands of points or even more, a nat...
research
04/26/2019

Weighted second-order cone programming twin support vector machine for imbalanced data classification

We propose a method of using a Weighted second-order cone programming tw...
research
06/09/2011

SMOTE: Synthetic Minority Over-sampling Technique

An approach to the construction of classifiers from imbalanced datasets ...
research
03/22/2020

robROSE: A robust approach for dealing with imbalanced data in fraud detection

A major challenge when trying to detect fraud is that the fraudulent act...
research
03/22/2022

Graph spatial sampling

We develop lagged Metropolis-Hastings walk for sampling from simple undi...
research
08/21/2020

Counterfactual-based minority oversampling for imbalanced classification

A key challenge of oversampling in imbalanced classification is that the...

Please sign up or login with your details

Forgot password? Click here to reset