Weighted Least Squares Twin Support Vector Machine with Fuzzy Rough Set Theory for Imbalanced Data Classification

05/03/2021
by   Maysam Behmanesh, et al.
0

Support vector machines (SVMs) are powerful supervised learning tools developed to solve classification problems. However, SVMs are likely to perform poorly in the classification of imbalanced data. The rough set theory presents a mathematical tool for inference in nondeterministic cases that provides methods for removing irrelevant information from data. In this work, we propose an approach that efficiently used fuzzy rough set theory in weighted least squares twin support vector machine called FRLSTSVM for classification of imbalanced data. The first innovation is introducing a new fuzzy rough set based under-sampling strategy to make the classifier robust in terms of imbalanced data. For constructing the two proximal hyperplanes in FRLSTSVM, data points from the minority class remain unchanged while a subset of data points in the majority class are selected using a new method. In this model, we embedded the weight biases in the LSTSVM formulations to overcome the bias phenomenon in the original twin SVM for the classification of imbalanced data. In order to determine these weights in this formulation, we introduced a new strategy that uses fuzzy rough set theory as the second innovation. Experimental results on famous imbalanced datasets, compared with related traditional SVM-based methods, demonstrate the superiority of our proposed FRLSTSVM model in imbalanced data classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2019

Weighted second-order cone programming twin support vector machine for imbalanced data classification

We propose a method of using a Weighted second-order cone programming tw...
research
05/20/2015

Fuzzy Least Squares Twin Support Vector Machines

Least Squares Twin Support Vector Machine (LSTSVM) is an extremely effic...
research
05/19/2023

Three-way Imbalanced Learning based on Fuzzy Twin SVM

Three-way decision (3WD) is a powerful tool for granular computing to de...
research
07/11/2018

Instance-based entropy fuzzy support vector machine for imbalanced data

Imbalanced classification has been a major challenge for machine learnin...
research
07/26/2020

Fully Bayesian Analysis of the Relevance Vector Machine Classification for Imbalanced Data

Relevance Vector Machine (RVM) is a supervised learning algorithm extend...
research
08/31/2023

Least Squares Maximum and Weighted Generalization-Memorization Machines

In this paper, we propose a new way of remembering by introducing a memo...
research
03/22/2020

Deep Synthetic Minority Over-Sampling Technique

Synthetic Minority Over-sampling Technique (SMOTE) is the most popular o...

Please sign up or login with your details

Forgot password? Click here to reset