A New Gene Selection Algorithm using Fuzzy-Rough Set Theory for Tumor Classification

In statistics and machine learning, feature selection is the process of picking a subset of relevant attributes for use in a predictive model. Recently, rough set-based feature selection techniques, which use feature dependency to perform the selection, have drawn attention. In bioinformatics applications, classification of tumors based on gene expression is used to determine appropriate treatment and prognosis of the disease. Microarray gene expression data are high-dimensional, contain many superfluous genes, and provide only a small number of training instances. Since exact supervised classification of gene expression instances in such high-dimensional problems is very complex, selecting appropriate genes is a crucial task for tumor classification. In this study, we present a new technique for gene selection using a discernibility matrix of fuzzy-rough sets. The proposed technique takes into account the similarity of instances with the same class label as well as instances with different class labels to improve the gene selection results, whereas previous state-of-the-art approaches address only the similarity of instances with different class labels. To meet that requirement, we extend the Johnson reducer technique to the fuzzy case. Experimental results demonstrate that this technique provides better efficiency than the state-of-the-art approaches.
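As background for the discernibility matrix and Johnson reducer mentioned in the abstract, the following is a minimal sketch of the classical (crisp) Johnson heuristic. It is not the fuzzy extension proposed in the paper, which is not specified here. The function names `discernibility_clauses` and `johnson_reduct`, the toy data, and the tie-breaking rule are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def discernibility_clauses(rows, labels):
    """Crisp discernibility matrix entries (illustrative, not the paper's fuzzy version).

    For every pair of instances with different class labels, record the set of
    attribute indices on which the two instances take different values.
    """
    clauses = []
    for i, j in combinations(range(len(rows)), 2):
        if labels[i] != labels[j]:
            diff = frozenset(
                k for k, (a, b) in enumerate(zip(rows[i], rows[j])) if a != b
            )
            if diff:  # identical instances with different labels cannot be discerned
                clauses.append(diff)
    return clauses


def johnson_reduct(clauses):
    """Greedy Johnson heuristic: repeatedly pick the attribute that appears in
    the most remaining clauses, then drop every clause it covers."""
    remaining = list(clauses)
    selected = []
    while remaining:
        counts = Counter(attr for clause in remaining for attr in clause)
        # Assumed tie-break: highest count first, then smallest attribute index.
        best = min(counts, key=lambda attr: (-counts[attr], attr))
        selected.append(best)
        remaining = [clause for clause in remaining if best not in clause]
    return selected


# Toy data (hypothetical): four samples, three discretized "genes".
rows = [(0, 1, 0), (0, 0, 1), (1, 1, 0), (1, 0, 1)]
labels = [0, 0, 1, 1]

reduct = johnson_reduct(discernibility_clauses(rows, labels))
print(reduct)  # [0]
```

In this toy example, gene 0 alone separates the two classes, so the greedy selection stops after one attribute. A fuzzy variant would replace the crisp "differs / does not differ" test with graded similarity values, which this sketch does not attempt.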

research
06/06/2013

Verdict Accuracy of Quick Reduct Algorithm using Clustering and Classification Techniques for Gene Expression Data

In most gene expression data, the number of training samples is very sma...
research
07/31/2018

A Fuzzy-Rough based Binary Shuffled Frog Leaping Algorithm for Feature Selection

Feature selection and attribute reduction are crucial problems, and wide...
research
05/04/2010

Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data

One of the objectives of designing feature selection learning algorithms...
research
01/08/2013

An Analysis of Gene Expression Data using Penalized Fuzzy C-Means Approach

With the rapid advances of microarray technologies, large amounts of hig...
research
02/07/2013

Feature Selection for Microarray Gene Expression Data using Simulated Annealing guided by the Multivariate Joint Entropy

In this work a new way to calculate the multivariate joint entropy is pr...
research
02/09/2019

Inverse Projection Representation and Category Contribution Rate for Robust Tumor Recognition

Sparse representation based classification (SRC) methods have achieved r...
