Biogeography-Based Informative Gene Selection and Cancer Classification Using SVM and Random Forests

07/12/2012
by Sarvesh Nikumbh, et al.

Microarray cancer gene expression data are very high-dimensional. Reducing the dimensionality improves overall analysis and classification performance. We propose two hybrid techniques, Biogeography-Based Optimization with Random Forests (BBO-RF) and BBO with Support Vector Machines (BBO-SVM), that use gene ranking as a heuristic for microarray gene expression analysis. This heuristic is obtained from an information gain filter ranking procedure. The BBO algorithm generates a population of candidate gene subsets, as part of an ecosystem of habitats, and employs migration and mutation across multiple generations of the population to improve classification accuracy. The fitness of each gene subset is assessed by the classifiers, SVM and Random Forests. The performance of these hybrid techniques is evaluated on three cancer gene expression datasets retrieved from the Kent Ridge Biomedical datasets collection and the libSVM data repository. Our results demonstrate that genes selected by the proposed techniques yield classification accuracies comparable to those of previously reported algorithms.
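The migration-and-mutation loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the linear migration model, the elitism step, the parameter defaults, and the names `bbo_select` and `toy_fitness` are all assumptions made for illustration. In particular, `toy_fitness` is a hypothetical stand-in for the cross-validated SVM or Random Forest accuracy that the paper uses as fitness, and the information gain ranking heuristic is omitted.

```python
import random

# Hypothetical stand-in for classifier accuracy: rewards selecting a small,
# fixed set of "informative" genes and penalizes large subsets.
INFORMATIVE = {0, 1, 2, 3, 4}

def toy_fitness(mask):
    hits = sum(mask[g] for g in INFORMATIVE)
    return hits - 0.1 * sum(mask)

def bbo_select(n_genes, fitness, pop_size=20, generations=30,
               mutation_rate=0.02, n_elite=2, seed=0):
    """Binary BBO sketch: each habitat is a 0/1 mask over genes."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_genes)]
           for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        # Rank habitats best-first; the rank sets the migration rates.
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        # Assumed linear model: fitter habitats emigrate more, immigrate less.
        mu = [(pop_size - i) / (pop_size + 1) for i in range(pop_size)]
        lam = [1.0 - m for m in mu]
        new_pop = [h[:] for h in pop]
        # The top n_elite habitats are copied unchanged (elitism).
        for i in range(n_elite, pop_size):
            for g in range(n_genes):
                if rng.random() < lam[i]:
                    # Migration: copy this gene from a habitat chosen with
                    # probability proportional to its emigration rate.
                    src = rng.choices(range(pop_size), weights=mu, k=1)[0]
                    new_pop[i][g] = pop[src][g]
                if rng.random() < mutation_rate:
                    # Mutation: flip the gene's selection bit.
                    new_pop[i][g] = 1 - new_pop[i][g]
        pop = new_pop
    return max(pop, key=fitness), history

best, history = bbo_select(n_genes=30, fitness=toy_fitness)
selected_genes = [g for g, bit in enumerate(best) if bit]
```

In a realistic setting, `fitness` would train and score a classifier on the columns selected by the mask. Because each evaluation is then expensive, caching fitness values per mask would likely be worthwhile.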

research
03/06/2022

A SVM Model for Candidate Y-chromosome Gene Discovery in Prostate Cancer

Prostate cancer is widely known to be one of the most common cancers amo...
research
04/06/2020

Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques

Early detection of cancer increases the probability of recovery. This pa...
research
08/11/2016

Semi-Supervised Prediction of Gene Regulatory Networks Using Machine Learning Algorithms

Use of computational methods to predict gene regulatory networks (GRNs) ...
research
02/24/2022

An Efficient Binary Harris Hawks Optimization based on Quantum SVM for Cancer Classification Tasks

Cancer classification based on gene expression increases early diagnosis...
research
06/26/2016

Discriminating sample groups with multi-way data

High-dimensional linear classifiers, such as the support vector machine ...
research
05/27/2022

Gene selection from microarray expression data: A Multi-objective PSO with adaptive K-nearest neighborhood

Cancer detection is one of the key research topics in the medical field....
research
06/05/2015

Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods

Various approaches to gene selection for cancer classification based on ...
