A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-treated Atlantic Cod (Gadus morhua) Liver

05/20/2019
by   Xiaokang Zhang, et al.
0

Univariate and multivariate feature selection methods can be used for biomarker discovery in analysis of toxicant exposure. Among the univariate methods, differential expression analysis (DEA) is often applied for its simplicity and interpretability. A characteristic of methods for DEA is that they treat genes individually, disregarding the correlation that exists between them. On the other hand, some multivariate feature selection methods are proposed for biomarker discovery. Provided with various biomarker discovery methods, how to choose the most suitable method for a specific dataset becomes a problem. In this paper, we present a framework for comparison of potential biomarker discovery methods: three methods that stem from different theories are compared by how stable they are and how well they can improve the classification accuracy. The three methods we have considered are: Significance Analysis of Microarrays (SAM) which identifies the differentially expressed genes; minimum Redundancy Maximum Relevance (mRMR) based on information theory; and Characteristic Direction (GeoDE) inspired by a graphical perspective. Tested on the gene expression data from two experiments exposing the cod fish to two different toxicants (MeHg and PCB 153), different methods stand out in different cases, so a decision upon the most suitable method should be made based on the dataset under study and the research interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2021

Multivariate feature ranking of gene expression data

Gene expression datasets are usually of high dimensionality and therefor...
research
06/05/2015

Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods

Various approaches to gene selection for cancer classification based on ...
research
09/30/2021

Feature Selection on a Flare Forecasting Testbed: A Comparative Study of 24 Methods

The Space-Weather ANalytics for Solar Flares (SWAN-SF) is a multivariate...
research
07/31/2021

A Hybrid Ensemble Feature Selection Design for Candidate Biomarkers Discovery from Transcriptome Profiles

The discovery of disease biomarkers from gene expression data has been g...
research
06/11/2018

Network reconstruction with local partial correlation: comparative evaluation

Over the past decade, various methods have been proposed for the reconst...
research
11/16/2021

On the utility of power spectral techniques with feature selection techniques for effective mental task classification in noninvasive BCI

In this paper classification of mental task-root Brain-Computer Interfac...

Please sign up or login with your details

Forgot password? Click here to reset