A variable selection approach for highly correlated predictors in high-dimensional genomic data

07/21/2020
by Wencan Zhu, et al.

In genomic studies, identifying biomarkers associated with a variable of interest is a major concern in biomedical research. Regularized approaches are classically used to perform variable selection in high-dimensional linear models. However, these methods can fail when the predictors are highly correlated. We propose a novel variable selection approach called WLasso that takes these correlations into account. It consists of rewriting the initial high-dimensional linear model to remove the correlation between the biomarkers (predictors) and then applying the generalized Lasso criterion. The performance of WLasso is assessed on synthetic data in several scenarios and compared with recent alternative approaches. The results show that when the biomarkers are highly correlated, WLasso outperforms the other approaches in sparse high-dimensional frameworks. The method is also successfully illustrated on publicly available gene expression data in breast cancer. Our method is implemented in the WLasso R package, which is available from the Comprehensive R Archive Network.
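The "rewriting" step can be illustrated with a small numerical sketch. The abstract does not spell out the transformation, so the following assumes a standard whitening: with predictor correlation matrix Σ, the model y = Xβ + ε is rewritten as y = X̃β̃ + ε, where X̃ = XΣ^{-1/2} and β̃ = Σ^{1/2}β. Under that assumption, an L1 penalty on β becomes a penalty on Σ^{-1/2}β̃, which has the form of a generalized Lasso criterion. The compound-symmetry Σ, the dimensions, and the helper name `inv_sqrt_and_sqrt` are all hypothetical choices made for illustration; this is not the WLasso package's code.

```python
import numpy as np

def inv_sqrt_and_sqrt(sigma):
    """Return (sigma^{-1/2}, sigma^{1/2}) for a symmetric positive-definite matrix."""
    eigvals, eigvecs = np.linalg.eigh(sigma)
    inv_sqrt = eigvecs @ np.diag(eigvals ** -0.5) @ eigvecs.T
    sqrt = eigvecs @ np.diag(eigvals ** 0.5) @ eigvecs.T
    return inv_sqrt, sqrt

rng = np.random.default_rng(0)
n, p, rho = 200, 5, 0.9  # illustrative sizes, not taken from the paper

# Assumed correlation structure: every pair of predictors has correlation rho.
sigma = np.full((p, p), rho) + (1 - rho) * np.eye(p)
X = rng.multivariate_normal(np.zeros(p), sigma, size=n)

# Sparse coefficient vector: only two active predictors.
beta = np.array([2.0, 0.0, 0.0, -1.5, 0.0])

sigma_inv_sqrt, sigma_sqrt = inv_sqrt_and_sqrt(sigma)
X_tilde = X @ sigma_inv_sqrt      # rewritten (whitened) design matrix
beta_tilde = sigma_sqrt @ beta    # rewritten coefficients

# The rewritten model gives the same linear predictor as the original one.
max_abs_diff = float(np.max(np.abs(X @ beta - X_tilde @ beta_tilde)))

# Population covariance of the rewritten predictors is the identity.
cov_tilde = sigma_inv_sqrt @ sigma @ sigma_inv_sqrt

print("max |X beta - X_tilde beta_tilde| =", max_abs_diff)
print("whitened population covariance is identity:",
      np.allclose(cov_tilde, np.eye(p)))
```

Note that β̃ is generally not sparse even when β is, which is why, in this sketch, the penalty would be placed on Σ^{-1/2}β̃ rather than on β̃ itself. In practice Σ is unknown and would have to be estimated from the data.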

research
07/10/2021

The EAS approach to variable selection for multivariate response data in high-dimensional settings

In this paper, we extend the epsilon admissible subsets (EAS) model sele...
research
06/29/2022

Variable selection in high-dimensional logistic regression models using a whitening approach

In bioinformatics, the rapid development of sequencing technology has en...
research
02/04/2022

Identification of prognostic and predictive biomarkers in high-dimensional data with PPLasso

In clinical trials, identification of prognostic and predictive biomarke...
research
04/02/2022

Structural randomised selection

An important problem in the analysis of high-dimensional omics data is t...
research
04/12/2022

Evolutionary shift detection with ensemble variable selection

1. Abrupt environmental changes can lead to evolutionary shifts in trait...
research
09/23/2020

Bayesian Hierarchical Models for High-Dimensional Mediation Analysis with Coordinated Selection of Correlated Mediators

We consider Bayesian high-dimensional mediation analysis to identify amo...
research
06/10/2021

Sign Consistency of the Generalized Elastic Net Estimator

In this paper, we propose a novel variable selection approach in the fra...