A statistical methodology to select covariates in high-dimensional data under dependence. Application to the classification of genetic profiles in oncology

09/12/2019
by   Bérangère Bastien, et al.
0

We propose a new methodology for selecting and ranking covariates associated with a variable of interest in a context of high-dimensional data under dependence but few observations. The methodology successively intertwines the clustering of covariates, decorrelation of covariates using Factor Latent Analysis, selection using aggregation of adapted methods and finally ranking. Simulations study shows the interest of the decorrelation inside the different clusters of covariates. We first apply our method to transcriptomic data of 37 patients with advanced non-small-cell lung cancer who have received chemotherapy, to select the transcriptomic covariates that explain the survival outcome of the treatment. Secondly, we apply our method to 79 breast tumor samples to define patient profiles for a new metastatic biomarker and associated gene network in order to personalize the treatments.

READ FULL TEXT
research
03/17/2023

A New Covariate Selection Strategy for High Dimensional Data in Causal Effect Estimation with Multivariate Treatments

Selection of covariates is crucial in the estimation of average treatmen...
research
02/10/2022

A Clustering Approach to Integrative Analysis of Multiomic Cancer Data

Rapid technological advances have allowed for molecular profiling across...
research
01/04/2022

Estimating Heterogeneous Causal Effects of High-Dimensional Treatments: Application to Conjoint Analysis

Estimation of heterogeneous treatment effects is an active area of resea...
research
07/29/2016

The Phylogenetic LASSO and the Microbiome

Scientific investigations that incorporate next generation sequencing in...
research
04/11/2023

A nonparametric framework for treatment effect modifier discovery in high dimensions

Heterogeneous treatment effects are driven by treatment effect modifiers...
research
06/22/2022

Automated Cancer Subtyping via Vector Quantization Mutual Information Maximization

Cancer subtyping is crucial for understanding the nature of tumors and p...
research
04/18/2022

A Greedy and Optimistic Approach to Clustering with a Specified Uncertainty of Covariates

In this study, we examine a clustering problem in which the covariates o...

Please sign up or login with your details

Forgot password? Click here to reset