Variable selection via Group LASSO Approach : Application to the Cox Regression and frailty model

02/23/2018
by   Jean Claude Utazirubanda, et al.
0

In the analysis of survival outcome supplemented with both clinical information and high-dimensional gene expression data, use of the traditional Cox proportional hazards model (1972) fails to meet some emerging needs in biomedical research. First, the number of covariates is generally much larger the sample size. Secondly, predicting an outcome based on individual gene expression is inadequate because multiple biological processes and functional pathways regulate the expression associated with a gene. Another challenge is that the Cox model assumes that populations are homogenous, implying that all individuals have the same risk of death, which is rarely true due to unmeasured risk factors among populations. In this paper we propose group LASSO with gamma-distributed frailty for variable selection in Cox regression by extending previous scholarship to account for heterogeneity among group structures related to exposure and susceptibility. The consistency property of the proposed method is established. This method is appropriate for addressing a wide variety of research questions from genetics to air pollution. Simulated analysis shows promising performance by group LASSO compared with other methods, including group SCAD and group MCP. Future directions include expanding the use of frailty with adaptive group LASSO and sparse group LASS.

READ FULL TEXT

page 18

page 20

research
08/30/2018

Simulation-Selection-Extrapolation: Estimation in High-Dimensional Errors-in-Variables Models

This paper considers errors-in-variables models in a high-dimensional se...
research
09/12/2017

Identifying Genetic Risk Factors via Sparse Group Lasso with Group Graph Structure

Genome-wide association studies (GWA studies or GWAS) investigate the re...
research
02/11/2008

On the ℓ_1-ℓ_q Regularized Regression

In this paper we consider the problem of grouped variable selection in h...
research
04/16/2020

Combining heterogeneous subgroups with graph-structured variable selection priors for Cox regression

Important objectives in cancer research are the prediction of a patient'...
research
05/20/2023

Inferring diagnostic and prognostic gene expression signatures across WHO glioma classifications: A network-based approach

Tumor heterogeneity is a challenge to designing effective and targeted t...
research
08/31/2020

Variable selection in social-environmental data: Sparse regression and tree ensemble machine learning approaches

Objective: Social-environmental data obtained from the U.S. Census is an...
research
09/22/2020

ABM: an automatic supervised feature engineering method for loss based models based on group and fused lasso

A vital problem in solving classification or regression problem is to ap...

Please sign up or login with your details

Forgot password? Click here to reset