When to encourage using Gaussian regression for feature selection tasks with time-to-event outcome

10/10/2022
by   Rong Lu, et al.
0

IMPORTANCE: Feature selection with respect to time-to-event outcomes is one of the fundamental problems in clinical trials and biomarker discovery studies. But it's unclear which statistical methods should be used when sample size is small or some of the key covariates are not measured. DESIGN: In this simulation study, the true models are multivariate Cox proportional hazards models with 10 covariates. It's assumed that only 5 out the 10 true features are observed/measured for all model fitting, along with 5 random noise features. Each sample size scenario is explored using 10,000 simulation datasets. Eight regression models are applied to each dataset to estimate feature effects, including both regularized Gaussian regression (elastic net penalty) and regularized Cox regression (glmnet Cox). RESULTS: If the covariates are highly correlated Gaussian, the Gaussian regression of log-transformed survival time with only two covariates outperforms all tested Cox regression models when total number of events <500.

READ FULL TEXT

page 18

page 30

page 31

research
08/20/2022

Should univariate Cox regression be used for feature selection with respect to time-to-event outcomes?

IMPORTANCE: Time-to-event outcomes are commonly used in clinical trials ...
research
09/27/2020

RENT – Repeated Elastic Net Technique for Feature Selection

In this study we present the RENT feature selection method for binary cl...
research
04/06/2023

Bivariate copula regression models for semi-competing risks

Time-to-event semi-competing risk endpoints may be correlated when both ...
research
08/02/2019

FeatureExplorer: Interactive Feature Selection and Exploration of Regression Models for Hyperspectral Images

Feature selection is used in machine learning to improve predictions, de...
research
08/01/2023

CoxKnockoff: Controlled Feature Selection for the Cox Model Using Knockoffs

Although there is a huge literature on feature selection for the Cox mod...
research
06/23/2022

The Effective Sample Size in Bayesian Information Criterion for Level-Specific Fixed and Random Effects Selection in a Two-Level Nested Model

Popular statistical software provides Bayesian information criterion (BI...
research
02/07/2018

Cadre Modeling: Simultaneously Discovering Subpopulations and Predictive Models

We consider the problem in regression analysis of identifying subpopulat...

Please sign up or login with your details

Forgot password? Click here to reset