Should univariate Cox regression be used for feature selection with respect to time-to-event outcomes?

08/20/2022
by   Rong Lu, et al.
0

IMPORTANCE: Time-to-event outcomes are commonly used in clinical trials and biomarker discovery studies and have been primarily analyzed using Cox proportional hazards models. But it's unclear which statistical models should be recommended for feature selection tasks when time-to-event outcomes are of the primary interest. OBJECTIVE: To explore if Gaussian regression of log-transformed survival time could outperform Cox proportional hazards models in feature selection. DESIGN: In this simulation study, the true models are multivariate Cox proportional hazards models with 10 covariates. For all feature selection comparisons, it's assumed that only 5 out the 10 true features are observed/measured for all model fitting, along with 5 random noise features. Each sample size and censoring rate scenario is explored using 10,000 simulation datasets. Different statistical models are applied to the same dataset to estimate feature effects. Model performance is compared using sensitivity, specificity, and accuracy of effect size ranking. RESULTS: When features are independent and the true models are multivariate Cox proportional hazards models, Gaussian regression of log-transformed survival time (response variable) with only two covariates outperformed both the univariate Cox proportional hazards model and logistic regression in feature selection, in terms of not only higher sensitivity, comparable specificity, but also higher accuracy of effect size ranking, regardless of the sample size and censoring rate values. CONCLUSIONS AND RELEVANCE: This study demonstrates the importance of including Gaussian regression of log-transformed survival time in feature selection practice for time-to-event outcomes.

READ FULL TEXT

page 11

page 15

research
10/10/2022

When to encourage using Gaussian regression for feature selection tasks with time-to-event outcome

IMPORTANCE: Feature selection with respect to time-to-event outcomes is ...
research
08/01/2023

CoxKnockoff: Controlled Feature Selection for the Cox Model Using Knockoffs

Although there is a huge literature on feature selection for the Cox mod...
research
04/01/2020

A generalised OMP algorithm for feature selection with application to gene expression data

Feature selection for predictive analytics is the problem of identifying...
research
09/24/2019

Survival analysis as a classification problem

In this paper, we explore a method for treating survival analysis as a c...
research
11/03/2014

Bayesian feature selection with strongly-regularizing priors maps to the Ising Model

Identifying small subsets of features that are relevant for prediction a...
research
01/24/2019

A XGBoost risk model via feature selection and Bayesian hyper-parameter optimization

This paper aims to explore models based on the extreme gradient boosting...
research
08/05/2022

On concordance indices for models with time-varying risk

Harrel's concordance index is a commonly used discrimination metric for ...

Please sign up or login with your details

Forgot password? Click here to reset