Dimension reduction for integrative survival analysis

08/04/2021
by   Aaron J. Molstad, et al.
0

We propose a constrained maximum partial likelihood estimator for dimension reduction in integrative (e.g., pan-cancer) survival analysis with high-dimensional covariates. We assume that for each population in the study, the hazard function follows a distinct Cox proportional hazards model. To borrow information across populations, we assume that all of the hazard functions depend only on a small number of linear combinations of the predictors. We estimate these linear combinations using an algorithm based on "distance-to-set" penalties. This allows us to impose both low-rankness and sparsity. We derive asymptotic results which reveal that our regression coefficient estimator is more efficient than fitting a separate proportional hazards model for each population. Numerical experiments suggest that our method outperforms related competitors under various data generating models. We use our method to perform a pan-cancer survival analysis relating protein expression to survival across 18 distinct cancer types. Our approach identifies six linear combinations, depending on only 20 proteins, which explain survival across the cancer types. Finally, we validate our fitted model on four external datasets and show that our estimated coefficients can lead to better prediction than popular competitors.

READ FULL TEXT
research
10/15/2017

Efficient Estimation for Dimension Reduction with Censored Data

We propose a general index model for survival data, which generalizes ma...
research
07/30/2019

Classification Algorithm for High Dimensional Protein Markers in Time-course Data

Identification of biomarkers is an emerging area in Oncology. In this ar...
research
09/05/2019

A new reproducing kernel based nonlinear dimension reduction method for survival data

Based on the theories of sliced inverse regression (SIR) and reproducing...
research
02/05/2023

Optimal subsampling for the Cox proportional hazards model with massive survival data

The use of massive survival data has become common in survival analysis....
research
05/14/2019

Nonlinear Semi-Parametric Models for Survival Analysis

Semi-parametric survival analysis methods like the Cox Proportional Haza...
research
09/17/2018

A convex formulation for high-dimensional sparse sliced inverse regression

Sliced inverse regression is a popular tool for sufficient dimension red...
research
12/15/2020

Certifiably Optimal Sparse Sufficient Dimension Reduction

Sufficient dimension reduction (SDR) is a popular tool in regression ana...

Please sign up or login with your details

Forgot password? Click here to reset