A Fast Divide-and-Conquer Sparse Cox Regression

04/02/2018
by   Yan Wang, et al.
0

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size n_0 is exceedingly large and the covariate dimension p is not small but n_0≫ p. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least square approximation to the partial likelihood (PL). These sequences of linearization enable us to maximize the PL with only a small subset and perform penalized estimation via a fast approximation to the PL. The algorithm is applicable for the analysis of both time-independent and time-dependent survival data. Simulations suggest that the proposed DAC algorithm substantially outperforms the full sample-based estimators and the existing DAC algorithm with respect to the computational speed, while it achieves similar statistical efficiency as the full sample-based estimators. The proposed algorithm was applied to an extraordinarily large time-independent survival dataset and an extraordinarily large time-dependent survival dataset for the prediction of heart failure-specific readmission within 30 days among Medicare heart failure patients.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2017

Scalable Sparse Cox's Regression for Large-Scale Survival Data via Broken Adaptive Ridge

This paper develops a new sparse Cox regression method for high-dimensio...
research
12/03/2020

Optimal Cox Regression Subsampling Procedure with Rare Events

Massive sized survival datasets are becoming increasingly prevalent with...
research
05/19/2020

Matching methods for obtaining survival functions to estimate the effect of a time-dependent treatment

In observational studies of survival time featuring a binary time-depend...
research
09/15/2018

ROC-Guided Survival Trees and Forests

Tree-based methods are popular nonparametric tools in studying time-to-e...
research
01/20/2021

Data-driven sparse polynomial chaos expansion for models with dependent inputs

Polynomial chaos expansions (PCEs) have been used in many real-world eng...
research
03/17/2020

Statistically Guided Divide-and-Conquer for Sparse Factorization of Large Matrix

The sparse factorization of a large matrix is fundamental in modern stat...

Please sign up or login with your details

Forgot password? Click here to reset