
Massive Parallelization of Massive Sample-size Survival Analysis

04/18/2022
by   Jianxiao Yang, et al.

Large-scale observational health databases are increasingly popular for conducting comparative effectiveness and safety studies of medical products. However, the increasing number of patients poses computational challenges when fitting survival regression models in such studies. In this paper, we use graphics processing units (GPUs) to parallelize the computational bottlenecks of massive sample-size survival analyses. Specifically, we develop and apply time- and memory-efficient single-pass parallel scan algorithms for Cox proportional hazards models and forward-backward parallel scan algorithms for Fine-Gray models, for analysis with and without a competing risk, using a cyclic coordinate descent optimization approach. We demonstrate that GPUs accelerate the fitting of these complex models in large databases by orders of magnitude compared to traditional multi-core CPU parallelism. Our implementation enables efficient large-scale observational studies involving millions of patients and thousands of patient characteristics.
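To illustrate why a scan fits the Cox model, note that once subjects are sorted by decreasing observed time, each risk-set denominator in the partial likelihood is a running sum of exp(linear predictor). The following is a minimal sequential sketch in Python of that idea, not the authors' GPU implementation. It assumes no tied event times, and the function name and arguments (`times`, `events`, `risk_scores`) are hypothetical, chosen only for illustration.

```python
import math
from itertools import accumulate


def cox_log_partial_likelihood(times, events, risk_scores):
    """Cox log partial likelihood, assuming no tied event times.

    times       : observed follow-up times
    events      : 1 if the event was observed, 0 if censored
    risk_scores : linear predictors x_i^T beta (hypothetical inputs)
    """
    # Sort subjects by decreasing time so that every risk set is a prefix
    # of the sorted list.
    order = sorted(range(len(times)), key=lambda i: -times[i])
    exp_scores = [math.exp(risk_scores[i]) for i in order]

    # Inclusive prefix sum (scan): denominators[k] adds up exp-scores of
    # all subjects still at risk when subject order[k] is observed.
    denominators = list(accumulate(exp_scores))

    # Only subjects with an observed event contribute a term.
    return sum(
        risk_scores[i] - math.log(d)
        for i, d in zip(order, denominators)
        if events[i] == 1
    )


# Three subjects with equal risk scores; the last one is censored.
loglik = cox_log_partial_likelihood([1.0, 2.0, 3.0], [1, 1, 0], [0.0, 0.0, 0.0])
```

The running sum is the step that a parallel scan would replace on a GPU; everything else is element-wise work or a reduction.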

12/02/2017

Scalable Sparse Cox's Regression for Large-Scale Survival Data via Broken Adaptive Ridge

This paper develops a new sparse Cox regression method for high-dimensio...
11/07/2019

Scalable Algorithms for Large Competing Risks Data

This paper develops two orthogonal contributions to scalable sparse regr...
12/03/2020

Optimal Cox Regression Subsampling Procedure with Rare Events

Massive sized survival datasets are becoming increasingly prevalent with...
10/15/2021

Sparsity-Specific Code Optimization using Expression Trees

We introduce a code generator that converts unoptimized C++ code operati...
02/13/2019

Selective recruitment designs for improving observational studies using electronic health records

Large scale electronic health records (EHRs) present an opportunity to q...
10/17/2019

Generalized Mixed Modeling in Massive Electronic Health Record Databases: what is a healthy serum potassium?

Converting electronic health record (EHR) entries to useful clinical inf...