A scalable and flexible Cox proportional hazards model for high-dimensional survival prediction and functional selection

05/23/2022
by   Boyi Guo, et al.
0

Cox proportional hazards model is one of the most popular models in biomedical data analysis. There have been continuing efforts to improve the flexibility of such models for complex signal detection, for example, via additive functions. Nevertheless, the task to extend Cox additive models to accommodate high-dimensional data is nontrivial. When estimating additive functions, commonly used group sparse regularization may introduce excess smoothing shrinkage on additive functions, damaging predictive performance. Moreover, an "all-in-all-out" approach makes functional selection challenging to answer if nonlinear effects exist. We develop an additive Cox PH model to address these challenges in high-dimensional data analysis. Notably, we impose a novel spike-and-slab LASSO prior that motivates the bi-level functional selection on additive functions. A scalable and deterministic algorithm, EM-Coordinate Descent, is designed for scalable model fitting. We compare the predictive and computational performance against state-of-the-art models in simulation studies and metabolomics data analysis. The proposed model is broadly applicable to various fields of research, e.g. genomics and population health, via the freely available R package BHAM (https://boyiguo1.github.io/BHAM/).

READ FULL TEXT
research
10/27/2021

Spike-and-Slab Generalized Additive Models and Scalable Algorithms for High-Dimensional Data

There are proposals that extend the classical generalized additive model...
research
07/05/2022

The R Package BHAM: Fast and Scalable Bayesian Hierarchical Additive Model for High-dimensional Data

BHAM is a freely avaible R pakcage that implments Bayesian hierarchical ...
research
08/15/2020

Ultra high dimensional generalized additive model: Unified Theory and Methods

Generalized additive model is a powerful statistical learning and predic...
research
07/17/2014

Sparse Partially Linear Additive Models

The generalized partially linear additive model (GPLAM) is a flexible an...
research
02/12/2022

DeepPAMM: Deep Piecewise Exponential Additive Mixed Models for Complex Hazard Structures in Survival Analysis

Survival analysis (SA) is an active field of research that is concerned ...
research
05/05/2020

Semiparametric analysis of clustered interval-censored survival data using Soft Bayesian Additive Regression Trees (SBART)

Popular parametric and semiparametric hazards regression models for clus...
research
07/14/2017

Toward A Scalable Exploratory Framework for Complex High-Dimensional Phenomics Data

Phenomics is an emerging branch of modern biology, which uses high throu...

Please sign up or login with your details

Forgot password? Click here to reset