BoXHED 2.0: Scalable boosting of functional data in survival analysis

by Arash Pakbin, et al.

Modern applications of survival analysis increasingly involve time-dependent covariates, which constitute a form of functional data. Learning from functional data generally involves repeated evaluations of time integrals, which is numerically expensive. In this work we propose a lightweight data preprocessing step that transforms functional data into nonfunctional data. Boosting implementations for nonfunctional data can then be used, whereby the required numerical integration comes for free as part of the training phase. We use this to develop BoXHED 2.0, a quantum leap over the tree-boosted hazard package BoXHED 1.0. BoXHED 2.0 extends BoXHED 1.0 to Aalen's multiplicative intensity model, which covers censoring schemes far beyond right-censoring and also supports recurrent events data. It is also massively scalable, both because of the preprocessing step and because it borrows from the core components of XGBoost. BoXHED 2.0 supports the use of GPUs and multicore CPUs, and is available from GitHub.
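
The preprocessing idea in the abstract can be illustrated with a minimal sketch. This is not the BoXHED 2.0 API: the function names `split_epochs`, `cumulative_hazard`, and `step_log_hazard`, the row layout, and the cut points are all hypothetical. The sketch assumes that each subject's covariate is recorded as constant over epochs `[t_start, t_end)` and that the log-hazard estimate is piecewise constant in time between a fixed set of candidate cut points. Under those assumptions, splitting every epoch at the cut points yields ordinary tabular rows, and the time integral of the hazard reduces to a weighted sum over rows.

```python
import math


def split_epochs(epochs, time_cuts):
    """Split (t_start, t_end, x, event) epochs at candidate time cut points.

    Each epoch says the covariate x was constant on [t_start, t_end), and
    event is 1 if the event occurred at t_end. Output rows never straddle
    a cut point, so they can be treated as nonfunctional (tabular) data.
    """
    cuts = sorted(time_cuts)
    rows = []
    for t_start, t_end, x, event in epochs:
        # Keep only the cut points strictly inside this epoch.
        inner = [c for c in cuts if t_start < c < t_end]
        edges = [t_start] + inner + [t_end]
        for left, right in zip(edges[:-1], edges[1:]):
            rows.append({
                "t_start": left,
                "dt": right - left,  # exposure time carried by this row
                "x": x,
                # Only the final piece of an epoch can carry its event.
                "event": event if right == t_end else 0,
            })
    return rows


def cumulative_hazard(rows, log_hazard):
    """Integrated hazard along the trajectory, computed as a plain sum.

    Exact only when log_hazard is constant in time within each row,
    which the splitting step guarantees under the stated assumption.
    """
    return sum(r["dt"] * math.exp(log_hazard(r["t_start"], r["x"])) for r in rows)


def step_log_hazard(t, x):
    """Hypothetical tree-like log-hazard: one split in time, one in x."""
    return (0.0 if t < 1.0 else 0.5) + (0.0 if x < 0.5 else 1.0)


# One subject observed on [0, 3) with a covariate change at t = 1.5
# and an event at t = 3.
epochs = [(0.0, 1.5, 0.2, 0), (1.5, 3.0, 0.7, 1)]
rows = split_epochs(epochs, time_cuts=[1.0, 2.0])
total = cumulative_hazard(rows, step_log_hazard)
```

In this sketch the integration cost is paid once during preprocessing. After that, each row contributes `dt * exp(log_hazard)` to the integral, which is the kind of per-row quantity a standard boosting implementation can accumulate during training.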







Partial Least Squares for Functional Joint Models

Many biomedical studies have identified important imaging biomarkers tha...

Extension of the Gradient Boosting Algorithm for Joint Modeling of Longitudinal and Time-to-Event data

In various data situations joint models are an efficient tool to analyze...

Boosting hazard regression with time-varying covariates

Consider a left-truncated right-censored survival process whose evolutio...

Functional Time Series Forecasting: Functional Singular Spectrum Analysis Approaches

In this paper, we propose two nonparametric methods used in the forecast...

Survival trees for right-censored data based on score based parameter instability test

Survival analysis of right censored data arises often in many areas of r...

Gradient Boosting Survival Tree with Applications in Credit Scoring

Credit scoring (Thomas et al., 2002) plays a vital role in the field of ...

Leveraging Deep Representations of Radiology Reports in Survival Analysis for Predicting Heart Failure Patient Mortality

Utilizing clinical texts in survival analysis is difficult because they ...

