Boost-R: Gradient Boosted Trees for Recurrence Data

07/03/2021
by Xiao Liu, et al.

Recurrence data arise in many domains, including reliability, cyber security, healthcare, and online retailing. This paper investigates an additive-tree-based approach, known as Boost-R (Boosting for Recurrence Data), for recurrent event data with both static and dynamic features. Boost-R constructs an ensemble of gradient boosted additive trees to estimate the cumulative intensity function of the recurrent event process; each new tree is added to the ensemble by minimizing the regularized L2 distance between the observed and predicted cumulative intensity. Unlike conventional regression trees, which store a constant on each leaf, Boost-R constructs a time-dependent function on each tree leaf. The sum of these functions over all trees yields the ensemble estimator of the cumulative intensity. The divide-and-conquer nature of tree-based methods is appealing when hidden sub-populations exist within a heterogeneous population, and the non-parametric nature of regression trees avoids parametric assumptions on the complex interactions between event processes and features. Critical insights and advantages of Boost-R are investigated through comprehensive numerical examples. The datasets and computer code for Boost-R are available on GitHub. To the best of our knowledge, Boost-R is the first gradient boosted additive-tree-based approach for modeling large-scale recurrent event data with both static and dynamic feature information.
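
The idea of boosting trees whose leaves hold functions of time, rather than constants, can be illustrated with a small sketch. This is not the authors' implementation: it assumes static features only, no censoring (every unit is observed over the whole time grid), depth-one trees, a leaf function taken as the pointwise mean of the residual curves, and no regularization term. The names `observed_cumulative_counts`, `fit_stump`, `predict_stump`, and `boost` are hypothetical.

```python
import numpy as np

def observed_cumulative_counts(event_times, grid):
    """Observed cumulative count of one unit: events at or before each grid time."""
    return np.searchsorted(np.sort(event_times), grid, side="right").astype(float)

def fit_stump(X, R):
    """Depth-one tree on residual curves R (n_units x n_times).

    Each leaf stores a curve over the time grid (the mean residual curve),
    and the split is chosen to minimize the summed squared L2 error.
    Assumption: no regularization, unlike the method described above.
    """
    best = None
    for j in range(X.shape[1]):
        for thr in np.unique(X[:, j])[:-1]:  # candidate thresholds
            left = X[:, j] <= thr
            f_left, f_right = R[left].mean(axis=0), R[~left].mean(axis=0)
            loss = ((R[left] - f_left) ** 2).sum() + ((R[~left] - f_right) ** 2).sum()
            if best is None or loss < best[0]:
                best = (loss, j, thr, f_left, f_right)
    if best is None:
        raise ValueError("no valid split: all features are constant")
    return best[1:]

def predict_stump(stump, X):
    """Return the leaf curve for every row of X (n_units x n_times)."""
    j, thr, f_left, f_right = stump
    return np.where((X[:, j] <= thr)[:, None], f_left, f_right)

def boost(X, Y, n_trees=20, learning_rate=0.5):
    """Additive ensemble: each new stump is fitted to the current residual curves."""
    pred = np.zeros_like(Y)
    trees = []
    for _ in range(n_trees):
        stump = fit_stump(X, Y - pred)
        pred = pred + learning_rate * predict_stump(stump, X)
        trees.append(stump)
    return trees, pred

# Toy data: two sub-populations with different event rates (illustrative only).
grid = np.array([0.5, 1.0, 1.5, 2.0])
X = np.array([[0.0], [0.0], [1.0], [1.0]])
events = [[1.0, 2.0], [1.0, 2.0], [0.5, 1.0, 1.5, 2.0], [0.5, 1.0, 1.5, 2.0]]
Y = np.vstack([observed_cumulative_counts(e, grid) for e in events])
trees, pred = boost(X, Y)
```

In this toy example the single feature separates the two sub-populations, so the summed leaf curves converge to each group's observed cumulative count curve as trees are added.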

research
01/27/2021

Boost-S: Gradient Boosted Trees for Spatial Data and Its Application to FDG-PET Imaging Data

Boosting Trees are one of the most successful statistical learning appro...
research
08/17/2021

Semi-parametric Bayesian Additive Regression Trees

We propose a new semi-parametric model based on Bayesian Additive Regres...
research
06/24/2021

Regression Trees and Ensembles for Cumulative Incidence Functions

The use of cumulative incidence functions for characterizing the risk of...
research
06/04/2017

InfiniteBoost: building infinite ensembles with gradient descent

In machine learning ensemble methods have demonstrated high accuracy for...
research
11/13/2020

Regression Trees for Cumulative Incidence Functions

The use of cumulative incidence functions for characterizing the risk of...
research
08/10/2018

BooST: Boosting Smooth Trees for Partial Effect Estimation in Nonlinear Regressions

In this paper we introduce a new machine learning (ML) model for nonline...
research
10/10/2022

A copula-based boosting model for time-to-event prediction with dependent censoring

A characteristic feature of time-to-event data analysis is possible cens...
