Random survival forests for competing risks with multivariate longitudinal endogenous covariates

08/11/2022
by   Anthony Devaux, et al.
0

Predicting the individual risk of a clinical event using the complete patient history is still a major challenge for personalized medicine. Among the methods developed to compute individual dynamic predictions, the joint models have the assets of using all the available information while accounting for dropout. However, they are restricted to a very small number of longitudinal predictors. Our objective was to propose an innovative alternative solution to predict an event probability using a possibly large number of longitudinal predictors. We developed DynForest, an extension of random survival forests for competing risks that handles endogenous longitudinal predictors. At each node of the trees, the time-dependent predictors are translated into time-fixed features (using mixed models) to be used as candidates for splitting the subjects into two subgroups. The individual event probability is estimated in each tree by the Aalen-Johansen estimator of the leaf in which the subject is classified according to his/her history of predictors. The final individual prediction is given by the average of the tree-specific individual event probabilities. We carried out a simulation study to demonstrate the performances of DynForest both in a small dimensional context (in comparison with joint models) and in a large dimensional context (in comparison with a regression calibration method that ignores informative dropout). We also applied DynForest to (i) predict the individual probability of dementia in the elderly according to repeated measures of cognitive, functional, vascular and neuro-degeneration markers, and (ii) quantify the importance of each type of markers for the prediction of dementia. Implemented in the R package DynForest, our methodology provides a solution for the prediction of events from longitudinal endogenous predictors whatever their number.

READ FULL TEXT

page 25

page 26

page 28

research
02/02/2021

Individual dynamic prediction of clinical endpoint from large dimensional longitudinal biomarker history: a landmark approach

The individual data collected throughout patient follow-up constitute cr...
research
02/06/2023

Random Forests for time-fixed and time-dependent predictors: The DynForest R package

The R package DynForest implements random forests for predicting a categ...
research
01/12/2021

Penalized regression calibration: a method for the prediction of survival outcomes using complex longitudinal and high-dimensional data

Longitudinal and high-dimensional measurements have become increasingly ...
research
03/03/2020

Prediction of Time to a Terminal Event (TTTE) of New Units in a Dynamic Recurrent Competing Risks Model

In this paper, we propose a simulation approach to predict time to termi...
research
01/31/2019

Random forests for high-dimensional longitudinal data

Random forests is a state-of-the-art supervised machine learning method ...
research
07/23/2022

Random Competing Risks Forests for Large Data

Random forests are a sensible non-parametric model to predict competing ...
research
12/03/2018

Prediction of New Onset Diabetes after Liver Transplant

25 within the next 5 years. These thousands of individuals are at 2-fold...

Please sign up or login with your details

Forgot password? Click here to reset