ggRandomForests: Exploring Random Forest Survival

12/28/2016
by John Ehrlinger, et al.

Random forest (RF) (Breiman 2001a) is a non-parametric statistical method that requires no distributional assumptions about how the covariates relate to the response. RF is a robust, nonlinear technique that optimizes predictive accuracy by fitting an ensemble of trees to stabilize model estimates. Random survival forests (RSF) (Ishwaran and Kogalur 2007; Ishwaran et al. 2008) extend Breiman's RF techniques to allow efficient non-parametric analysis of time-to-event data. The randomForestSRC package (Ishwaran and Kogalur 2014) provides a unified treatment of Breiman's random forests for survival, regression and classification problems. Predictive accuracy makes RF an attractive alternative to parametric models, though the complexity of the forest and the difficulty of interpreting it hinder wider application of the method. We introduce the ggRandomForests package, a set of tools for visually understanding random forest models grown in R (R Core Team 2014) with the randomForestSRC package. The ggRandomForests package is structured to extract intermediate data objects from randomForestSRC objects and to generate figures from them using the ggplot2 (Wickham 2009) graphics package. This document is structured as a tutorial for building a random forest for survival with the randomForestSRC package and for using the ggRandomForests package to investigate how the forest is constructed. We analyse the primary biliary cirrhosis (PBC) of the liver data from a clinical trial at the Mayo Clinic (Fleming and Harrington 1991). Our aim is to demonstrate the strength of random forest methods for both prediction and information retrieval, specifically in time-to-event data settings.
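The workflow described above (grow a survival forest with randomForestSRC, extract an intermediate data object with ggRandomForests, then plot it with ggplot2) can be sketched roughly as follows. This is a minimal illustration rather than the authors' analysis: it assumes the `pbc` data set shipped with randomForestSRC, and the seed, `ntree`, `nsplit` and `na.action` values are arbitrary choices made for the example.

```r
library(randomForestSRC)   # rfsrc() and the pbc data set
library(ggRandomForests)   # gg_* extractors and their plot methods
library(ggplot2)

# PBC data as distributed with randomForestSRC (days = follow-up time,
# status = event indicator).
data(pbc, package = "randomForestSRC")

# Grow a random survival forest. The settings below are illustrative
# assumptions, not values taken from the paper.
set.seed(42)
rfsrc_pbc <- rfsrc(Surv(days, status) ~ ., data = pbc,
                   ntree = 500, nsplit = 10,
                   na.action = "na.impute",   # impute missing covariates
                   importance = TRUE)         # store variable importance

# Step 1: extract intermediate data objects from the forest.
gg_surv <- gg_rfsrc(rfsrc_pbc)   # predicted survival curves
gg_imp  <- gg_vimp(rfsrc_pbc)    # variable importance (VIMP)

# Step 2: turn each data object into a ggplot2 figure.
p_surv <- plot(gg_surv)
p_imp  <- plot(gg_imp)
print(p_surv)
print(p_imp)
```

Because the `gg_*` objects are ordinary data frames with an extra class, they can be inspected or reshaped before plotting, and the returned plots can be extended with further ggplot2 layers such as `labs()` or `theme()`.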
