Prediction with Missing Data via Bayesian Additive Regression Trees

06/03/2013
by Adam Kapelner, et al.

We present a method for incorporating missing data into non-parametric statistical learning without the need for imputation. We focus on a tree-based method, Bayesian Additive Regression Trees (BART), enhanced with "Missingness Incorporated in Attributes" (MIA), a recently proposed approach for incorporating missingness into decision trees (Twala, 2008). This procedure takes advantage of the partitioning mechanisms found in tree-based models. Simulations on generated models and on real data indicate that our proposed method can forecast well on complicated missing-at-random and not-missing-at-random models, as well as on models where missingness itself influences the response. In many cases, our procedure has higher predictive performance and is more stable than competitors. We also illustrate BART's ability to incorporate missingness into uncertainty intervals and to detect the influence of missingness on the model fit.
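To make the partitioning idea concrete, the following is a minimal sketch of how "Missingness Incorporated in Attributes" can treat a single numeric split, assuming the usual three-way formulation: missing values are sent left, sent right, or separated from all observed values. The function names (`mia_candidate_partitions`, `best_mia_partition`) and the greedy squared-error scoring are illustrative assumptions and are not taken from the paper; BART proposes splitting rules stochastically rather than by greedy search, so the scoring here is only a stand-in for showing how missingness becomes part of the rule itself.

```python
import math
from typing import List, Optional, Tuple


def is_missing(v: Optional[float]) -> bool:
    """Treat None and NaN as missing."""
    return v is None or (isinstance(v, float) and math.isnan(v))


def mia_candidate_partitions(
    values: List[Optional[float]], threshold: float
) -> List[Tuple[str, List[int], List[int]]]:
    """Return the three candidate partitions as (rule, left_idx, right_idx)."""
    low = [i for i, v in enumerate(values) if not is_missing(v) and v <= threshold]
    high = [i for i, v in enumerate(values) if not is_missing(v) and v > threshold]
    miss = [i for i, v in enumerate(values) if is_missing(v)]
    return [
        # Rule 1: x <= c or x missing goes left; x > c goes right.
        ("missing_left", sorted(low + miss), high),
        # Rule 2: x <= c goes left; x > c or x missing goes right.
        ("missing_right", low, sorted(high + miss)),
        # Rule 3: split on missingness itself, ignoring the threshold.
        ("missing_vs_observed", miss, sorted(low + high)),
    ]


def sse(ys: List[float]) -> float:
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not ys:
        return 0.0
    mean = sum(ys) / len(ys)
    return sum((y - mean) ** 2 for y in ys)


def best_mia_partition(
    values: List[Optional[float]], responses: List[float], threshold: float
) -> Optional[Tuple[str, float]]:
    """Illustrative greedy choice: the rule with the lowest total SSE.

    This scoring is an assumption for demonstration, not the BART sampler.
    """
    best: Optional[Tuple[str, float]] = None
    for rule, left, right in mia_candidate_partitions(values, threshold):
        if not left or not right:
            continue  # skip degenerate splits with an empty child
        loss = sse([responses[i] for i in left]) + sse([responses[i] for i in right])
        if best is None or loss < best[1]:
            best = (rule, loss)
    return best


if __name__ == "__main__":
    x = [1.0, 2.0, math.nan, 5.0, None, 6.0]
    y = [1.0, 1.0, 10.0, 2.0, 10.0, 2.0]  # missingness itself shifts the response
    print(best_mia_partition(x, y, threshold=3.0))
```

In this toy data the rows with a missing attribute have a markedly different response, so the rule that separates missing from observed values gives the tightest children. No imputation step is involved: the missing rows are routed by the splitting rule directly.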

research
09/07/2023

Trinary Decision Trees for missing value handling

This paper introduces the Trinary decision tree, an algorithm designed t...
research
12/08/2013

bartMachine: Machine Learning with Bayesian Additive Regression Trees

We present a new package in R implementing Bayesian additive regression ...
research
06/29/2020

Handling Missing Data in Decision Trees: A Probabilistic Approach

Decision trees are a popular family of models due to their attractive pr...
research
08/26/2021

On Soft Bayesian Additive Regression Trees and asynchronous longitudinal regression analysis

In many longitudinal studies, the covariate and response are often inter...
research
10/20/2020

A Comparative Study of Imputation Methods for Multivariate Ordinal Data

Missing data remains a very common problem in large datasets, including ...
research
06/25/2020

Joints in Random Forests

Decision Trees (DTs) and Random Forests (RFs) are powerful discriminativ...
research
02/13/2017

metboost: Exploratory regression analysis with hierarchically clustered data

As data collections become larger, exploratory regression analysis becom...