Distributed Soft Bayesian Additive Regression Trees

08/26/2021
by   Hao Ran, et al.
0

Bayesian Additive Regression Trees(BART) is a Bayesian nonparametric approach which has been shown to be competitive with the best modern predictive methods such as random forest and Gradient Boosting Decision Tree.The sum of trees structure combined with a Bayesian inferential framework provide a accurate and robust statistic method.BART variant named SBART using randomized decision trees has been developed and show practical benefits compared to BART. The primary bottleneck of SBART is the speed to compute the sufficient statistics and the publicly avaiable implementation of the SBART algorithm in the R package is very slow.In this paper we show how the SBART algorithm can be modified and computed using single program,multiple data(SPMD) distributed computation with the Message Passing Interface(MPI) library.This approach scales nearly linearly in the number of processor cores, enabling the practitioner to perform statistical inference on massive datasets. Our approach can also handle datasets too massive to fit on any single data repository.We have made modification to this algorithm to make it capable to handle classfication problem which can not be done with the original R package.With data experiments we show the advantage of distributed SBART for classfication problem compared to BART.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2021

On Soft Bayesian Additive Regression Trees and asynchronous longitudinal regression analysis

In many longitudinal studies, the covariate and response are often inter...
research
12/08/2013

bartMachine: Machine Learning with Bayesian Additive Regression Trees

We present a new package in R implementing Bayesian additive regression ...
research
02/21/2023

Variational Boosted Soft Trees

Gradient boosting machines (GBMs) based on decision trees consistently d...
research
03/27/2013

Multiple decision trees

This paper describes experiments, on two domains, to investigate the eff...
research
10/28/2022

SoftBart: Soft Bayesian Additive Regression Trees

Bayesian additive regression tree (BART) models have seen increased atte...
research
07/26/2022

Single MCMC Chain Parallelisation on Decision Trees

Decision trees are highly famous in machine learning and usually acquire...
research
10/04/2018

Accelerated Bayesian Additive Regression Trees

Although less widely known than random forests or boosted regression tre...

Please sign up or login with your details

Forgot password? Click here to reset