Stochastic tree ensembles for regularized nonlinear regression

02/09/2020
by Jingyu He, et al.

This paper develops a novel stochastic tree ensemble method for nonlinear regression, which we refer to as XBART, short for Accelerated Bayesian Additive Regression Trees. By combining regularization and stochastic search strategies from Bayesian modeling with computationally efficient techniques from recursive partitioning approaches, the new method attains state-of-the-art performance: in many settings it is both faster and more accurate than the widely used XGBoost algorithm. Via careful simulation studies, we demonstrate that our new approach provides accurate point-wise estimates of the mean function and does so faster than popular alternatives, such as BART, XGBoost and neural networks (using Keras). We also prove a number of basic theoretical results about the new algorithm, including consistency of the single-tree version of the model and stationarity of the Markov chain produced by the ensemble version. Furthermore, we demonstrate that initializing the standard Bayesian additive regression trees (BART) Markov chain Monte Carlo (MCMC) sampler at XBART-fitted trees considerably improves credible interval coverage and reduces total run-time.

research
10/04/2018

Accelerated Bayesian Additive Regression Trees

Although less widely known than random forests or boosted regression tre...
research
07/26/2022

Single MCMC Chain Parallelisation on Decision Trees

Decision trees are highly famous in machine learning and usually acquire...
research
04/19/2019

Continuous-Time Birth-Death MCMC for Bayesian Regression Tree Models

Decision trees are flexible models that are well suited for many statist...
research
09/16/2021

How trustworthy is your tree? Bayesian phylogenetic effective sample size through the lens of Monte Carlo error

Bayesian inference is a popular and widely-used approach to infer phylog...
research
01/22/2023

Parallel Approaches to Accelerate Bayesian Decision Trees

Markov Chain Monte Carlo (MCMC) is a well-established family of algorith...
research
06/19/2008

BART: Bayesian additive regression trees

We develop a Bayesian "sum-of-trees" model where each tree is constraine...
research
08/07/2021

Bayesian L_1/2 regression

It is well known that bridge regression enjoys superior theoretical prop...
