Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

02/15/2023
by   Colin Sullivan, et al.
0

Decision Trees are some of the most popular machine learning models today due to their out-of-the-box performance and interpretability. Often, Decision Trees models are constructed greedily in a top-down fashion via heuristic search criteria, such as Gini impurity or entropy. However, trees constructed in this manner are sensitive to minor fluctuations in training data and are prone to overfitting. In contrast, Bayesian approaches to tree construction formulate the selection process as a posterior inference problem; such approaches are more stable and provide greater theoretical guarantees. However, generating Bayesian Decision Trees usually requires sampling from complex, multimodal posterior distributions. Current Markov Chain Monte Carlo-based approaches for sampling Bayesian Decision Trees are prone to mode collapse and long mixing times, which makes them impractical. In this paper, we propose a new criterion for training Bayesian Decision Trees. Our criterion gives rise to BCART-PCFG, which can efficiently sample decision trees from a posterior distribution across trees given the data and find the maximum a posteriori (MAP) tree. Learning the posterior and training the sampler can be done in time that is polynomial in the dataset size. Once the posterior has been learned, trees can be sampled efficiently (linearly in the number of nodes). At the core of our method is a reduction of sampling the posterior to sampling a derivation from a probabilistic context-free grammar. We find that trees sampled via BCART-PCFG perform comparable to or better than greedily-constructed Decision Trees in classification accuracy on several datasets. Additionally, the trees sampled via BCART-PCFG are significantly smaller – sometimes by as much as 20x.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2013

Top-down particle filtering for Bayesian decision trees

Decision tree learning is a popular approach for classification and regr...
research
06/12/2023

Prediction Algorithms Achieving Bayesian Decision Theoretical Optimality Based on Decision Trees as Data Observation Processes

In the field of decision trees, most previous studies have difficulty en...
research
11/13/2017

Machine Learning Meets Microeconomics: The Case of Decision Trees and Discrete Choice

We provide a microeconomic framework for decision trees: a popular machi...
research
04/13/2021

Deep imagination is a close to optimal policy for planning in large decision trees under limited resources

Many decisions involve choosing an uncertain course of actions in deep a...
research
09/06/2022

Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy

Probabilistic (Bayesian) modeling has experienced a surge of application...
research
01/20/2021

Dive into Decision Trees and Forests: A Theoretical Demonstration

Based on decision trees, many fields have arguably made tremendous progr...
research
07/08/2022

Decision Trees for Analyzing Influences on the Accuracy of Indoor Localization Systems

Absolute position accuracy is the key performance criterion of an Indoor...

Please sign up or login with your details

Forgot password? Click here to reset