Dive into Decision Trees and Forests: A Theoretical Demonstration

01/20/2021
by   Jinxiong Zhang, et al.
0

Based on decision trees, many fields have arguably made tremendous progress in recent years. In simple words, decision trees use the strategy of "divide-and-conquer" to divide the complex problem on the dependency between input features and labels into smaller ones. While decision trees have a long history, recent advances have greatly improved their performance in computational advertising, recommender system, information retrieval, etc. We introduce common tree-based models (e.g., Bayesian CART, Bayesian regression splines) and training techniques (e.g., mixed integer programming, alternating optimization, gradient descent). Along the way, we highlight probabilistic characteristics of tree-based models and explain their practical and theoretical benefits. Except machine learning and data mining, we try to show theoretical advances on tree-based models from other fields such as statistics and operation research. We list the reproducible resource at the end of each method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2018

Deep Neural Decision Trees

Deep neural networks have been proven powerful at processing perceptual ...
research
06/11/2023

Improving the Validity of Decision Trees as Explanations

In classification and forecasting with tabular data, one often utilizes ...
research
02/15/2023

Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

Decision Trees are some of the most popular machine learning models toda...
research
11/30/2006

Lossless fitness inheritance in genetic algorithms for decision trees

When genetic algorithms are used to evolve decision trees, key tree qual...
research
04/10/2019

A Selective Overview of Deep Learning

Deep learning has arguably achieved tremendous success in recent years. ...
research
05/28/2022

Optimal Decision Diagrams for Classification

Decision diagrams for classification have some notable advantages over d...
research
02/19/2018

Finding Influential Training Samples for Gradient Boosted Decision Trees

We address the problem of finding influential training samples for a par...

Please sign up or login with your details

Forgot password? Click here to reset