Learning Optimal Decision Trees from Large Datasets

04/12/2019
by   Florent Avellaneda, et al.
0

Inferring a decision tree from a given dataset is one of the classic problems in machine learning. This problem consists of buildings, from a labelled dataset, a tree such that each node corresponds to a class and a path between the tree root and a leaf corresponds to a conjunction of features to be satisfied in this class. Following the principle of parsimony, we want to infer a minimal tree consistent with the dataset. Unfortunately, inferring an optimal decision tree is known to be NP-complete for several definitions of optimality. Hence, the majority of existing approaches relies on heuristics, and as for the few exact inference approaches, they do not work on large data sets. In this paper, we propose a novel approach for inferring a decision tree of a minimum depth based on the incremental generation of Boolean formula. The experimental results indicate that it scales sufficiently well and the time it takes to run grows slowly with the size of dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2020

Succinct Explanations With Cascading Decision Trees

Classic decision tree learning is a binary classification algorithm that...
research
04/29/2019

Optimal Sparse Decision Trees

Decision tree algorithms have been among the most popular algorithms for...
research
07/02/2021

Decision tree heuristics can fail, even in the smoothed setting

Greedy decision tree learning heuristics are mainstays of machine learni...
research
01/30/2018

Indistinguishable binomial decision tree of 3-SAT: Proof of class P is a proper subset of class NP

This paper solves a long standing open problem of whether NP-complete pr...
research
09/26/2022

On (Simple) Decision Tree Rank

In the decision tree computation model for Boolean functions, the depth ...
research
05/11/2022

Subspace Learning Machine (SLM): Methodology and Performance

Inspired by the feedforward multilayer perceptron (FF-MLP), decision tre...
research
06/03/2011

An Analysis of Reduced Error Pruning

Top-down induction of decision trees has been observed to suffer from th...

Please sign up or login with your details

Forgot password? Click here to reset