The Max-Cut Decision Tree: Improving on the Accuracy and Running Time of Decision Trees

06/25/2020
by Jonathan Bodine, et al.

Decision trees are a widely used method for classification, both on their own and as the building blocks of many ensemble learning methods. The Max-Cut decision tree introduces two novel modifications to a standard baseline model of classification decision tree construction, specifically CART Gini. The first is an alternative splitting metric, maximum cut, based on maximizing the distance between all pairs of observations that belong to separate classes and fall on separate sides of the threshold value. The second is to select the decision feature from linear combinations of the input features constructed using Principal Component Analysis (PCA) locally at each node. Our experiments show that this node-based localized PCA with the novel splitting metric can dramatically improve classification accuracy while also significantly decreasing computational time compared to the baseline decision tree. Moreover, our results are most significant on data sets with higher dimensions or more classes; for the example data set CIFAR-100, the modifications enable a 49% improvement in accuracy while reducing CPU time by 94%. These modifications dramatically advance the capabilities of decision trees for difficult classification tasks.
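
To make the two modifications concrete, here is a minimal sketch of a single node split as the abstract describes it. It is an illustration under stated assumptions, not the authors' implementation: the function names (`first_principal_component`, `max_cut_threshold`, `split_node`) are hypothetical, only the first principal component is used as the candidate feature, and the cut value is computed by brute force over all pairs rather than with any optimized procedure from the full paper.

```python
import numpy as np


def first_principal_component(X):
    """Return the first principal direction of the rows of X (local PCA at a node)."""
    Xc = X - X.mean(axis=0)  # center the node's observations
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return vt[0]  # unit vector; its sign is arbitrary


def max_cut_threshold(z, y):
    """Pick the threshold on 1-D values z that maximizes the summed distance
    between pairs of observations that differ in class AND fall on opposite
    sides of the threshold. Brute force, O(n^2) memory (assumption: small n)."""
    order = np.argsort(z)
    z, y = z[order], y[order]
    # Pairwise distances, zeroed out for pairs that share a class label.
    different_class = y[:, None] != y[None, :]
    dist = np.abs(z[:, None] - z[None, :]) * different_class
    best_t, best_cut = None, -np.inf
    for k in range(1, len(z)):
        if z[k] == z[k - 1]:
            continue  # no threshold can separate equal values
        cut = dist[:k, k:].sum()  # left side is z[:k], right side is z[k:]
        if cut > best_cut:
            best_cut, best_t = cut, 0.5 * (z[k - 1] + z[k])
    return best_t, best_cut


def split_node(X, y):
    """Project the node's data onto its first principal component, then
    choose the max-cut threshold along that projection."""
    w = first_principal_component(X)
    t, cut = max_cut_threshold(X @ w, y)
    return w, t, cut


if __name__ == "__main__":
    X = np.array([[0.0, 0.0], [1.0, 0.0], [10.0, 0.0], [11.0, 0.0]])
    y = np.array([0, 0, 1, 1])
    w, t, cut = split_node(X, y)
    print("direction:", w, "threshold:", t, "cut value:", cut)
```

An observation `x` would then be routed by comparing `x @ w` with the threshold `t`. Because the sign of a principal direction is arbitrary, the threshold may come out negated from one run or platform to another; the partition it induces is the same.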


