Des-q: a quantum algorithm to construct and efficiently retrain decision trees for regression and binary classification

09/18/2023
by   Niraj Kumar, et al.
0

Decision trees are widely used in machine learning due to their simplicity in construction and interpretability. However, as data sizes grow, traditional methods for constructing and retraining decision trees become increasingly slow, scaling polynomially with the number of training examples. In this work, we introduce a novel quantum algorithm, named Des-q, for constructing and retraining decision trees in regression and binary classification tasks. Assuming the data stream produces small increments of new training examples, we demonstrate that our Des-q algorithm significantly reduces the time required for tree retraining, achieving a poly-logarithmic time complexity in the number of training examples, even accounting for the time needed to load the new examples into quantum-accessible memory. Our approach involves building a decision tree algorithm to perform k-piecewise linear tree splits at each internal node. These splits simultaneously generate multiple hyperplanes, dividing the feature space into k distinct regions. To determine the k suitable anchor points for these splits, we develop an efficient quantum-supervised clustering method, building upon the q-means algorithm of Kerenidis et al. Des-q first efficiently estimates each feature weight using a novel quantum technique to estimate the Pearson correlation. Subsequently, we employ weighted distance estimation to cluster the training examples in k disjoint regions and then proceed to expand the tree using the same procedure. We benchmark the performance of the simulated version of our algorithm against the state-of-the-art classical decision tree for regression and binary classification on multiple data sets with numerical features. Further, we showcase that the proposed algorithm exhibits similar performance to the state-of-the-art decision tree while significantly speeding up the periodic tree retraining.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2019

A Novel Hyperparameter-free Approach to Decision Tree Construction that Avoids Overfitting by Design

Decision trees are an extremely popular machine learning technique. Unfo...
research
08/30/2021

Representation of binary classification trees with binary features by quantum circuits

We propose a quantum representation of binary classification trees with ...
research
06/25/2020

The Max-Cut Decision Tree: Improving on the Accuracy and Running Time of Decision Trees

Decision trees are a widely used method for classification, both by them...
research
12/28/2022

The Improvement of Decision Tree Construction Algorithm Based On Quantum Heuristic Algorithms

This work is related to the implementation of a decision tree constructi...
research
05/16/2022

The Influence of Dimensions on the Complexity of Computing Decision Trees

A decision tree recursively splits a feature space ℝ^d and then assigns ...
research
04/23/2022

A Novel Splitting Criterion Inspired by Geometric Mean Metric Learning for Decision Tree

Decision tree (DT) attracts persistent research attention due to its imp...
research
02/27/2017

Memory-Efficient Global Refinement of Decision-Tree Ensembles and its Application to Face Alignment

Ren et al. recently introduced a method for aggregating multiple decisio...

Please sign up or login with your details

Forgot password? Click here to reset