HHCART: An Oblique Decision Tree

04/14/2015
by   D. C. Wickramarachchi, et al.
0

Decision trees are a popular technique in statistical data classification. They recursively partition the feature space into disjoint sub-regions until each sub-region becomes homogeneous with respect to a particular class. The basic Classification and Regression Tree (CART) algorithm partitions the feature space using axis parallel splits. When the true decision boundaries are not aligned with the feature axes, this approach can produce a complicated boundary structure. Oblique decision trees use oblique decision boundaries to potentially simplify the boundary structure. The major limitation of this approach is that the tree induction algorithm is computationally expensive. In this article we present a new decision tree algorithm, called HHCART. The method utilizes a series of Householder matrices to reflect the training data at each node during the tree construction. Each reflection is based on the directions of the eigenvectors from each classes' covariance matrix. Considering axis parallel splits in the reflected training data provides an efficient way of finding oblique splits in the unreflected training data. Experimental results show that the accuracy and size of the HHCART trees are comparable with some benchmark methods in the literature. The appealing feature of HHCART is that it can handle both qualitative and quantitative features in the same oblique split.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2022

The Influence of Dimensions on the Complexity of Computing Decision Trees

A decision tree recursively splits a feature space ℝ^d and then assigns ...
research
10/07/2021

Coresets for Decision Trees of Signals

A k-decision tree t (or k-tree) is a recursive partition of a matrix (2D...
research
04/26/2020

Classification Trees for Imbalanced and Sparse Data: Surface-to-Volume Regularization

Classification algorithms face difficulties when one or more classes hav...
research
04/10/2017

A Decision Tree Based Approach Towards Adaptive Profiling of Distributed Applications

The adoption of the distributed paradigm has allowed applications to inc...
research
04/26/2018

Handling Missing Values using Decision Trees with Branch-Exclusive Splits

In this article we propose a new decision tree construction algorithm. T...
research
10/26/2017

Big Data Classification Using Augmented Decision Trees

We present an algorithm for classification tasks on big data. Experiment...

Please sign up or login with your details

Forgot password? Click here to reset