CO2 Forest: Improved Random Forest by Continuous Optimization of Oblique Splits

06/19/2015
by Mohammad Norouzi, et al.

We propose a novel algorithm for optimizing multivariate linear threshold functions as split functions of decision trees to create improved Random Forest classifiers. Standard tree induction methods resort to sampling and exhaustive search to find good univariate split functions. In contrast, our method computes a linear combination of the features at each node, and optimizes the parameters of these linear-combination (oblique) split functions by adopting a variant of the latent variable SVM formulation. We develop a convex-concave upper bound on the classification loss for a one-level decision tree, and optimize the bound by stochastic gradient descent at each internal node of the tree. Forests of up to 1000 Continuously Optimized Oblique (CO2) decision trees are created, which significantly outperform Random Forest with univariate splits and previous techniques for constructing oblique trees. Experimental results are reported on multi-class classification benchmarks and on the Labeled Faces in the Wild (LFW) dataset.
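To make the idea of an oblique split concrete, here is a minimal sketch in plain Python. The abstract does not state the form of the upper bound, so the sketch assumes a loss-augmented surrogate in the style of latent variable SVMs, max_g (g * w.x + loss_g) - max_h (h * w.x) with g, h in {-1, +1}, plus a norm constraint on w. The names `route`, `sgd_step`, `loss_left`, `loss_right`, and `max_norm` are hypothetical and chosen for illustration only; this is not the authors' implementation.

```python
def dot(w, x):
    """Inner product of two equal-length sequences."""
    return sum(wi * xi for wi, xi in zip(w, x))


def route(w, x):
    """Oblique split: send x right (+1) if w.x >= 0, else left (-1).

    A univariate split is the special case where w has one nonzero entry.
    """
    return 1 if dot(w, x) >= 0 else -1


def sgd_step(w, x, loss_left, loss_right, lr=0.1, max_norm=10.0):
    """One subgradient step on the ASSUMED surrogate
        max_g (g * w.x + loss_g) - max_h (h * w.x),  g, h in {-1, +1},
    where loss_left / loss_right are the losses incurred if x is sent
    to the left / right leaf (hypothetical inputs for this sketch).
    """
    s = dot(w, x)
    # Loss-augmented direction: the branch that maximizes g * s + loss_g.
    g = 1 if s + loss_right >= -s + loss_left else -1
    # Current routing direction: maximizes h * s.
    h = 1 if s >= 0 else -1
    # Subgradient of the surrogate with respect to w is (g - h) * x.
    w = [wi - lr * (g - h) * xi for wi, xi in zip(w, x)]
    # Project back onto the norm ball ||w|| <= max_norm (assumed regularizer).
    norm = sum(wi * wi for wi in w) ** 0.5
    if norm > max_norm:
        w = [wi * max_norm / norm for wi in w]
    return w


# Usage: an example whose label matches the right leaf (loss_right = 0)
# pulls w toward routing it right.
w = sgd_step([0.0, 0.0], [1.0, 2.0], loss_left=1.0, loss_right=0.0)
direction = route(w, [1.0, 2.0])
```

When the loss-augmented direction and the current routing direction agree, the step leaves w unchanged, so updates only occur for examples that are misrouted or routed correctly with too small a margin relative to the loss gap.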

