Best-scored Random Forest Classification

05/27/2019
by Hanyuan Hang, et al.

We propose an algorithm, the best-scored random forest, for binary classification problems. "Best-scored" means that each tree in the forest is the one with the best empirical performance among a fixed number of purely random tree candidates. As a result, the forest can be more accurate than the original purely random forest. On the theoretical side, within the framework of regularized empirical risk minimization with a penalty on the number of splits, we establish almost optimal convergence rates for the proposed best-scored random trees under certain conditions, and these rates can be extended to the best-scored random forest. In addition, we present a counterexample illustrating that, to ensure consistency of the forest, every dimension must have a chance of being split. In the numerical experiments, for the sake of efficiency, we employ an adaptive random splitting criterion. Comparative experiments with other state-of-the-art classification methods demonstrate the accuracy of our best-scored random forest.
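The selection idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the uniform choice of cell, dimension, and cut point, the training-set scoring, the penalty weight `lam`, and the restriction to features in [0, 1]^d are all assumptions made for illustration, and the adaptive splitting criterion used in the paper's experiments is not reproduced.

```python
import random

def _contains(cell, x):
    # A cell is a list of (low, high) bounds, one pair per dimension.
    return all(lo <= v <= hi for (lo, hi), v in zip(cell, x))

def build_random_tree(X, y, n_splits, rng):
    """Grow a purely random partition of [0, 1]^d, then label cells by majority vote."""
    d = len(X[0])
    cells = [[(0.0, 1.0)] * d]
    for _ in range(n_splits):
        cell = cells.pop(rng.randrange(len(cells)))  # random cell to split
        dim = rng.randrange(d)                       # every dimension can be chosen
        lo, hi = cell[dim]
        cut = rng.uniform(lo, hi)                    # random split point
        left, right = list(cell), list(cell)
        left[dim], right[dim] = (lo, cut), (cut, hi)
        cells += [left, right]
    labels = []
    for cell in cells:
        inside = [yi for xi, yi in zip(X, y) if _contains(cell, xi)]
        # Empty cells default to class 0 (an arbitrary choice for this sketch).
        labels.append(1 if inside and 2 * sum(inside) >= len(inside) else 0)
    return cells, labels

def predict_tree(tree, x):
    cells, labels = tree
    for cell, label in zip(cells, labels):
        if _contains(cell, x):
            return label
    return 0  # point outside [0, 1]^d

def penalized_error(tree, X, y, n_splits, lam):
    # Empirical 0-1 error plus a penalty on the number of splits.
    err = sum(predict_tree(tree, xi) != yi for xi, yi in zip(X, y)) / len(X)
    return err + lam * n_splits

def best_scored_tree(X, y, k, max_splits, lam, rng):
    """Keep the best-scoring tree among k purely random candidates."""
    best, best_score = None, float("inf")
    for _ in range(k):
        p = rng.randint(1, max_splits)
        tree = build_random_tree(X, y, p, rng)
        score = penalized_error(tree, X, y, p, lam)
        if score < best_score:
            best, best_score = tree, score
    return best

def best_scored_forest(X, y, m=10, k=20, max_splits=8, lam=0.01, seed=0):
    rng = random.Random(seed)
    return [best_scored_tree(X, y, k, max_splits, lam, rng) for _ in range(m)]

def predict_forest(forest, x):
    # Majority vote over the selected trees; ties go to class 1.
    votes = sum(predict_tree(t, x) for t in forest)
    return 1 if 2 * votes >= len(forest) else 0
```

Calling `best_scored_forest(X, y)` on a list of feature vectors in [0, 1]^d with 0/1 labels returns a list of trees, and `predict_forest(forest, x)` classifies a new point. Setting `k=1` recovers a purely random forest under the same assumptions, which makes the effect of the selection step easy to compare.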

research
06/24/2019

Density-based Clustering with Best-scored Random Forest

Single-level density-based approach has long been widely acknowledged to...
research
05/09/2019

Best-scored Random Forest Density Estimation

This paper presents a brand new nonparametric density estimation strateg...
research
05/09/2019

Two-stage Best-scored Random Forest for Large-scale Regression

We propose a novel method designed for large-scale regression problems, ...
research
05/07/2018

Complete Analysis of a Random Forest Model

Random forests have become an important tool for improving accuracy in r...
research
11/09/2015

Spatially Coherent Random Forests

Spatially Coherent Random Forest (SCRF) extends Random Forest to create ...
research
12/14/2022

Simplification of Forest Classifiers and Regressors

We study the problem of sharing as many branching conditions of a given ...
research
12/16/2014

Random Forests Can Hash

Hash codes are a very efficient data representation needed to be able to...
