Multinomial Random Forests: Fill the Gap between Theoretical Consistency and Empirical Soundness

03/10/2019
by   Yiming Li, et al.
0

Random forests (RF) are one of the most widely used ensemble learning methods in classification and regression tasks. Despite its impressive performance, its theoretical consistency, which would ensure that its result converges to the optimum as the sample size increases, has been left far behind. Several consistent random forest variants have been proposed, yet all with relatively poor performance compared to the original random forests. In this paper, a novel RF framework named multinomial random forests (MRF) is proposed. In the MRF, an impurity-based multinomial distribution is constructed as the basis for the selection of a splitting point. This ensures that a certain degree of randomness is achieved while the overall quality of the trees is not much different from the original random forests. We prove the consistency of the MRF and demonstrate with multiple datasets that it performs similarly as the original random forests and better than existent consistent random forest variants for both classification and regression tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/23/2022

Consistency of The Oblique Decision Tree and Its Random Forest

The classification and regression tree (CART) and Random Forest (RF) are...
research
10/04/2013

Narrowing the Gap: Random Forests In Theory and In Practice

Despite widespread interest and practical use, the theoretical propertie...
research
02/08/2022

Is interpolation benign for random forests?

Statistical wisdom suggests that very complex models, interpolating trai...
research
04/23/2019

Regression-Enhanced Random Forests

Random forest (RF) methodology is one of the most popular machine learni...
research
12/14/2022

MABSplit: Faster Forest Training Using Multi-Armed Bandits

Random forests are some of the most widely used machine learning models ...
research
06/10/2015

Randomer Forests

Random forests (RF) is a popular general purpose classifier that has bee...
research
06/29/2023

Medoid splits for efficient random forests in metric spaces

This paper revisits an adaptation of the random forest algorithm for Fré...

Please sign up or login with your details

Forgot password? Click here to reset