Optimal Weighted Random Forests

05/17/2023
by   Xinyu Chen, et al.

The random forest (RF) algorithm has become a very popular prediction method because of its flexibility and promising accuracy. In RF, it is conventional to put equal weights on all the base learners (trees) when aggregating their predictions. However, the predictive performance of individual trees within the forest can differ substantially because of the randomization introduced by bootstrap sampling and feature selection. In this paper, we focus on RF for regression and propose two optimal weighting algorithms, namely the 1 Step Optimal Weighted RF (1step-WRF_opt) and the 2 Steps Optimal Weighted RF (2steps-WRF_opt), which combine the base learners using weights determined by weight choice criteria. Under some regularity conditions, we show that these algorithms are asymptotically optimal in the sense that the resulting squared loss and risk are asymptotically identical to those of the infeasible but best possible model averaging estimator. Numerical studies conducted on real-world data sets indicate that, in most cases, these algorithms outperform the equal-weight forest and two other weighted RFs proposed in the existing literature.
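
To make the idea of non-equal tree weights concrete, here is a minimal sketch in plain Python. It is not the 1step-WRF_opt or 2steps-WRF_opt procedure, whose weight choice criteria are not spelled out in this abstract. As a stand-in, it simply minimizes the mean squared error of the weighted combination over the probability simplex (nonnegative weights that sum to one) using Frank-Wolfe steps with exact line search. The tree predictions are simulated, and the names `weighted_loss` and `fit_simplex_weights` are hypothetical helpers introduced only for this illustration.

```python
import math
import random


def weighted_loss(preds, y, w):
    """Mean squared error of the weighted combination of tree predictions."""
    n = len(y)
    total = 0.0
    for i in range(n):
        fit = sum(w[m] * preds[m][i] for m in range(len(w)))
        total += (y[i] - fit) ** 2
    return total / n


def fit_simplex_weights(preds, y, n_iter=200):
    """Frank-Wolfe on the simplex, starting from equal weights.

    Assumption: plain squared error is the objective. This is a stand-in,
    not the weight choice criterion used in the paper.
    """
    M, n = len(preds), len(y)
    w = [1.0 / M] * M
    for _ in range(n_iter):
        fit = [sum(w[m] * preds[m][i] for m in range(M)) for i in range(n)]
        resid = [y[i] - fit[i] for i in range(n)]
        # Gradient of the MSE with respect to w_m.
        grad = [-2.0 / n * sum(resid[i] * preds[m][i] for i in range(n))
                for m in range(M)]
        # Simplex vertex (single tree) with the steepest linearized descent.
        best = min(range(M), key=lambda m: grad[m])
        q = [preds[best][i] - fit[i] for i in range(n)]
        qq = sum(v * v for v in q)
        if qq == 0.0:
            break
        # Exact line search for a quadratic objective, clipped to [0, 1].
        gamma = sum(resid[i] * q[i] for i in range(n)) / qq
        gamma = max(0.0, min(1.0, gamma))
        if gamma == 0.0:
            break  # no descent direction left inside the simplex
        w = [(1.0 - gamma) * w[m] for m in range(M)]
        w[best] += gamma
    return w


# Simulated example: five "trees" whose noise level grows with their index.
random.seed(0)
n, M = 200, 5
truth = [math.sin(i / 5.0) for i in range(n)]
y = [t + random.gauss(0.0, 0.1) for t in truth]
preds = [[t + random.gauss(0.0, 0.2 * (m + 1)) for t in truth]
         for m in range(M)]

loss_equal = weighted_loss(preds, y, [1.0 / M] * M)
w_opt = fit_simplex_weights(preds, y)
loss_opt = weighted_loss(preds, y, w_opt)
print("equal-weight MSE:", loss_equal)
print("optimized MSE:   ", loss_opt)
print("weights:", [round(v, 3) for v in w_opt])
```

Because the search starts from equal weights and each line-search step cannot increase the objective, the optimized loss on the data used for fitting is never worse than the equal-weight loss. In practice the weights would be chosen on data the trees have not memorized, or with a criterion that penalizes overfitting, and the comparison would be made on separate test data.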


