Improved Weighted Random Forest for Classification Problems

09/01/2020
by   Mohsen Shahhosseini, et al.

Several studies have shown that combining machine learning models in an appropriate way improves on the individual predictions made by the base models. The key to building a well-performing ensemble model is the diversity of the base models. Two of the most common ways to introduce diversity into decision trees are bagging and random forest. Bagging enhances diversity by sampling with replacement to generate many training data sets, while random forest additionally selects a random subset of features. This has made the random forest a winning candidate for many machine learning applications. However, assuming equal weights for all base decision trees does not seem reasonable, as the randomization of sampling and input feature selection may lead to different levels of decision-making ability across base decision trees. Therefore, we propose several algorithms that modify the weighting strategy of the regular random forest and consequently make better predictions. The designed weighting frameworks include optimal weighted random forest based on accuracy, optimal weighted random forest based on the area under the curve (AUC), performance-based weighted random forest, and several stacking-based weighted random forest models. The numerical results show that the proposed models are able to introduce significant improvements compared to the regular random forest.
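To make the contrast between equal and unequal tree weights concrete, here is a minimal sketch of a weighted vote. It is not the paper's algorithm: the abstract does not give the weighting formulas, so the sketch assumes each tree's weight is simply its validation accuracy normalized to sum to one. The function names `accuracy_weights` and `weighted_vote`, the three trees, and all numbers are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def accuracy_weights(accuracies: Sequence[float]) -> List[float]:
    """Normalize per-tree validation accuracies into weights that sum to 1.

    Assumption: weight proportional to accuracy. The paper's actual
    weighting schemes are not specified in the abstract.
    """
    total = sum(accuracies)
    if total <= 0:
        # Degenerate case: fall back to equal weights.
        return [1.0 / len(accuracies)] * len(accuracies)
    return [a / total for a in accuracies]


def weighted_vote(tree_probs: Sequence[Sequence[float]],
                  weights: Sequence[float]) -> int:
    """Return the class index with the largest weighted average probability."""
    n_classes = len(tree_probs[0])
    combined = [0.0] * n_classes
    for probs, w in zip(tree_probs, weights):
        for k in range(n_classes):
            combined[k] += w * probs[k]
    return max(range(n_classes), key=lambda k: combined[k])


# Hypothetical class-probability outputs of three trees for one sample
# (two classes), plus hypothetical validation accuracies for each tree.
tree_probs = [[0.8, 0.2], [0.3, 0.7], [0.3, 0.7]]
val_accuracies = [0.9, 0.6, 0.5]

# Regular random forest: every tree gets the same weight.
equal = weighted_vote(tree_probs, [1.0 / 3] * 3)

# Weighted variant: the more accurate first tree counts for more.
weighted = weighted_vote(tree_probs, accuracy_weights(val_accuracies))

print(equal, weighted)  # prints "1 0"
```

With equal weights the two weaker trees outvote the strongest one and the ensemble predicts class 1; with accuracy-proportional weights the combined probability shifts to class 0. In practice the weights would be estimated on held-out or out-of-bag data rather than on the data used to grow the trees.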


research
11/03/2021

Oblique and rotation double random forest

An ensemble of decision trees is known as Random Forest. As suggested by...
research
05/17/2023

Optimal Weighted Random Forests

The random forest (RF) algorithm has become a very popular prediction me...
research
03/16/2020

A Numerical Transform of Random Forest Regressors corrects Systematically-Biased Predictions

Over the past decade, random forest models have become widely used as a ...
research
01/14/2020

Perfecting the Crime Machine

This study explores using different machine learning techniques and work...
research
02/14/2016

Random Forest Based Approach for Concept Drift Handling

Concept drift has potential in smart grid analysis because the socio-eco...
research
12/19/2020

Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem

Combining machine learning with econometric analysis is becoming increas...
research
05/17/2021

Cross-Cluster Weighted Forests

Adapting machine learning algorithms to better handle the presence of na...
