Regression-Enhanced Random Forests

04/23/2019
by   Haozhe Zhang, et al.
38

Random forest (RF) methodology is one of the most popular machine learning techniques for prediction problems. In this article, we discuss some cases where random forests may suffer and propose a novel generalized RF method, namely regression-enhanced random forests (RERFs), that can improve on RFs by borrowing the strength of penalized parametric regression. The algorithm for constructing RERFs and selecting its tuning parameters is described. Both simulation study and real data examples show that RERFs have better predictive performance than RFs in important situations often encountered in practice. Moreover, RERFs may incorporate known relationships between the response and the predictors, and may give reliable predictions in extrapolation problems where predictions are required at points out of the domain of the training dataset. Strategies analogous to those described here can be used to improve other machine learning methods via combination with penalized parametric regression techniques.

READ FULL TEXT

page 3

page 7

page 8

page 10

research
03/10/2019

Multinomial Random Forests: Fill the Gap between Theoretical Consistency and Empirical Soundness

Random forests (RF) are one of the most widely used ensemble learning me...
research
11/23/2022

Consistency of The Oblique Decision Tree and Its Random Forest

The classification and regression tree (CART) and Random Forest (RF) are...
research
01/28/2015

ggRandomForests: Visually Exploring a Random Forest for Regression

Random Forests [Breiman:2001] (RF) are a fully non-parametric statistica...
research
07/05/2022

An Approximation Method for Fitted Random Forests

Random Forests (RF) is a popular machine learning method for classificat...
research
01/24/2023

Mixed Effects Random Forests for Personalised Predictions of Clinical Depression Severity

This work demonstrates how mixed effects random forests enable accurate ...
research
08/31/2021

When are Deep Networks really better than Random Forests at small sample sizes?

Random forests (RF) and deep networks (DN) are two of the most popular m...
research
12/26/2018

Comparing Spatial Regression to Random Forests for Large Environmental Data Sets

Environmental data may be "large" due to number of records, number of co...

Please sign up or login with your details

Forgot password? Click here to reset