A path in regression Random Forest looking for spatial dependence: a taxonomy and a systematic review

03/08/2023
by   Luca Patelli, et al.
0

Random Forest (RF) is a well-known data-driven algorithm applied in several fields thanks to its flexibility in modeling the relationship between the response variable and the predictors, also in case of strong non-linearities. In environmental applications, it often occurs that the phenomenon of interest may present spatial and/or temporal dependence that is not taken explicitly into account by RF in its standard version. In this work, we propose a taxonomy to classify strategies according to when (Pre-, In- and/or Post-processing) they try to include the spatial information into regression RF. Moreover, we provide a systematic review and classify the most recent strategies adopted to "adjust" regression RF to spatially dependent data, based on the criteria provided by the Preferred Reporting Items for Systematic reviews and Meta-Analysis (PRISMA). The latter consists of a reproducible methodology for collecting and processing existing literature on a specified topic from different sources. PRISMA starts with a query and ends with a set of scientific documents to review: we performed an online query on the 25^th October 2022 and, in the end, 32 documents were considered for review. The employed methodological strategies and the application fields considered in the 32 scientific documents are described and discussed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2022

Data-driven multinomial random forest

In this paper, we strengthen the previous weak consistency proof method ...
research
04/10/2018

Hyperparameters and Tuning Strategies for Random Forest

The random forest algorithm (RF) has several hyperparameters that have t...
research
06/27/2023

A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

Is there a statistical difference between Naive Bayes and Random Forest ...
research
07/30/2020

Random Forests for dependent data

Random forest (RF) is one of the most popular methods for estimating reg...
research
08/31/2020

Random Forest (RF) Kernel for Regression, Classification and Survival

Breiman's random forest (RF) can be interpreted as an implicit kernel ge...
research
05/30/2023

Sensitivity Analysis of RF+clust for Leave-one-problem-out Performance Prediction

Leave-one-problem-out (LOPO) performance prediction requires machine lea...

Please sign up or login with your details

Forgot password? Click here to reset