Treeging

10/03/2021
by Gregory L. Watson, et al.

Treeging combines the flexible mean structure of regression trees with the covariance-based prediction strategy of kriging into the base learner of an ensemble prediction algorithm. In so doing, it combines the strengths of the two primary types of spatial and space-time prediction models: (1) models with flexible mean structures (often machine learning algorithms) that assume independently distributed data, and (2) kriging or Gaussian process (GP) prediction models with rich covariance structures but simple mean structures. We investigate the predictive accuracy of treeging across a thorough and widely varied battery of spatial and space-time simulation scenarios, comparing it to ordinary kriging, random forest, and ensembles of ordinary kriging base learners. Treeging performs well across the board, whereas kriging suffers when dependence is weak or when spurious covariates are present, and random forest suffers when the covariates are less informative. Treeging also outperforms these competitors in predicting atmospheric pollutants (ozone and PM2.5) in several case studies. We examine sensitivity to tuning parameters (number of base learners and training data sampling proportion), finding that they follow the familiar intuition of their random forest counterparts. We include a discussion of scalability, noting that any covariance approximation technique that expedites kriging (GP) may be similarly applied to expedite treeging.
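The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a simplified two-stage base learner: a regression tree fits the mean from the covariates, and a Gaussian process (standing in for kriging) models the tree's residuals over the spatial coordinates. Base learners are trained on random subsamples and averaged, which mirrors the two tuning parameters named above (number of base learners and training data sampling proportion). All names (`TreeKrigeLearner`, `fit_ensemble`, `predict_ensemble`), the kernel choice, and the synthetic data are hypothetical; scikit-learn is assumed to be available.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel


class TreeKrigeLearner:
    """Hypothetical base learner: tree mean on covariates plus a GP on residuals."""

    def __init__(self, max_depth=3, random_state=None):
        self.tree = DecisionTreeRegressor(max_depth=max_depth, random_state=random_state)
        # Assumed covariance: scaled RBF plus a nugget (white noise) term.
        kernel = 1.0 * RBF(length_scale=1.0) + WhiteKernel(noise_level=0.1)
        self.gp = GaussianProcessRegressor(kernel=kernel)

    def fit(self, coords, covariates, y):
        # Stage 1: flexible mean structure from the covariates.
        self.tree.fit(covariates, y)
        # Stage 2: spatially correlated residuals modelled over the coordinates.
        residuals = y - self.tree.predict(covariates)
        self.gp.fit(coords, residuals)
        return self

    def predict(self, coords, covariates):
        return self.tree.predict(covariates) + self.gp.predict(coords)


def fit_ensemble(coords, covariates, y, n_learners=10, sample_prop=0.5, seed=0):
    """Fit each base learner on a random subsample drawn without replacement."""
    rng = np.random.default_rng(seed)
    n = len(y)
    m = max(2, int(sample_prop * n))
    learners = []
    for b in range(n_learners):
        idx = rng.choice(n, size=m, replace=False)
        learner = TreeKrigeLearner(random_state=b)
        learners.append(learner.fit(coords[idx], covariates[idx], y[idx]))
    return learners


def predict_ensemble(learners, coords, covariates):
    """Average the predictions of all base learners."""
    return np.mean([l.predict(coords, covariates) for l in learners], axis=0)


# Synthetic data (an assumption for illustration): a step in the first
# covariate plus a smooth spatial trend and a little noise.
rng = np.random.default_rng(42)
coords = rng.uniform(0, 1, size=(150, 2))
covariates = rng.normal(size=(150, 2))
y = (2.0 * (covariates[:, 0] > 0)
     + np.sin(2 * np.pi * coords[:, 0])
     + 0.1 * rng.normal(size=150))

train, test = np.arange(100), np.arange(100, 150)
learners = fit_ensemble(coords[train], covariates[train], y[train])
preds = predict_ensemble(learners, coords[test], covariates[test])
mse = float(np.mean((preds - y[test]) ** 2))
```

In this sketch the tree and the covariance model are fitted one after the other for simplicity; a method that estimates the mean and covariance jointly within each base learner would differ from it. Because each base learner only sees a subsample, any covariance approximation that speeds up the GP fit would speed up the ensemble in the same way.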


research
08/23/2022

pystacked: Stacking generalization and machine learning in Stata

pystacked implements stacked generalization (Wolpert, 1992) for regressi...
research
12/15/2018

Consistent Estimation of Residual Variance with Random Forest Out-Of-Bag Errors

The issue of estimating residual variance in regression models has exper...
research
02/14/2016

Random Forest Based Approach for Concept Drift Handling

Concept drift has potential in smart grid analysis because the socio-eco...
research
02/17/2021

BEDS: Bagging ensemble deep segmentation for nucleus segmentation with testing stage stain augmentation

Reducing outcome variance is an essential task in deep learning based me...
research
03/19/2019

Random Pairwise Shapelets Forest

Shapelet is a discriminative subsequence of time series. An advanced sha...
research
01/31/2018

The Impact of Automated Parameter Optimization on Defect Prediction Models

Defect prediction models---classifiers that identify defect-prone softwa...
