A Unified Framework for Random Forest Prediction Error Estimation

12/16/2019
by Benjamin Lu, et al.

We introduce a unified framework for random forest prediction error estimation based on a novel estimator of the conditional prediction error distribution function. Our framework enables immediate estimation of key parameters often of interest, including conditional mean squared prediction errors, conditional biases, and conditional quantiles, by a straightforward plug-in routine. Our approach is particularly well-adapted for prediction interval estimation, which has received less attention in the random forest literature despite its practical utility; we show via simulations that our proposed prediction intervals are competitive with, and in some settings outperform, existing methods. To establish theoretical grounding for our framework, we prove pointwise uniform consistency of a more stringent version of our estimator of the conditional prediction error distribution. In addition to providing a suite of measures of prediction uncertainty, our general framework is applicable to many variants of the random forest algorithm. The estimators introduced here are implemented in the R package forestError.
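The plug-in idea can be illustrated with a minimal sketch. It assumes we already hold a set of out-of-bag prediction errors and nonnegative weights that say how relevant each error is to a given test point. The abstract does not say how such weights are built, so they are treated as plain inputs here. The function names, the error convention (observed minus predicted), and the weighted empirical distribution are all assumptions made for illustration. This is not the forestError API, and it is not the authors' estimator.

```python
from bisect import bisect_left
from itertools import accumulate


def plug_in_summaries(errors, weights, alpha=0.05):
    """Summarize a weighted empirical error distribution (hypothetical helper).

    errors  : out-of-bag errors, assumed to be observed minus predicted
    weights : nonnegative relevance weights for one test point (assumed given)
    alpha   : miscoverage level for the two quantiles
    """
    if not errors or len(errors) != len(weights):
        raise ValueError("errors and weights must be non-empty and equal length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")

    # Sort errors and build the weighted empirical CDF over them.
    pairs = sorted(zip(errors, weights))
    errs = [e for e, _ in pairs]
    probs = [w / total for _, w in pairs]
    cdf = list(accumulate(probs))

    def quantile(p):
        # Smallest error whose cumulative weight reaches p
        # (small tolerance guards against floating-point round-off).
        idx = bisect_left(cdf, p - 1e-12)
        return errs[min(idx, len(errs) - 1)]

    return {
        # Plug-in conditional bias: weighted mean of the errors.
        "bias": sum(p * e for p, e in zip(probs, errs)),
        # Plug-in conditional MSPE: weighted mean of the squared errors.
        "mspe": sum(p * e * e for p, e in zip(probs, errs)),
        "lower": quantile(alpha / 2),
        "upper": quantile(1 - alpha / 2),
    }


def prediction_interval(prediction, summaries):
    # Shift the error quantiles by the point prediction.
    return (prediction + summaries["lower"], prediction + summaries["upper"])


if __name__ == "__main__":
    s = plug_in_summaries([-1.0, 0.0, 1.0, 2.0], [1, 1, 1, 1], alpha=0.5)
    print(s)
    print(prediction_interval(10.0, s))
```

Every quantity in the sketch is read off the same estimated distribution, which is the sense in which a single distribution estimate yields bias, mean squared prediction error, and interval endpoints together.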


