Flexible domain prediction using mixed effects random forests

01/26/2022
by Patrick Krennmair, et al.

This paper promotes the use of random forests as versatile tools for estimating spatially disaggregated indicators when area-specific sample sizes are small. Small area estimators are predominantly conceptualized within the regression setting and rely on linear mixed models to account for the hierarchical structure of the survey data. In contrast, machine learning methods offer non-linear and non-parametric alternatives that combine excellent predictive performance with a reduced risk of model misspecification. Mixed effects random forests combine the advantages of regression forests with the ability to model hierarchical dependencies. This paper provides a coherent framework based on mixed effects random forests for estimating small area averages and proposes a non-parametric bootstrap estimator for assessing the uncertainty of the estimates. We illustrate the advantages of the proposed methodology using Mexican income data from the state of Nuevo León. Finally, we evaluate the methodology in model-based and design-based simulations, comparing it to traditional regression-based approaches for estimating small area averages.
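
As a rough sketch of what "regression forests with hierarchical dependencies" can mean, one common way to write a mixed effects random forest with area-level random intercepts is shown below. The notation is an assumption for illustration and is not taken from the abstract: i indexes areas, j indexes units within an area, f is an unknown function estimated by a random forest, u_i is an area-level random effect, e_ij is a unit-level error, and N_i is the assumed population size of area i. The normality of u_i and e_ij is likewise a conventional assumption, not a claim from the abstract.

```latex
% Assumed model: forest-estimated mean function plus area-level random intercept
y_{ij} = f(x_{ij}) + u_i + e_{ij},
\qquad u_i \sim N(0, \sigma_u^2),
\quad e_{ij} \sim N(0, \sigma_e^2)

% Assumed plug-in estimator of the mean of area i,
% averaging forest predictions over the N_i population units
\hat{\mu}_i = \frac{1}{N_i} \sum_{j=1}^{N_i} \hat{f}(x_{ij}) + \hat{u}_i
```

Under these assumptions, the forest replaces the linear predictor of a classical linear mixed model, while the random intercept carries the between-area dependence. The second expression presumes that unit-level covariates are available for the whole population of each area.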

research
04/22/2022

Analysing Opportunity Cost of Care Work using Mixed Effects Random Forests under Aggregated Census Data

Reliable estimators of the spatial distribution of socio-economic indica...
research
09/24/2013

Random Forests on Distance Matrices for Imaging Genetics Studies

We propose a non-parametric regression methodology, Random Forests on Di...
research
02/08/2019

Censored Quantile Regression Forests

Random forests are a powerful non-parametric regression method but are sev...
research
06/11/2015

Mondrian Forests for Large-Scale Regression when Uncertainty Matters

Many real-world regression problems demand a measure of the uncertainty ...
research
01/24/2023

Mixed Effects Random Forests for Personalised Predictions of Clinical Depression Severity

This work demonstrates how mixed effects random forests enable accurate ...
research
07/25/2018

Stripe-Based Fragility Analysis of Concrete Bridge Classes Using Machine Learning Techniques

A framework for the generation of bridge-specific fragility utilizing th...
research
01/25/2022

A Nested Error Regression Model with High Dimensional Parameter for Small Area Estimation

In this paper we propose a flexible nested error regression small area m...