Model-assisted estimation through random forests in finite population sampling

02/22/2020
by   Mehdi Dagdoug, et al.

Surveys are used to collect data on a subset of a finite population. Most often, the interest lies in estimating finite population parameters such as population totals and means. In some surveys, auxiliary information is available at the population level. This information may be incorporated into the estimation procedures to increase their precision. Model-assisted procedures may be based on parametric or nonparametric models. In this paper, we propose a new class of model-assisted procedures that use random forests whose partitions are built at the population level as well as at the sample level. We derive the associated variance estimators and establish the theoretical properties of the proposed procedures. We also discuss a model-calibration procedure that can handle multiple survey variables. Finally, the results of a simulation study suggest that the proposed point and variance estimation procedures perform well in terms of bias, efficiency, and coverage in a wide variety of settings.
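To make the idea of a model-assisted estimator concrete, the following is a minimal sketch and not the authors' procedure. It uses the standard generalized difference form: the sum of model predictions over the whole population, plus a design-weighted sum of the sample residuals. As a stand-in for a random forest it fits a toy ensemble of bootstrap-aggregated one-split regression trees on a single auxiliary variable, without survey weights. All function names, the unweighted fit, and the simulated population are assumptions made for illustration.

```python
import random
from statistics import mean


def fit_stump(xs, ys):
    """Fit a one-split regression tree (a stump) by least squares."""
    best = None
    # The largest x is skipped as a threshold so both sides stay non-empty.
    for t in sorted(set(xs))[:-1]:
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        m_left, m_right = mean(left), mean(right)
        sse = (sum((y - m_left) ** 2 for y in left)
               + sum((y - m_right) ** 2 for y in right))
        if best is None or sse < best[0]:
            best = (sse, t, m_left, m_right)
    if best is None:  # all x identical: fall back to the overall mean
        overall = mean(ys)
        return lambda x: overall
    _, t, m_left, m_right = best
    return lambda x: m_left if x <= t else m_right


def fit_forest(xs, ys, n_trees=50, seed=0):
    """Toy forest: average of stumps fitted on bootstrap resamples."""
    rng = random.Random(seed)
    n = len(xs)
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        trees.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return lambda x: mean([tree(x) for tree in trees])


def horvitz_thompson_total(y_sample, pi_sample):
    """Design-based estimator of the total: sum of y_k / pi_k over the sample."""
    return sum(y / p for y, p in zip(y_sample, pi_sample))


def model_assisted_total(x_pop, sample_idx, y_sample, pi_sample, n_trees=50, seed=0):
    """Population sum of predictions plus weighted sum of sample residuals."""
    x_sample = [x_pop[k] for k in sample_idx]
    m_hat = fit_forest(x_sample, y_sample, n_trees=n_trees, seed=seed)
    prediction_total = sum(m_hat(x) for x in x_pop)
    residuals = [y - m_hat(x) for x, y in zip(x_sample, y_sample)]
    return prediction_total + horvitz_thompson_total(residuals, pi_sample)


if __name__ == "__main__":
    rng = random.Random(1)
    N, n = 200, 40
    # Simulated population (hypothetical): y depends on x plus noise.
    x_pop = [rng.uniform(0, 10) for _ in range(N)]
    y_pop = [3 + 2 * x + rng.gauss(0, 1) for x in x_pop]
    sample_idx = rng.sample(range(N), n)   # simple random sampling without replacement
    y_sample = [y_pop[k] for k in sample_idx]
    pi_sample = [n / N] * n                # equal inclusion probabilities
    print("true total:    ", sum(y_pop))
    print("HT estimate:   ", horvitz_thompson_total(y_sample, pi_sample))
    print("model-assisted:", model_assisted_total(x_pop, sample_idx, y_sample, pi_sample))
```

The survey variable is needed only for sampled units, while the auxiliary variable must be known for every population unit, which is the sense in which auxiliary information is available at the population level. In practice a full random forest implementation would replace the toy ensemble; the residual correction term is what keeps the estimator anchored to the sampling design even when the fitted model is poor.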

research
12/14/2020

Model-assisted estimation in high-dimensional settings for survey data

Model-assisted estimators have attracted a lot of attention in the last ...
research
08/09/2022

Model-Assisted Estimators under Nonresponse in Sample Surveys

In the presence of auxiliary information, model-assisted estimators use ...
research
05/09/2019

Double-calibration estimators accounting for under-coverage and nonresponse in socio-economic surveys

Under-coverage and nonresponse problems are jointly present in most soci...
research
09/08/2020

Data-assisted combustion simulations with dynamic submodel assignment using random forests

In this investigation, we outline a data-assisted approach that employs ...
research
12/15/2017

Automated Selection of Post-Strata using a Model-Assisted Regression Tree Estimator

Auxiliary information can increase the efficiency of survey estimators t...
research
10/04/2020

Efficient multiply robust imputation in the presence of influential units in surveys

Item nonresponse is a common issue in surveys. Because unadjusted estima...
research
12/06/2022

Efficient Stratification Method for Socioeconomic Survey in Remote Areas

The problems that exist in implementing a sampling design for socio-econ...
