R package SamplingStrata: new developments and extension to Spatial Sampling

by Marco Ballin, et al.

The R package SamplingStrata was developed in 2011 as a tool for optimizing the design of stratified samples. The optimization considers the stratification variables available in the sampling frame and the precision constraints on the target estimates of the survey (Ballin and Barcaroli, 2014). The genetic algorithm underlying the optimization step explores the space of possible alternative stratifications and determines, for each of them, the best allocation, that is, the one of minimum total size that satisfies the precision constraints; the final optimal solution is the one that ensures the global minimum sample size. One fundamental requirement for this approach to be feasible is the ability to estimate the variability of the target variables in the generated strata. In general, target variable values are not available in the frame, only proxy values, so the anticipated variance is calculated by modelling the relationship between target and proxy variables. In spatial sampling, it is important to consider not only the total model variance but also the covariance arising from spatial autocorrelation. The latest release of SamplingStrata takes both components of variance into account, making it possible to exploit spatial autocorrelation to obtain more efficient samples.
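The notion of a "best allocation" of minimum total size can be illustrated with a small sketch. This is not the package's code: it assumes a single target variable, simple random sampling without replacement within strata, and the classical Neyman allocation, whereas SamplingStrata handles several target variables and searches over stratifications. The function names and the example figures are hypothetical, and the sketch ignores integer rounding and the bound that a stratum sample cannot exceed the stratum size.

```python
def min_sample_size(N, S, mean, cv):
    """Minimum total sample size (real-valued) under Neyman allocation.

    N    : stratum population sizes
    S    : stratum standard deviations of the target variable
    mean : population mean of the target variable
    cv   : maximum allowed coefficient of variation of the estimated mean
    """
    N_tot = sum(N)
    W = [Nh / N_tot for Nh in N]                  # stratum weights
    target_var = (cv * mean) ** 2                 # precision constraint as a variance
    num = sum(w * s for w, s in zip(W, S)) ** 2
    den = target_var + sum(w * s * s for w, s in zip(W, S)) / N_tot
    n = num / den
    # Neyman allocation: n_h proportional to N_h * S_h
    shares = [Nh * s for Nh, s in zip(N, S)]
    total = sum(shares)
    alloc = [n * sh / total for sh in shares]
    return n, alloc


def stratified_variance(N, S, alloc):
    """Variance of the stratified mean estimator for a given allocation."""
    N_tot = sum(N)
    return sum(
        (Nh / N_tot) ** 2 * s * s * (1.0 / nh - 1.0 / Nh)
        for Nh, s, nh in zip(N, S, alloc)
    )


# Hypothetical stratification: three strata, 5% CV target on the mean.
sizes = [500, 300, 200]
std_devs = [10.0, 20.0, 40.0]
n, alloc = min_sample_size(sizes, std_devs, mean=100.0, cv=0.05)
print(n, alloc)
```

An optimizer of the kind described above would repeat such a computation for every candidate stratification and keep the one with the smallest total size; here the variance achieved by the returned allocation equals the target variance by construction.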




Minimum Sample Size Allocation in Stratified Sampling Under Constraints on Variance and Strata Sample Sizes

We derive optimality conditions for the optimal sample allocation proble...

Variance-Optimal Offline and Streaming Stratified Random Sampling

Stratified random sampling (SRS) is a fundamental sampling technique tha...

Recursive Neyman Algorithm for Optimum Sample Allocation under Box Constraints on Sample Sizes in Strata

The optimal sample allocation in stratified sampling is one of the basic...

csSampling: An R Package for Bayesian Models for Complex Survey Data

We present csSampling, an R package for estimation of Bayesian models fo...

Two-stage Sampling Design and Sample Selection with the R package R2BEAT

R2BEAT (R "to" Bethel Extended Allocation for Two-stage sampling) is an ...

An Embedded Model Estimator for Non-Stationary Random Functions using Multiple Secondary Variables

An algorithm for non-stationary spatial modelling using multiple seconda...

Minimum Variance Embedded Auto-associative Kernel Extreme Learning Machine for One-class Classification

One-class classification (OCC) needs samples from only a single class to...
