Optimal Sampling Design Under Logistical Constraints with Mixed Integer Programming

02/11/2023
by   Connie Okasaki, et al.
0

The goal of survey design is often to minimize the errors associated with inference: the total of bias and variance. Random surveys are common because they allow the use of theoretically unbiased estimators. In practice however, such design-based approaches are often unable to account for logistical or budgetary constraints. Thus, they may result in samples that are logistically inefficient, or infeasible to implement. Various balancing and optimal sampling techniques have been proposed to improve the statistical efficiency of such designs, but few models have attempted to explicitly incorporate logistical and financial constraints. We introduce a mixed integer linear program (MILP) for optimal sampling design, capable of capturing a variety of constraints and a wide class of Bayesian regression models. We demonstrate the use of our model on three spatial sampling problems of increasing complexity, including the real logistics of the US Forest Service Forest Inventory and Analysis survey of Tanana, Alaska. Our methodological contribution to survey design is significant because the proposed modeling framework makes it possible to generate high-quality sampling designs and inferences while satisfying practical constraints defined by the user. The technical novelty of the method is the explicit integration of Bayesian statistical models in combinatorial optimization. This integration might allow a paradigm shift in spatial sampling under constrained budgets or logistics.

READ FULL TEXT
research
03/08/2023

DisjunctiveProgramming.jl: Generalized Disjunctive Programming Models and Algorithms for JuMP

We present a Julia package, DisjunctiveProgramming.jl, that extends the ...
research
09/27/2021

Mixed Integer Neural Inverse Design

In computational design and fabrication, neural networks are becoming im...
research
04/16/2019

Bayesian Mixed Effects Model Estimation under Informative Sampling

When random effects are correlated with the response variable of interes...
research
01/20/2017

Bayesian Network Learning via Topological Order

We propose a mixed integer programming (MIP) model and iterative algorit...
research
03/18/2021

Optimal soil sampling design based on the maxvol algorithm

Spatial soil sampling is an integral part of a soil survey aimed at crea...
research
12/10/2020

PoolTestR: An R package for estimating prevalence and regression modelling with pooled samples

Pooled testing (also known as group testing), where diagnostic tests are...
research
10/19/2018

Remote sensing to reduce the effects of spatial autocorrelation on design-based inference for forest inventory using systematic samples

Systematic sampling is often used to select plot locations for forest in...

Please sign up or login with your details

Forgot password? Click here to reset