Optimal sampling for design-based estimators of regression models

06/16/2021
by   Tong Chen, et al.
0

Two-phase designs measure variables of interest on a subcohort where the outcome and covariates are readily available or cheap to collect on all individuals in the cohort. Given limited resource availability, it is of interest to find an optimal design that includes more informative individuals in the final sample. We explore the optimal designs and efficiencies for analysis by design-based estimators. Generalized raking is an efficient design-based estimator that improves on the inverse-probability weighted (IPW) estimator by adjusting weights based on the auxiliary information. We derive a closed-form solution of the optimal design for estimating regression coefficients from generalized raking estimators. We compare it with the optimal design for analysis via the IPW estimator and other two-phase designs in measurement-error settings. We consider general two-phase designs where the outcome variable and variables of interest can be continuous or discrete. Our results show that the optimal designs for analysis by the two design-based estimators can be very different. The optimal design for IPW estimation is optimal for analysis via the IPW estimator and typically gives near-optimal efficiency for generalized raking, though we show there is potential improvement in some settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2020

Optimal multi-wave sampling for regression modelling in two-phase designs

Two-phase designs involve measuring extra variables on a subset of the c...
research
11/11/2022

Optimal Designs of Two-Phase Case-Control Studies for General Predictor Effects

Under two-phase designs, the outcome and several covariates and confound...
research
03/21/2022

Choosing good subsamples for regression modelling

A common problem in health research is that we have a large database wit...
research
05/24/2023

On estimators of the mean of infinite dimensional data in finite populations

The Horvitz-Thompson (HT), the Rao-Hartley-Cochran (RHC) and the general...
research
08/15/2019

Isotonic regression discontinuity designs

In isotonic regression discontinuity designs, the average outcome and th...
research
12/13/2018

Optimal designs for series estimation in nonparametric regression with correlated data

In this paper we investigate the problem of designing experiments for se...
research
01/14/2021

Optimal designs for comparing regression curves – dependence within and between groups

We consider the problem of designing experiments for the comparison of t...

Please sign up or login with your details

Forgot password? Click here to reset