Multilevel calibration weighting for survey data

02/17/2021
by   Eli Ben-Michael, et al.
0

A pressing challenge in modern survey research is to find calibration weights when covariates are high dimensional and especially when interactions between variables are important. Traditional approaches like raking typically fail to balance higher-order interactions; and post-stratification, which exactly balances all interactions, is only feasible for a small number of variables. In this paper, we propose multilevel calibration weighting, which enforces tight balance constraints for marginal balance and looser constraints for higher-order interactions. This incorporates some of the benefits of post-stratification while retaining the guarantees of raking. We then correct for the bias due to the relaxed constraints via a flexible outcome model; we call this approach Double Regression with Post-stratification (DRP). We characterize the asymptotic properties of these estimators and show that the proposed calibration approach has a dual representation as a multilevel model for survey response. We assess the performance of this method via an extensive simulation study and show how it can reduce bias in a case-study of a large-scale survey of voter intention in the 2016 U.S. presidential election. The approach is available in the multical R package.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

07/25/2017

Bayesian hierarchical weighting adjustment and survey inference

We combine Bayesian prediction and weighted inference as a unified appro...
05/09/2019

Double-calibration estimators accounting for under-coverage and nonresponse in socio-economic surveys

Under-coverage and nonresponse problems are jointly present in most soci...
06/28/2019

Bias-Variance Trade-Off in Hierarchical Probabilistic Models Using Higher-Order Feature Interactions

Hierarchical probabilistic models are able to use a large number of para...
09/17/2021

Cross-Leverage Scores for Selecting Subsets of Explanatory Variables

In a standard regression problem, we have a set of explanatory variables...
12/01/2021

Investigating an Alternative for Estimation from a Nonprobability Sample: Matching plus Calibration

Matching a nonprobability sample to a probability sample is one strategy...
05/20/2020

Dyadic Reciprocity as a Function of Covariates

Reciprocity in dyadic interactions is common and a topic of interest acr...
06/27/2019

A Python Library For Empirical Calibration

Dealing with biased data samples is a common task across many statistica...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.