Multilevel calibration weighting for survey data

by   Eli Ben-Michael, et al.

A pressing challenge in modern survey research is to find calibration weights when covariates are high dimensional and especially when interactions between variables are important. Traditional approaches like raking typically fail to balance higher-order interactions; and post-stratification, which exactly balances all interactions, is only feasible for a small number of variables. In this paper, we propose multilevel calibration weighting, which enforces tight balance constraints for marginal balance and looser constraints for higher-order interactions. This incorporates some of the benefits of post-stratification while retaining the guarantees of raking. We then correct for the bias due to the relaxed constraints via a flexible outcome model; we call this approach Double Regression with Post-stratification (DRP). We characterize the asymptotic properties of these estimators and show that the proposed calibration approach has a dual representation as a multilevel model for survey response. We assess the performance of this method via an extensive simulation study and show how it can reduce bias in a case-study of a large-scale survey of voter intention in the 2016 U.S. presidential election. The approach is available in the multical R package.



There are no comments yet.


page 1

page 2

page 3

page 4


Bayesian hierarchical weighting adjustment and survey inference

We combine Bayesian prediction and weighted inference as a unified appro...

Double-calibration estimators accounting for under-coverage and nonresponse in socio-economic surveys

Under-coverage and nonresponse problems are jointly present in most soci...

Bias-Variance Trade-Off in Hierarchical Probabilistic Models Using Higher-Order Feature Interactions

Hierarchical probabilistic models are able to use a large number of para...

Cross-Leverage Scores for Selecting Subsets of Explanatory Variables

In a standard regression problem, we have a set of explanatory variables...

Investigating an Alternative for Estimation from a Nonprobability Sample: Matching plus Calibration

Matching a nonprobability sample to a probability sample is one strategy...

Dyadic Reciprocity as a Function of Covariates

Reciprocity in dyadic interactions is common and a topic of interest acr...

A Python Library For Empirical Calibration

Dealing with biased data samples is a common task across many statistica...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.