Adjustment for Biased Sampling Using NHANES Derived Propensity Weights

04/21/2021
by   Olivia M. Bernstein, et al.
0

The Consent-to-Contact (C2C) registry at the University of California, Irvine collects data from community participants to aid in the recruitment to clinical research studies. Self-selection into the C2C likely leads to bias due in part to enrollees having more years of education relative to the US general population. Salazar et al. (2020) recently used the C2C to examine associations of race/ethnicity with participant willingness to be contacted about research studies. To address questions about generalizability of estimated associations we estimate propensity for self-selection into the convenience sample weights using data from the National Health and Nutrition Examination Survey (NHANES). We create a combined dataset of C2C and NHANES subjects and compare different approaches (logistic regression, covariate balancing propensity score, entropy balancing, and random forest) for estimating the probability of membership in C2C relative to NHANES. We propose methods to estimate the variance of parameter estimates that account for uncertainty that arises from estimating propensity weights. Simulation studies explore the impact of propensity weight estimation on uncertainty. We demonstrate the approach by repeating the analysis by Salazar et al. with the deduced propensity weights for the C2C subjects and contrast the results of the two analyses. This method can be implemented using our estweight package in R available on GitHub.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/01/2019

A Framework for Covariate Balance using Bregman Distances

A common goal in observational research is to estimate marginal causal e...
research
10/19/2021

A Bayesian Approach for the Variance of Fine Stratification

Fine stratification is a popular design as it permits the stratification...
research
08/01/2022

Calculating incidence of Influenza-like and COVID-like symptoms from Flutracking participatory survey data

This article describes a new method for estimating weekly incidence (new...
research
01/19/2021

Robust Bayesian Inference for Big Data: Combining Sensor-based Records with Traditional Survey Data

Big Data often presents as massive non-probability samples. Not only is ...
research
02/11/2018

Uncharted Forest a Technique for Exploratory Data Analysis of Provenance Studies

Exploratory data analysis is a crucial task for developing effective cla...
research
07/08/2021

Balancing Higher Moments Matters for Causal Estimation: Further Context for the Results of Setodji et al. (2017)

We expand upon the simulation study of Setodji et al. (2017) which compa...
research
07/12/2023

balance – a Python package for balancing biased data samples

Surveys are an important research tool, providing unique measurements on...

Please sign up or login with your details

Forgot password? Click here to reset