Integrating Big Data and Survey Data for Efficient Estimation of the Median

06/28/2023
by Ryan Covey, et al.

An ever-increasing deluge of big data is becoming available to national statistical offices globally, but it is well documented that statistics produced by big data alone often suffer from selection bias and are not usually representative of the population at large. In this paper, we construct a new design-based estimator of the median by integrating big data and survey data. Our estimator is asymptotically unbiased and has a smaller variance than a median estimator produced using survey data alone.
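The abstract does not spell out the estimator, so the following is only a minimal sketch of one generic way to combine the two sources, not the authors' method. It assumes that the big data set records the variable without error for every unit it covers, that each survey respondent can be flagged as covered or not covered by the big data set, and that the survey supplies design weights. Covered units are counted once each from the big data, and the survey is used only to represent the part of the population the big data misses. All function and argument names (`weighted_median`, `integrated_median`, `in_big`) are hypothetical.

```python
import numpy as np


def weighted_median(values, weights):
    """Smallest value at which the weighted empirical CDF reaches 0.5."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(values)
    v, w = values[order], weights[order]
    cdf = np.cumsum(w) / w.sum()
    # Index of the first position where cdf >= 0.5.
    return v[np.searchsorted(cdf, 0.5)]


def integrated_median(y_big, y_survey, w_survey, in_big):
    """Illustrative combined estimator (an assumption, not the paper's).

    y_big    : values for every unit covered by the big data set
    y_survey : values for the survey respondents
    w_survey : survey design weights
    in_big   : boolean flags, True if the respondent is also in the big data
    """
    y_big = np.asarray(y_big, dtype=float)
    y_survey = np.asarray(y_survey, dtype=float)
    w_survey = np.asarray(w_survey, dtype=float)
    outside = ~np.asarray(in_big, dtype=bool)

    # Big data units count once each; survey units outside the big data
    # stand in for the uncovered part of the population via their weights.
    values = np.concatenate([y_big, y_survey[outside]])
    weights = np.concatenate([np.ones(len(y_big)), w_survey[outside]])
    return weighted_median(values, weights)


# Toy data: the big data set covers only high-valued units (selection bias).
y_big = [10, 11, 12, 13]
y_survey = [1, 2, 12, 3]
w_survey = [2, 2, 2, 2]
in_big = [False, False, True, False]

big_only = weighted_median(y_big, np.ones(len(y_big)))
survey_only = weighted_median(y_survey, w_survey)
combined = integrated_median(y_big, y_survey, w_survey, in_big)
```

In this toy setup the big-data-only median is pulled toward the over-represented high values, while the combined estimate uses the survey to fill in the uncovered low-valued units. Whether such a combination reduces variance relative to the survey alone depends on the design and coverage, which the sketch does not examine.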

research
07/22/2023

Survey Design and Estimating Equations when Combining Big Data with Probability Samples

The use of big data in official statistics and the applied sciences is a...
research
12/19/2022

A Bayesian algorithm for sample selection bias correction

In this paper we present a technique to couple non-traditional data with...
research
02/17/2021

Big Data meets Causal Survey Research: Understanding Nonresponse in the Recruitment of a Mixed-mode Online Panel

Survey scientists increasingly face the problem of high-dimensionality i...
research
06/06/2023

A Calibrated Data-Driven Approach for Small Area Estimation using Big Data

Where the response variable in a big data set is consistent with the var...
research
08/09/2017

Using Deep Neural Networks to Automate Large Scale Statistical Analysis for Big Data Applications

Statistical analysis (SA) is a complex process to deduce population prop...
research
08/09/2018

A Panel Quantile Approach to Attrition Bias in Big Data: Evidence from a Randomized Experiment

This paper introduces a quantile regression estimator for panel data mod...
research
02/11/2020

Big Data and model-based survey sampling

Big Data are huge amounts of digital information that are automatically ...
