A Bayesian algorithm for sample selection bias correction

12/19/2022
by   Valerio Astuti, et al.
0

In this paper we present a technique to couple non-traditional data with statistics based on survey data, in order to partially correct for the bias produced by non-random sample selections. All major social media platforms represent huge samples of the general population, generated by a self-selection process. This implies that they are not representative of the larger public, and there are problems in extrapolating conclusions drawn from these samples to the whole population. We present an algorithm to integrate these massive data with ones coming from traditional sources, with the properties of being less extensive but more reliable. This integration allows to exploit the best of both worlds and reach the detail of typical "big data" sources and the representativeness of a carefully designed sample survey.

READ FULL TEXT
research
06/28/2023

Integrating Big Data and Survey Data for Efficient Estimation of the Median

An ever-increasing deluge of big data is becoming available to national ...
research
09/12/2023

Artificially Intelligent Opinion Polling

We seek to democratise public-opinion research by providing practitioner...
research
05/18/2021

An Efficient Approach for Statistical Matching of Survey Data Trough Calibration, Optimal Transport and Balanced Sampling

Statistical matching aims to integrate two statistical sources. These so...
research
08/19/2019

Improving multilevel regression and poststratification with structured priors

A central theme in the field of survey statistics is estimating populati...
research
02/23/2020

Sample Debiasing in the Themis Open World Database System (Extended Version)

Open world database management systems assume tuples not in the database...
research
10/19/2022

Combining Data from Surveys and Related Sources

To improve the precision of inferences and reduce costs there is conside...

Please sign up or login with your details

Forgot password? Click here to reset