Sampling techniques for big data analysis in finite population inference

01/29/2018
by   Jae Kwang Kim, et al.
0

In analyzing big data for finite population inference, it is critical to adjust for the selection bias in the big data. In this paper, we propose two methods of reducing the selection bias associated with the big data sample. The first method uses a version of inverse sampling by incorporating auxiliary information from external sources, and the second one borrows the idea of data integration by combining the big data sample with an independent probability sample. Two simulation studies show that the proposed methods are unbiased and have better coverage rates than their alternatives. In addition, the proposed methods are easy to implement in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2020

Data Integration by combining big data and survey sample data for finite population inference

The statistical challenges in using big data for making valid statistica...
research
02/11/2020

Big Data and model-based survey sampling

Big Data are huge amounts of digital information that are automatically ...
research
08/12/2020

Sampling Based Approximate Skyline Calculation on Big Data

The existing algorithms for processing skyline queries cannot adapt to b...
research
10/13/2022

We need to talk about nonprobability samples

It is well known that, in most circumstances, probability sampling is th...
research
04/05/2018

Robust Fusion Methods for Structured Big Data

We address one of the important problems in Big Data, namely how to comb...
research
10/01/2018

On valid descriptive inference from non-probability sample

We examine the conditions under which descriptive inference can be based...
research
01/19/2021

Robust Bayesian Inference for Big Data: Combining Sensor-based Records with Traditional Survey Data

Big Data often presents as massive non-probability samples. Not only is ...

Please sign up or login with your details

Forgot password? Click here to reset