Big Data meets Causal Survey Research: Understanding Nonresponse in the Recruitment of a Mixed-mode Online Panel

02/17/2021
by   Barbara Felderer, et al.
0

Survey scientists increasingly face the problem of high-dimensionality in their research as digitization makes it much easier to construct high-dimensional (or "big") data sets through tools such as online surveys and mobile applications. Machine learning methods are able to handle such data, and they have been successfully applied to solve predictive problems. However, in many situations, survey statisticians want to learn about causal relationships to draw conclusions and be able to transfer the findings of one survey to another. Standard machine learning methods provide biased estimates of such relationships. We introduce into survey statistics the double machine learning approach, which gives approximately unbiased estimators of causal parameters, and show how it can be used to analyze survey nonresponse in a high-dimensional panel setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2023

Integrating Big Data and Survey Data for Efficient Estimation of the Median

An ever-increasing deluge of big data is becoming available to national ...
research
02/24/2016

A Survey on Domain-Specific Languages for Machine Learning in Big Data

The amount of data generated in the modern society is increasing rapidly...
research
04/15/2017

Big Universe, Big Data: Machine Learning and Image Analysis for Astronomy

Astrophysics and cosmology are rich with data. The advent of wide-area d...
research
10/07/2022

Geomagnetic Survey Interpolation with the Machine Learning Approach

This paper portrays the method of UAV magnetometry survey data interpola...
research
10/04/2021

Unraveling the graph structure of tabular datasets through Bayesian and spectral analysis

In the big-data age tabular datasets are being generated and analyzed ev...
research
06/10/2021

Are We There Yet? Big Data Significantly Overestimates COVID-19 Vaccination in the US

Public health efforts to control the COVID-19 pandemic rely on accurate ...
research
04/21/2023

A Common Misassumption in Online Experiments with Machine Learning Models

Online experiments such as Randomised Controlled Trials (RCTs) or A/B-te...

Please sign up or login with your details

Forgot password? Click here to reset