A Longitudinal Framework for Predicting Nonresponse in Panel Surveys

09/29/2019
by   Christoph Kern, et al.
0

Nonresponse in panel studies can lead to a substantial loss in data quality due to its potential to introduce bias and distort survey estimates. Recent work investigates the usage of machine learning to predict nonresponse in advance, such that predicted nonresponse propensities can be used to inform the data collection process. However, predicting nonresponse in panel studies requires accounting for the longitudinal data structure in terms of model building, tuning, and evaluation. This study proposes a longitudinal framework for predicting nonresponse with machine learning and multiple panel waves and illustrates its application. With respect to model building, this approach utilizes information from multiple waves by introducing features that aggregate previous (non)response patterns. Concerning model tuning and evaluation, temporal cross-validation is employed by iterating through pairs of panel waves such that the training and test sets move in time. Implementing this approach with data from a German probability-based mixed-mode panel shows that aggregating information over multiple panel waves can be used to build prediction models with competitive and robust performance over all test waves.

READ FULL TEXT

page 23

page 24

research
03/13/2020

Random Forest Classifier Based Prediction of Rogue waves on Deep Oceans

In this paper, we present a novel approach for the prediction of rogue w...
research
04/14/2022

Nonresponse Bias Analysis in Longitudinal Educational Assessment Studies

Longitudinal studies are subject to nonresponse when individuals fail to...
research
07/31/2020

Predicting heave and surge motions of a semi-submersible with neural networks

Real-time motion prediction of a vessel or a floating platform can help ...
research
11/05/2020

Predicting respondent difficulty in web surveys: A machine-learning approach based on mouse movement features

A central goal of survey research is to collect robust and reliable data...
research
07/07/2020

Analytics of Longitudinal System Monitoring Data for Performance Prediction

In recent years, several HPC facilities have started continuous monitori...
research
10/03/2019

Prediction of GNSS Phase Scintillations: A Machine Learning Approach

A Global Navigation Satellite System (GNSS) uses a constellation of sate...
research
11/18/2017

Household poverty classification in data-scarce environments: a machine learning approach

We describe a method to identify poor households in data-scarce countrie...

Please sign up or login with your details

Forgot password? Click here to reset