A Panel Quantile Approach to Attrition Bias in Big Data: Evidence from a Randomized Experiment

08/09/2018
by Matthew Harding, et al.

This paper introduces a quantile regression estimator for panel data models with individual heterogeneity and attrition. The method is motivated by the fact that attrition bias is often encountered in Big Data applications. For example, many users sign up for the latest program but few remain active several months later, which makes evaluating such interventions inherently challenging. Building on earlier work by Hausman and Wise (1979), we provide a simple identification strategy that leads to a two-step estimation procedure. In the first step, the coefficients of interest in the selection equation are consistently estimated using parametric or nonparametric methods. In the second step, standard panel quantile methods are applied to a subset of weighted observations. The estimator is computationally easy to implement in Big Data applications with a large number of subjects. We investigate the conditions under which the parameter estimator is asymptotically Gaussian, and we carry out a series of Monte Carlo simulations to investigate its finite-sample properties. Lastly, using a simulation exercise, we apply the method to the evaluation of a recent Time-of-Day electricity pricing experiment inspired by the work of Aigner and Hausman (1980).
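To make the two-step idea concrete, here is a minimal sketch of inverse-probability weighting for a quantile. It is not the authors' estimator: it assumes attrition depends only on an observed discrete group, it ignores individual effects and the panel dimension, and it estimates an unconditional quantile rather than regression coefficients. All function names (`retention_probabilities`, `weighted_quantile`, `ipw_quantile`) and the toy data are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def retention_probabilities(groups, retained):
    """Step 1 (assumed nonparametric form): share of retained units per group."""
    counts = defaultdict(int)
    kept = defaultdict(int)
    for g, r in zip(groups, retained):
        counts[g] += 1
        kept[g] += int(r)
    return {g: kept[g] / counts[g] for g in counts}


def weighted_quantile(values, weights, tau):
    """Smallest value whose cumulative weight share reaches tau.

    This value minimizes the weighted check (pinball) loss.
    """
    pairs = sorted(zip(values, weights))
    total = sum(w for _, w in pairs)
    cum = 0.0
    for v, w in pairs:
        cum += w
        if cum >= tau * total:
            return v
    return pairs[-1][0]


def ipw_quantile(outcomes, groups, retained, tau):
    """Step 2: quantile of retained outcomes, weighted by 1 / retention probability."""
    p_hat = retention_probabilities(groups, retained)
    ys = [y for y, r in zip(outcomes, retained) if r]
    ws = [1.0 / p_hat[g] for g, r in zip(groups, retained) if r]
    return weighted_quantile(ys, ws, tau)


# Toy data: group "b" has larger outcomes and loses half of its units.
outcomes = [1, 2, 3, 4, 10, 20, 30, 40]
groups = ["a"] * 4 + ["b"] * 4
retained = [True] * 4 + [True, False, True, False]

kept_outcomes = [y for y, r in zip(outcomes, retained) if r]
naive = weighted_quantile(kept_outcomes, [1.0] * len(kept_outcomes), 0.6)
corrected = ipw_quantile(outcomes, groups, retained, 0.6)
print(naive, corrected)  # prints "4 10"
```

In this toy example the unweighted 0.6 quantile of the retained sample is pulled toward the group that stays in the panel, while the weighted version gives more mass to the units standing in for those who dropped out. The estimator in the abstract applies the same weighting logic inside a panel quantile regression, with a selection equation estimated in the first step.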

research
06/14/2022

On quantiles, continuity and robustness

We consider the geometric quantile and various definitions of the compon...
research
06/28/2023

Integrating Big Data and Survey Data for Efficient Estimation of the Median

An ever-increasing deluge of big data is becoming available to national ...
research
01/13/2020

Panel Data Quantile Regression for Treatment Effect Models

In this study, we explore the identification and estimation of the quant...
research
09/12/2019

Fast Algorithms for the Quantile Regression Process

The widespread use of quantile regression methods depends crucially on t...
research
05/05/2015

On the Feasibility of Distributed Kernel Regression for Big Data

In modern scientific research, massive datasets with huge numbers of obs...
research
03/06/2020

Complete Subset Averaging for Quantile Regressions

We propose a novel conditional quantile prediction method based on the c...
research
01/15/2018

Panel Data Quantile Regression with Grouped Fixed Effects

This paper introduces grouped latent heterogeneity in panel data quantil...