Proxy expenditure weights for Consumer Price Index: Audit sampling inference for big data statistics

06/15/2019
by   Li-Chun Zhang, et al.
0

Purchase data from retail chains provide proxy measures of private household expenditure on items that are the most troublesome to collect in the traditional expenditure survey. Due to the sheer amount of proxy data, the bias due to coverage and selection errors completely dominates the variance. We develop tests for bias based on audit sampling, which makes use of available survey data that cannot be linked to the proxy data source at the individual level. However, audit sampling fails to yield a meaningful mean squared error estimate, because the sampling variance is too large compared to the bias of the big data estimate. We propose a novel accuracy measure that is applicable in such situations. This can provide a necessary part of the statistical argument for the uptake of big data source, in replacement of traditional survey sampling. An application to disaggregated food price index is used to demonstrate the proposed approach.

READ FULL TEXT
research
07/22/2023

Survey Design and Estimating Equations when Combining Big Data with Probability Samples

The use of big data in official statistics and the applied sciences is a...
research
06/19/2022

Demand Analysis with a Thin Price Sample

For about 125 items of food, the Consumer Expenditure Survey (CES) sched...
research
10/11/2020

On Spatial Lag Models estimated using crowdsourcing, web-scraping or other unconventionally collected data

The Big Data revolution is challenging the state-of-the-art statistical ...
research
06/06/2023

A Calibrated Data-Driven Approach for Small Area Estimation using Big Data

Where the response variable in a big data set is consistent with the var...
research
11/30/2022

Predicting China's CPI by Scanner Big Data

Scanner big data has potential to construct Consumer Price Index (CPI). ...
research
11/09/2017

A Dwarf-based Scalable Big Data Benchmarking Methodology

Different from the traditional benchmarking methodology that creates a n...
research
06/10/2021

Are We There Yet? Big Data Significantly Overestimates COVID-19 Vaccination in the US

Public health efforts to control the COVID-19 pandemic rely on accurate ...

Please sign up or login with your details

Forgot password? Click here to reset