The r-value: evaluating stability with respect to distributional shifts

05/07/2021
by   Suyash Gupta, et al.
0

Common statistical measures of uncertainty like p-values and confidence intervals quantify the uncertainty due to sampling, that is, the uncertainty due to not observing the full population. In practice, populations change between locations and across time. This makes it difficult to gather knowledge that transfers across data sets. We propose a measure of uncertainty that quantifies the distributional uncertainty of a statistical estimand with respect to Kullback-Liebler divergence, that is, the sensitivity of the parameter under general distributional perturbations within a Kullback-Liebler divergence ball. If the signal-to-noise ratio is small, distributional uncertainty is a monotonous transformation of the signal-to-noise ratio. In general, however, it is a different concept and corresponds to a different research question. Further, we propose measures to estimate the stability of parameters with respect to directional or variable-specific shifts. We also demonstrate how the measure of distributional uncertainty can be used to prioritize data collection for better estimation of statistical parameters under shifted distribution. We evaluate the performance of the proposed measure in simulations and real data and show that it can elucidate the distributional (in-)stability of an estimator with respect to certain shifts and give more accurate estimates of parameters under shifted distribution only requiring to collect limited information from the shifted distribution.

READ FULL TEXT
research
09/19/2022

Distributionally robust and generalizable inference

We discuss recently developed methods that quantify the stability and ge...
research
02/24/2022

Calibrated inference: statistical inference that accounts for both sampling uncertainty and distributional uncertainty

During data analysis, analysts often have to make seemingly arbitrary de...
research
04/19/2022

Distributional Transform Based Information Reconciliation

In this paper we present an information reconciliation protocol designed...
research
02/20/2021

Strata-based Quantification of Distributional Uncertainty in Socio-Economic Indicators: A Comparative Study of Indian States

This paper reports a comprehensive study of distributional uncertainty i...
research
07/15/2021

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

There has been significant research done on developing methods for impro...
research
04/09/2021

Conditional Inference: Towards a Hierarchy of Statistical Evidence

Statistical uncertainty has many sources. P-values and confidence interv...
research
01/19/2021

Goodness (of fit) of Imputation Accuracy: The GoodImpact Analysis

In statistical survey analysis, (partial) non-responders are integral el...

Please sign up or login with your details

Forgot password? Click here to reset