A Theory of Statistical Inference for Ensuring the Robustness of Scientific Results

04/23/2018
by   Beau Coker, et al.
0

Inference is the process of using facts we know to learn about facts we do not know. A theory of inference gives assumptions necessary to get from the former to the latter, along with a definition for and summary of the resulting uncertainty. Any one theory of inference is neither right nor wrong, but merely an axiom that may or may not be useful. Each of the many diverse theories of inference can be valuable for certain applications. However, no existing theory of inference addresses the tendency to choose, from the range of plausible data analysis specifications consistent with prior evidence, those that inadvertently favor one's own hypotheses. Since the biases from these choices are a growing concern across scientific fields, and in a sense the reason the scientific community was invented in the first place, we introduce a new theory of inference designed to address this critical problem. We derive "hacking intervals," which are the range of a summary statistic one may obtain given a class of possible endogenous manipulations of the data. Hacking intervals require no appeal to hypothetical data sets drawn from imaginary superpopulations. A scientific result with a small hacking interval is more robust to researcher manipulation than one with a larger interval, and is often easier to interpret than a classical confidence interval. Some versions of hacking intervals turn out to be equivalent to classical confidence intervals, which means they may also provide a more intuitive and potentially more useful interpretation of classical confidence intervals

READ FULL TEXT

page 31

page 35

research
11/21/2020

Robust statistical inference for the matched net benefit and the matched win ratio using prioritized composite endpoints

As alternatives to the time-to-first-event analysis of composite endpoin...
research
11/25/2020

Hybrid Confidence Intervals for Informative Uniform Asymptotic Inference After Model Selection

I propose a new type of confidence interval for correct asymptotic infer...
research
07/14/2023

Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models

Statistical inference of the high-dimensional regression coefficients is...
research
12/04/2020

MCMC Confidence Intervals and Biases

The recent paper "Simple confidence intervals for MCMC without CLTs" by ...
research
01/27/2018

More powerful post-selection inference, with application to the Lasso

Investigators often use the data to generate interesting hypotheses and ...
research
01/10/2022

Interpretation and inference for altmetric indicators arising from sparse data statistics

In 2018 Bornmann and Haunschild (2018a) introduced a new indicator calle...
research
05/15/2020

Evaluating methods for Lasso selective inference in biomedical research by a comparative simulation study

Variable selection for regression models plays a key role in the analysi...

Please sign up or login with your details

Forgot password? Click here to reset