Post-selection Inference in Multiverse Analysis (PIMA): an inferential framework based on the sign flipping score test

10/06/2022
by   Paolo Girardi, et al.
0

When analyzing data researchers make some decisions that are either arbitrary, based on subjective beliefs about the data generating process, or for which equally justifiable alternative choices could have been made. This wide range of data-analytic choices can be abused, and has been one of the underlying causes of the replication crisis in several fields. Recently, the introduction of multiverse analysis provides researchers with a method to evaluate the stability of the results across reasonable choices that could be made when analyzing data. Multiverse analysis is confined to a descriptive role, lacking a proper and comprehensive inferential procedure. Recently, specification curve analysis adds an inferential procedure to multiverse analysis, but this approach is limited to simple cases related to the linear model, and only allows researchers to infer whether at least one specification rejects the null hypothesis, but not which specifications should be selected. In this paper we present a Post-selection Inference approach to Multiverse Analysis (PIMA) which is a flexible and general inferential approach that accounts for all possible models, i.e., the multiverse of reasonable analyses. The approach allows for a wide range of data specifications (i.e. pre-processing) and any generalized linear model; it allows testing the null hypothesis of a given predictor not being associated with the outcome, by merging information from all reasonable models of multiverse analysis, and provides strong control of the family-wise error rate such that it allows researchers to claim that the null-hypothesis can be rejected for each specification that shows a significant effect. The inferential proposal is based on a conditional resampling procedure. To be continued...

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2022

When Evidence and Significance Collide

Null hypothesis statistical significance testing (NHST) is the dominant ...
research
01/21/2019

The posterior probability of a null hypothesis given a statistically significant result

Some researchers informally assume that, when they carry out a null hypo...
research
10/18/2017

A five-decision testing procedure to infer on unidimensional parameter

A statistical test can be seen as a procedure to produce a decision base...
research
06/29/2023

Zipper: Addressing degeneracy in algorithm-agnostic inference

The widespread use of black box prediction methods has sparked an increa...
research
01/23/2019

Three principles of data science: predictability, computability, and stability (PCS)

We propose the predictability, computability, and stability (PCS) framew...
research
10/03/2017

Robust Hypothesis Test for Nonlinear Effect with Gaussian Processes

This work constructs a hypothesis test for detecting whether an data-gen...

Please sign up or login with your details

Forgot password? Click here to reset