How to Tell When a Result Will Replicate: Significance and Replication in Distributional Null Hypothesis Tests

11/04/2022
by   Fintan Costello, et al.
0

There is a well-known problem in Null Hypothesis Significance Testing: many statistically significant results fail to replicate in subsequent experiments. We show that this problem arises because standard `point-form null' significance tests consider only within-experiment but ignore between-experiment variation, and so systematically underestimate the degree of random variation in results. We give an extension to standard significance testing that addresses this problem by analysing both within- and between-experiment variation. This `distributional null' approach does not underestimate experimental variability and so is not overconfident in identifying significance; because this approach addresses between-experiment variation, it gives mathematically coherent estimates for the probability of replication of significant results. Using a large-scale replication dataset (the first `Many Labs' project), we show that many experimental results that appear statistically significant in standard tests are in fact consistent with random variation when both within- and between-experiment variation are taken into account in this approach. Further, grouping experiments in this dataset into `predictor-target' pairs we show that the predicted replication probabilities for target experiments produced in this approach (given predictor experiment results and the sample sizes of the two experiments) are strongly correlated with observed replication rates. Distributional null hypothesis testing thus gives researchers a statistical tool for identifying statistically significant and reliably replicable results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2020

Distributional Null Hypothesis Testing with the T distribution

Null Hypothesis Significance Testing (NHST) has long been central to the...
research
10/16/2020

Significance and Replication in simple counting experiments: Distributional Null Hypothesis Testing

Null Hypothesis Significance Testing (NHST) has long been of central imp...
research
05/08/2023

Replication of "null results" – Absence of evidence or evidence of absence?

In several large-scale replication projects, statistically non-significa...
research
07/06/2021

When to adjust alpha during multiple testing: A consideration of disjunction, conjunction, and individual testing

Scientists often adjust their significance threshold (alpha level) durin...
research
03/01/2018

On Statistical Non-Significance

Significance tests are probably the most extended form of inference in e...
research
06/05/2023

Significance Bands for Local Projections

An impulse response function describes the dynamic evolution of an outco...

Please sign up or login with your details

Forgot password? Click here to reset