A New Standard for the Analysis and Design of Replication Studies

11/26/2018
by Leonhard Held, et al.

A new standard is proposed for the evidential assessment of replication studies. The approach combines a specific reverse-Bayes technique with prior-predictive tail probabilities to define replication success. The method gives rise to a quantitative measure for replication success, called the sceptical p-value. The sceptical p-value integrates traditional significance of both the original and replication study with a comparison of the respective effect sizes. It incorporates the uncertainty of both the original and replication effect estimates and reduces to the ordinary p-value of the replication study if the uncertainty of the original effect estimate is ignored. The proposed framework can also be used to determine the power or the required sample size to achieve replication success. Numerical calculations highlight how difficult it is to achieve replication success if the evidence from the original study is only suggestive. An application to data from the Open Science Collaboration project on the replicability of psychological science illustrates the proposed methodology.
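
To make the quantity described above concrete, here is a minimal sketch of how a two-sided sceptical p-value could be computed from the z-values of the two studies. The abstract does not state the formula, so the defining equation used here is an assumption based on the definition given in the full paper: the squared sceptical z-value z_S^2 solves (z_o^2/z_S^2 - 1)(z_r^2/z_S^2 - 1) = c, where z_o and z_r are the z-values of the original and replication study and c is the ratio of the original to the replication variance. The function names are hypothetical, and the sketch does not check that both effect estimates point in the same direction.

```python
import math


def sceptical_z_squared(z_o: float, z_r: float, c: float) -> float:
    """Solve (z_o^2/x - 1)(z_r^2/x - 1) = c for x = z_S^2 (assumed definition).

    Expanding gives (c - 1) x^2 + (z_o^2 + z_r^2) x - z_o^2 z_r^2 = 0.
    The root in (0, min(z_o^2, z_r^2)) is written in a rationalised form
    that stays well defined when c equals 1.
    """
    if c <= 0:
        raise ValueError("variance ratio c must be positive")
    zo2, zr2 = z_o ** 2, z_r ** 2
    if zo2 == 0.0 or zr2 == 0.0:
        return 0.0  # no evidence in one study, so z_S = 0
    s = zo2 + zr2
    disc = s * s + 4.0 * (c - 1.0) * zo2 * zr2
    return 2.0 * zo2 * zr2 / (s + math.sqrt(disc))


def sceptical_p_value(z_o: float, z_r: float, c: float) -> float:
    """Two-sided p-value 2 * (1 - Phi(z_S)), computed via erfc."""
    z_s = math.sqrt(sceptical_z_squared(z_o, z_r, c))
    return math.erfc(z_s / math.sqrt(2.0))


if __name__ == "__main__":
    # Illustrative inputs only, not taken from any study.
    print(sceptical_p_value(z_o=2.5, z_r=2.5, c=1.0))
```

Under this definition z_S^2 is always smaller than both z_o^2 and z_r^2, so the sceptical p-value is larger than either ordinary p-value, and it approaches the replication p-value as z_o grows without bound, which matches the limiting behaviour described in the abstract.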

research
09/16/2020

The assessment of replication success based on relative effect size

Replication studies are increasingly conducted to confirm original findi...
research
09/03/2020

The sceptical Bayes factor for the assessment of replication success

There is an urgent need to develop new methodology for the design and an...
research
04/22/2020

Power Calculations for Replication Studies

The reproducibility crisis has led to an increasing number of replicatio...
research
07/01/2022

A Statistical Framework for Replicability

We introduce a novel statistical framework to study replicability which ...
research
06/11/2021

Cross-replication Reliability – An Empirical Approach to Interpreting Inter-rater Reliability

We present a new approach to interpreting IRR that is empirical and cont...
research
05/08/2023

Replication of "null results" – Absence of evidence or evidence of absence?

In several large-scale replication projects, statistically non-significa...
research
12/08/2017

p-Values for Credibility

Analysis of credibility is a reverse-Bayes technique that has been propo...