The sceptical Bayes factor for the assessment of replication success

09/03/2020
by   Samuel Pawel, et al.
0

There is an urgent need to develop new methodology for the design and analysis of replication studies. Recently, a reverse-Bayes method called the sceptical p-value has been proposed for this purpose; the inversion of Bayes' theorem allows us to mathematically formalise the notion of scepticism, which in turn can be used to assess the agreement between the findings of an original study and its replication. However, despite its Bayesian nature, the method relies on tail probabilities as primary inference tools. Here, we present an extension that uses Bayes factors as an alternative means of quantifying evidence. This leads to a new measure for evaluating replication success, the sceptical Bayes factor: Conceptually, the sceptical Bayes factor provides a bound for the maximum level of evidence at which an advocate of the original finding can convince a sceptic who does not trust it, in light of the replication data. While the sceptical p-value can only quantify the conflict between the sceptical prior and the observed replication data, the sceptical Bayes factor also takes into account how likely the data are under the posterior distribution of the effect conditional on the original study, allowing for stronger statements about replication success. Moreover, the proposed method elegantly combines traditional notions of replication success; it ensures that both studies need to show evidence against the null, while at the same time penalising incompatibility of their effect estimates. Case studies from the Reproducibility Project: Cancer Biology and the Social Sciences Replication Project show the advantages of the method for the quantitative assessment of replicability.

READ FULL TEXT
research
11/26/2018

A New Standard for the Analysis and Design of Replication Studies

A new standard is proposed for the evidential assessment of replication ...
research
05/08/2023

Replication of "null results" – Absence of evidence or evidence of absence?

In several large-scale replication projects, statistically non-significa...
research
09/03/2023

Diagnosing the role of observable distribution shift in scientific replications

Many researchers have identified distribution shift as a likely contribu...
research
12/08/2017

p-Values for Credibility

Analysis of credibility is a reverse-Bayes technique that has been propo...
research
07/01/2022

A Statistical Framework for Replicability

We introduce a novel statistical framework to study replicability which ...
research
04/14/2022

The replication of non-inferiority and equivalence studies

Replication studies are increasingly conducted to assess credibility of ...
research
07/29/2022

Power Priors for Replication Studies

The ongoing replication crisis in science has increased interest in the ...

Please sign up or login with your details

Forgot password? Click here to reset