Diagnosing the role of observable distribution shift in scientific replications

09/03/2023
by Ying Jin, et al.

Many researchers have identified distribution shift as a likely contributor to the reproducibility crisis in behavioral and biomedical sciences. The idea is that if treatment effects vary across individual characteristics and experimental contexts, then studies conducted in different populations will estimate different average effects. This paper uses “generalizability” methods to quantify how much of the effect size discrepancy between an original study and its replication can be explained by distribution shift on observed unit-level characteristics. More specifically, we decompose this discrepancy into “components” attributable to sampling variability (including publication bias), observable distribution shifts, and residual factors. We compute this decomposition for several directly replicated behavioral science experiments and find little evidence that observable distribution shifts contribute appreciably to non-replicability. In some cases, this is because there is too much statistical noise. In other cases, there is strong evidence that controlling for additional moderators is necessary for reliable replication.
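
To make the idea of such a decomposition concrete, here is a minimal sketch. It is not the paper's estimator. It assumes a single discrete observed covariate and uses simple post-stratification: the original study's per-stratum effects are reweighted to the replication's covariate mix, and the point-estimate discrepancy is split into an observable-shift part and a residual part. It ignores sampling variability and publication bias entirely, and all function and variable names are hypothetical.

```python
import numpy as np

def stratum_shares(x):
    """Share of units falling in each stratum of the discrete covariate x."""
    values, counts = np.unique(x, return_counts=True)
    return {v: c / counts.sum() for v, c in zip(values, counts)}

def stratified_effect(x, t, y, shares):
    """Weighted average of per-stratum treated-minus-control mean differences.

    Assumption: every stratum in `shares` contains both treated (t == 1)
    and control (t == 0) units in this study.
    """
    total = 0.0
    for stratum, share in shares.items():
        in_stratum = x == stratum
        diff = y[in_stratum & (t == 1)].mean() - y[in_stratum & (t == 0)].mean()
        total += share * diff
    return total

def decompose_discrepancy(x_o, t_o, y_o, x_r, t_r, y_r):
    """Split (original effect - replication effect) into two point-estimate parts.

    observable_shift: change in the original estimate when its strata are
                      reweighted to the replication's covariate distribution.
    residual:         what remains after that reweighting.
    """
    shares_o = stratum_shares(x_o)
    shares_r = stratum_shares(x_r)
    theta_o = stratified_effect(x_o, t_o, y_o, shares_o)
    theta_r = stratified_effect(x_r, t_r, y_r, shares_r)
    # Original study's effects, transported to the replication population.
    theta_o_to_r = stratified_effect(x_o, t_o, y_o, shares_r)
    return {
        "total": theta_o - theta_r,
        "observable_shift": theta_o - theta_o_to_r,
        "residual": theta_o_to_r - theta_r,
    }
```

By construction the two parts sum to the total discrepancy. A real analysis would also need uncertainty estimates for each part, since a noisy estimate can make either component indistinguishable from zero.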

