Prediction scoring of data-driven discoveries for reproducible research

11/18/2022
by   Anna L. Smith, et al.
0

Predictive modeling uncovers knowledge and insights regarding a hypothesized data generating mechanism (DGM). Results from different studies on a complex DGM, derived from different data sets, and using complicated models and algorithms, are hard to quantitatively compare due to random noise and statistical uncertainty in model results. This has been one of the main contributors to the replication crisis in the behavioral sciences. The contribution of this paper is to apply prediction scoring to the problem of comparing two studies, such as can arise when evaluating replications or competing evidence. We examine the role of predictive models in quantitatively assessing agreement between two datasets that are assumed to come from two distinct DGMs. We formalize a distance between the DGMs that is estimated using cross validation. We argue that the resulting prediction scores depend on the predictive models created by cross validation. In this sense, the prediction scores measure the distance between DGMs, along the dimension of the particular predictive model. Using human behavior data from experimental economics, we demonstrate that prediction scores can be used to evaluate preregistered hypotheses and provide insights comparing data from different populations and settings. We examine the asymptotic behavior of the prediction scores using simulated experimental data and demonstrate that leveraging competing predictive models can reveal important differences between underlying DGMs. Our proposed cross-validated prediction scores are capable of quantifying differences between unobserved data generating mechanisms and allow for the validation and assessment of results from complex models.

READ FULL TEXT
research
08/03/2012

Cross-conformal predictors

This note introduces the method of cross-conformal prediction, which is ...
research
05/21/2019

On the marginal likelihood and cross-validation

In Bayesian statistics, the marginal likelihood, also known as the evide...
research
08/23/2019

A relation between log-likelihood and cross-validation log-scores

It is shown that the log-likelihood of a hypothesis or model given some ...
research
01/29/2020

Asymptotics of Cross-Validation

Cross validation is a central tool in evaluating the performance of mach...
research
08/16/2019

Selection of Exponential-Family Random Graph Models via Held-Out Predictive Evaluation (HOPE)

Statistical models for networks with complex dependencies pose particula...
research
08/24/2019

EPP: interpretable score of model predictive power

The most important part of model selection and hyperparameter tuning is ...
research
07/07/2022

Production Assessment using a Knowledge Transfer Framework and Evidence Theory

Operational knowledge is one of the most valuable assets in a company, a...

Please sign up or login with your details

Forgot password? Click here to reset