Irreproducibility; Nothing is More Predictable

03/12/2018
by   David Kohn, et al.
0

The increasing ease of data capture and storage has led to a corresponding increase in the choice of data, the type of analysis performed on that data, and the complexity of the analysis performed. The main contribution of this paper is to show that the subjective choice of data and analysis methodology substantially impacts the identification of factors and outcomes of observational studies. This subjective variability of inference is at the heart of recent discussions around irreproducibility in scientific research. To demonstrate this subjective variability, data is taken from an existing study, where interest centres on understanding the factors associated with a young adult's propensity to fall into the category of `not in employment, education or training' (NEET). A fully probabilistic analysis is performed, set in a Bayesian framework and implemented using Reversible Jump Markov chain Monte Carlo (RJMCMC). The results show that different techniques lead to different inference but that models consisting of different factors often have the same predictive performance, whether the analysis is frequentist or Bayesian, making inference problematic. We demonstrate how the use of prior distributions in Bayesian techniques can be used to as a tool for assessing a factor's importance.

READ FULL TEXT
research
01/26/2022

Sequential Bayesian Inference for Factor Analysis

We develop an efficient Bayesian sequential inference framework for fact...
research
10/09/2019

Bayesian factor models for multivariate categorical data obtained from questionnaires

Factor analysis is a flexible technique for assessment of multivariate d...
research
02/17/2020

Bayesian Quantile Factor Models

Factor analysis is a flexible technique for assessment of multivariate d...
research
11/05/2018

A Bayesian Semiparametric Jolly-Seber Model with Individual Heterogeneity: An Application to Migratory Mallards at Stopover

We propose a Bayesian hierarchical Jolly-Seber model that can account fo...
research
08/30/2018

An Introduction to Inductive Statistical Inference -- from Parameter Estimation to Decision-Making

These lecture notes aim at a post-Bachelor audience with a backgound at ...
research
12/22/2019

Blang: Bayesian declarative modelling of arbitrary data structures

Consider a Bayesian inference problem where a variable of interest does ...
research
07/21/2022

Efficient inference and identifiability analysis for differential equation models with random parameters

Heterogeneity is a dominant factor in the behaviour of many biological p...

Please sign up or login with your details

Forgot password? Click here to reset