When Evidence and Significance Collide

06/09/2022
by František Bartoš, et al.

Null hypothesis statistical significance testing (NHST) is the dominant approach for evaluating results from randomized controlled trials. Whereas NHST comes with long-run error rate guarantees, its main inferential tool – the p-value – is only an indirect measure of evidence against the null hypothesis. The main reason is that the p-value is computed under the assumption that the null hypothesis is true, whereas the likelihood of the data under any alternative hypothesis is ignored. If the goal is to quantify how much evidence the data provide for or against the null hypothesis, an alternative hypothesis must be specified (Goodman & Royall, 1988). Paradoxes arise when researchers interpret p-values as evidence. For instance, results that are surprising under the null may be equally surprising under a plausible alternative hypothesis, such that a p = .045 result ("reject the null") does not make the null any less plausible than it was before. Hence, p-values have been argued to overestimate the evidence against the null hypothesis. Conversely, statistically non-significant results (i.e., p > .05) can nevertheless provide some evidence in favor of the alternative hypothesis. It is therefore crucial for researchers to know when statistical significance and evidence collide, and this requires that a direct measure of evidence be computed and presented alongside the traditional p-value.
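
The collision described in the abstract can be illustrated with a minimal sketch. It is not the authors' analysis. It assumes a z-test with known variance and, as a purely illustrative alternative hypothesis, a zero-centered normal prior on the standardized effect with scale `tau` in z-units. The function names and the choices of `tau` are hypothetical. Under these assumptions the Bayes factor BF10 is the ratio of the marginal density of z under the alternative, N(0, 1 + tau^2), to its density under the null, N(0, 1).

```python
import math
from statistics import NormalDist


def two_sided_p(z: float) -> float:
    """Two-sided p-value for a z statistic under H0: z ~ N(0, 1)."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def bf10_normal_prior(z: float, tau: float) -> float:
    """Bayes factor for H1 over H0 (assumed model, not from the paper).

    H0: z ~ N(0, 1)
    H1: z ~ N(0, 1 + tau^2), i.e. a N(0, tau^2) prior on the effect in z-units.
    """
    shrink = tau**2 / (1.0 + tau**2)
    return math.exp(0.5 * z**2 * shrink) / math.sqrt(1.0 + tau**2)


# z statistic whose two-sided p-value is .045 ("significant").
z_sig = NormalDist().inv_cdf(1.0 - 0.045 / 2.0)

# A "non-significant" z statistic (p slightly above .05).
z_nonsig = 1.9

for label, z in [("significant", z_sig), ("non-significant", z_nonsig)]:
    for tau in (1.0, 10.0):  # illustrative prior scales
        print(
            f"{label}: z={z:.3f}, p={two_sided_p(z):.3f}, "
            f"tau={tau:g}, BF10={bf10_normal_prior(z, tau):.2f}"
        )
```

Under these assumed priors, the p = .045 result yields a Bayes factor of approximately 1.9 for `tau = 1` and approximately 0.7 for `tau = 10`. The "significant" result is therefore weak evidence at best, and under the diffuse prior it slightly favors the null. The z = 1.9 result is not significant, yet with `tau = 1` its Bayes factor exceeds 1, so it mildly favors the alternative. The numbers depend entirely on the assumed alternative, which is the abstract's point: evidence cannot be quantified without specifying one.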

