How to choose between different Bayesian posterior indices for hypothesis testing in practice

05/27/2020
by   Riko Kelter, et al.
0

Hypothesis testing is an essential statistical method in psychology and the cognitive sciences. The problems of traditional null hypothesis significance testing (NHST) have been discussed widely, and among the proposed solutions to the replication problems caused by the inappropriate use of significance tests and p-values is a shift towards Bayesian data analysis. However, Bayesian hypothesis testing is concerned with various posterior indices for significance and the size of an effect. This complicates Bayesian hypothesis testing in practice, as the availability of multiple Bayesian alternatives to the traditional p-value causes confusion which one to select and why. In this paper, we compare various Bayesian posterior indices which have been proposed in the literature and discuss their benefits and limitations. Our comparison shows that conceptually not all proposed Bayesian alternatives to NHST and p-values are beneficial, and the usefulness of some indices strongly depends on the study design and research goal. However, our comparison also reveals that there exist at least two candidates among the available Bayesian posterior indices which have appealing theoretical properties and are, to our best knowledge, widely underused among psychologists.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2020

The Full Bayesian Significance Test and the e-value – Foundations, theory and application in the cognitive sciences

Hypothesis testing is a central statistical method in psychological rese...
research
12/20/2021

Hypothesis testing and confidence sets: why Bayesian not frequentist, and how to set a prior with a regulatory authority

We marshall the arguments for preferring Bayesian hypothesis testing and...
research
10/08/2020

Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions

Frequentist statistical methods, such as hypothesis testing, are standar...
research
01/18/2022

Fragility Measures For Typical Cases

The fragility index is a clinically motivated metric designed to supplem...
research
06/14/2016

Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis

The machine learning community adopted the use of null hypothesis signif...
research
09/23/2019

A reckless guide to P-values: local evidence, global errors

This chapter demystifies P-values, hypothesis tests and significance tes...

Please sign up or login with your details

Forgot password? Click here to reset