DeepAI AI Chat
Log In Sign Up

History and Nature of the Jeffreys-Lindley Paradox

by   Eric-Jan Wagenmakers, et al.

The Jeffreys-Lindley paradox exposes a rift between Bayesian and frequentist hypothesis testing that strikes at the heart of statistical inference. Contrary to what most current literature suggests, the paradox was central to the Bayesian testing methodology developed by Sir Harold Jeffreys in the late 1930s. Jeffreys showed that the evidence against a point-null hypothesis ℋ_0 scales with √(n) and repeatedly argued that it would therefore be mistaken to set a threshold for rejecting ℋ_0 at a constant multiple of the standard error. Here we summarize Jeffreys's early work on the paradox and clarify his reasons for including the √(n) term. The prior distribution is seen to play a crucial role; by implicitly correcting for selection, small parameter values are identified as relatively surprising under ℋ_1. We highlight the general nature of the paradox by presenting both a fully frequentist and a fully Bayesian version. We also demonstrate that the paradox does not depend on assigning prior mass to a point hypothesis, as is commonly believed.


page 1

page 2

page 3

page 4


Bayesian Hypothesis Testing: Redux

Bayesian hypothesis testing is re-examined from the perspective of an a ...

Demystify Lindley's Paradox by Interpreting P-value as Posterior Probability

In the hypothesis testing framework, p-value is often computed to determ...

50 shades of Bayesian testing of hypotheses

Hypothesis testing and model choice are quintessential questions for sta...

Sound and Relatively Complete Belief Hoare Logic for Statistical Hypothesis Testing Programs

We propose a new approach to formally describing the requirement for sta...

A Puzzle of Proportions: Two Popular Bayesian Tests Can Yield Dramatically Different Conclusions

Testing the equality of two proportions is a common procedure in science...

A Bayesian 'sandwich' for variance estimation and hypothesis testing

Many frequentist methods have large-sample Bayesian analogs, but widely-...