Safe Testing

06/18/2019
by Peter Grünwald, et al.

We present a new theory of hypothesis testing. The main concept is the S-value, a notion of evidence which, unlike p-values, allows for effortlessly combining evidence from several tests, even in the common scenario where the decision to perform a new test depends on the previous test outcome: safe tests based on S-values generally preserve Type-I error guarantees under such "optional continuation". S-values exist for completely general testing problems with composite null and alternative hypotheses. Their prime interpretation is in terms of gambling or investing, each S-value corresponding to a particular investment. Surprisingly, optimal "GROW" S-values, which lead to the fastest capital growth, are fully characterized by the joint information projection (JIPr) between the set of all Bayes marginal distributions on H0 and H1. Thus, optimal S-values also have an interpretation as Bayes factors, with priors given by the JIPr. We illustrate the theory using two classical testing scenarios: the one-sample t-test and the 2x2 contingency table. In the t-test setting, GROW S-values correspond to adopting the right Haar prior on the variance, as in Jeffreys' Bayesian t-test. However, unlike Jeffreys' test, the "default" safe t-test puts a discrete 2-point prior on the effect size, leading to better behavior in terms of statistical power. Sharing Fisherian, Neymanian and Jeffreys-Bayesian interpretations, S-values and safe tests may provide a methodology acceptable to adherents of all three schools.
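The optional-continuation claim can be illustrated with a small simulation. The following is a minimal sketch and not the paper's safe t-test or its GROW construction: it assumes a simple null N(0, 1) against a simple alternative N(mu1, 1), for which the likelihood ratio is an S-value (its expectation under the null is 1). The names s_value, run_studies and rejection_rate, and all parameter values, are hypothetical choices made for illustration. S-values from successive studies are multiplied, a new study is started only while the evidence so far is inconclusive, and the null is rejected once the product reaches 1/alpha.

```python
import math
import random


def s_value(xs, mu1):
    """Likelihood ratio of N(mu1, 1) against N(0, 1) for one batch of data.

    Under the null N(0, 1) its expected value is exactly 1, which is the
    defining property of an S-value in this simple-vs-simple setting.
    """
    n = len(xs)
    return math.exp(mu1 * sum(xs) - n * mu1 ** 2 / 2)


def run_studies(rng, true_mean, mu1=0.5, n=10, max_studies=20, alpha=0.05):
    """Multiply S-values across studies with data-dependent continuation.

    A new study is run only if the running product is still below 1/alpha.
    Returns True if the null is rejected at some point.
    """
    capital = 1.0  # gambling interpretation: start with one unit of capital
    for _ in range(max_studies):
        xs = [rng.gauss(true_mean, 1.0) for _ in range(n)]
        capital *= s_value(xs, mu1)
        if capital >= 1 / alpha:
            return True
    return False


def rejection_rate(true_mean, trials=2000, seed=0, alpha=0.05):
    """Fraction of simulated study sequences that end in a rejection."""
    rng = random.Random(seed)
    rejections = sum(
        run_studies(rng, true_mean, alpha=alpha) for _ in range(trials)
    )
    return rejections / trials


if __name__ == "__main__":
    print("rejection rate under H0:", rejection_rate(0.0))
    print("rejection rate under H1:", rejection_rate(0.5))
```

In this sketch the rejection rate with true mean 0 should stay below alpha even though the number of studies depends on earlier outcomes, while with true mean 0.5 the product grows quickly and rejections are frequent. Applying the same stopping rule to p-values would not in general keep the Type-I error below alpha.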

