p-Value as the Strength of Evidence Measured by Confidence Distribution

01/31/2020
by Sifan Liu, et al.

The p-value is a fundamental concept in statistical inference and is widely used to report the outcomes of hypothesis tests. However, the p-value is often misinterpreted, misused, or miscommunicated in practice. Part of the issue is that existing definitions of the p-value are often derived from constructions under specific settings, and a general definition that directly reflects the evidence for the null hypothesis is not yet available. In this article, we first propose a general and rigorous definition of the p-value that fulfills two performance-based characteristics. The performance-based definition subsumes all existing construction-based definitions of the p-value and justifies their interpretations. The paper further presents a specific approach based on confidence distributions to formulate and calculate p-values. This way of computing p-values has two main advantages. First, it is applicable to a wide range of hypothesis testing problems, including the standard one- and two-sided tests, tests with interval-type nulls, intersection-union tests, and multivariate tests. Second, it leads naturally to a coherent interpretation of the p-value as evidence in support of the null hypothesis, as well as a meaningful measure of the degree of such support. In particular, it gives meaning to a large p-value: for example, a p-value of 0.8 indicates more support for the null hypothesis than a p-value of 0.5. Numerical examples illustrate the wide applicability and computational feasibility of our approach. We show that our proposal is effective and can be applied broadly, without further consideration of the form or size of the null space. With existing testing methods, by contrast, such solutions have either been unavailable or cannot be easily obtained.
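As a rough illustration of reading a p-value off a confidence distribution, the sketch below uses the textbook case of a normal mean with known standard deviation. The abstract gives no formulas, so everything here is an assumption for illustration: the confidence distribution H(θ) = Φ(√n(θ − x̄)/σ), the one-sided value H(θ0) for the null θ ≤ θ0, the point-null value 2·min{H(θ0), 1 − H(θ0)}, and the helper names `normal_mean_cd`, `p_value_one_sided`, and `p_value_point_null`. It is not necessarily the exact formulation used in the paper.

```python
from math import sqrt
from statistics import NormalDist


def normal_mean_cd(xbar, sigma, n):
    """Confidence distribution for a normal mean with known sigma (assumed setting).

    Returns the function H(theta) = Phi(sqrt(n) * (theta - xbar) / sigma).
    """
    std_normal = NormalDist()  # standard normal, mean 0 and sd 1

    def H(theta):
        return std_normal.cdf(sqrt(n) * (theta - xbar) / sigma)

    return H


def p_value_one_sided(H, theta0):
    """Support for the null 'theta <= theta0': the CD mass at or below theta0."""
    return H(theta0)


def p_value_point_null(H, theta0):
    """Support for the point null 'theta == theta0': twice the smaller CD tail."""
    tail = H(theta0)
    return 2 * min(tail, 1 - tail)


# Hypothetical data summary: sample mean 0.3 from n = 25 draws with sigma = 1.
H = normal_mean_cd(xbar=0.3, sigma=1.0, n=25)
print(p_value_one_sided(H, 0.0))   # about 0.0668, same as the one-sided z-test
print(p_value_point_null(H, 0.0))  # about 0.1336, same as the two-sided z-test
print(p_value_point_null(H, 0.3))  # 1.0: the null value equals the sample mean
```

In this setting the two quantities coincide with the usual one- and two-sided z-test p-values, and a null value closer to the sample mean receives a larger value, which matches the reading of a large p-value as stronger support for the null.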

research
09/22/2018

P-value: A Bless or A Curse for Evidence-Based Studies?

As a convention, p-value is often computed in frequentist hypothesis tes...
research
03/03/2022

A general adaptive framework for multivariate point null testing

As a common step in refining their scientific inquiry, investigators are...
research
05/08/2018

Seeking evidence of absence: Reconsidering tests of model assumptions

Statistical tests can only reject the null hypothesis, never prove it. H...
research
06/06/2018

A Likelihood-based Alternative to Null Hypothesis Significance Testing

The logical and practical difficulties associated with research interpre...
research
08/17/2023

Rethinking Hypothesis Tests

Null Hypothesis Significance Testing (NHST) have been a popular statisti...
research
10/19/2021

Practical Relevance: A Formal Definition

There is a general agreement that it is important to consider the practi...
research
07/01/2021

Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?

There have been long-standing controversies and inconsistencies over the...
