Evidence for goodness of fit in Karl Pearson chi-squared statistics

12/03/2019
by Robert G. Staudte, et al.

Chi-squared tests for lack of fit are traditionally used to find evidence against a hypothesized model, and the model is accepted if the Karl Pearson statistic, which compares the observed and expected numbers of observations falling within cells, is not significantly large. However, if one really wants evidence for goodness of fit, it is better to adopt an equivalence testing approach, in which small values of the chi-squared statistic are evidence for the desired model. This method requires one to define what is meant by equivalence to the desired model, and guidelines are proposed. A simple extension of the classical normalizing transformation for the non-central chi-squared distribution then places these values on an easily interpreted calibration scale for evidence. It is shown that the evidence can distinguish between normal and nearby models, as well as between the Poisson and over-dispersed models. Applications to the evaluation of random number generators and to the uniformity of the digits of pi are included. Sample sizes required to obtain a desired expected evidence for goodness of fit are also provided.
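As a rough illustration of the statistic the abstract refers to, the sketch below computes the Pearson chi-squared statistic, the sum over cells of (observed - expected)^2 / expected, for a uniformity check on decimal digits. The function name `pearson_chi_squared` and the digit counts are hypothetical and chosen only for illustration; they are not taken from the paper, and the paper's evidence transformation and equivalence thresholds are not reproduced here.

```python
def pearson_chi_squared(observed, probs):
    """Pearson statistic: sum over cells of (O - E)^2 / E, with E = n * p."""
    if len(observed) != len(probs):
        raise ValueError("observed and probs must have the same length")
    n = sum(observed)
    expected = [n * p for p in probs]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical counts of the digits 0-9 in a sample of 1000 digits
# (made up for illustration; not data from the paper).
counts = [98, 105, 97, 102, 100, 95, 103, 99, 101, 100]
uniform = [0.1] * 10  # hypothesized model: all ten digits equally likely

stat = pearson_chi_squared(counts, uniform)
df = len(counts) - 1  # degrees of freedom when no parameters are estimated

print(f"Pearson statistic = {stat:.3f} on {df} degrees of freedom")
```

In a classical lack-of-fit test, a large value of `stat` relative to the chi-squared distribution with `df` degrees of freedom counts against the model. Under the equivalence approach described above, it is small values that count as evidence for the model, once a tolerance for "equivalent to the model" has been fixed.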

