Data-driven stabilizations of goodness-of-fit tests

Exact null distributions of goodness-of-fit test statistics are generally challenging to obtain in tractable forms. Practitioners are therefore usually obliged to rely on asymptotic null distributions or Monte Carlo methods, either in the form of a lookup table or carried out on demand, to apply a goodness-of-fit test. Stephens (1970) provided remarkable simple and useful transformations of several classic goodness-of-fit test statistics that stabilized their exact-n critical values for varying sample sizes n. However, detail on the accuracy of these and subsequent transformations in yielding exact p-values, or even deep understanding on the derivation of several transformations, is still scarce nowadays. We illuminate and automatize, using modern tools, the latter stabilization approach to (i) expand its scope of applicability and (ii) yield semi-continuous exact p-values, as opposed to exact critical values for fixed significance levels. We show improvements on the stabilization accuracy of the exact null distributions of the Kolmogorov-Smirnov, Cramér-von Mises, Anderson-Darling, Kuiper, and Watson test statistics. In addition, we provide a parameter-dependent exact-n stabilization for several novel statistics for testing uniformity on the hypersphere of arbitrary dimension. A data application in astronomy illustrates the benefits of the advocated stabilization for quickly analyzing small-to-moderate sequentially-measured samples.

READ FULL TEXT
research
08/13/2021

A new omnibus test of fit based on a characterisation of the uniform distribution

In this paper, we revisit the classical goodness-of-fit problems for uni...
research
04/10/2023

On new omnibus tests of uniformity on the hypersphere

Two new omnibus tests of uniformity for data on the hypersphere are prop...
research
02/21/2019

Modifying the Chi-square and the CMH test for population genetic inference: adapting to over-dispersion

Evolve and resequence studies provide a popular approach to simulate evo...
research
07/07/2023

Mle-equivariance, data transformations and invariant tests of fit

We define data transformations that leave certain classes of distributio...
research
09/16/2019

Exact Semiparametric Inference and Model Selection for Load-Sharing Systems

As a specific proportional hazard rates model, sequential order statisti...
research
03/14/2020

Multivariate goodness-of-Fit tests based on Wasserstein distance

Goodness-of-fit tests based on the empirical Wasserstein distance are pr...
research
08/13/2020

Calibrating the scan statistic: finite sample performance vs. asymptotics

We consider the problem of detecting an elevated mean on an interval wit...

Please sign up or login with your details

Forgot password? Click here to reset