A Family of Exact Goodness-of-Fit Tests for High-Dimensional Discrete Distributions

02/26/2019
by   Feras A. Saad, et al.
0

The objective of goodness-of-fit testing is to assess whether a dataset of observations is likely to have been drawn from a candidate probability distribution. This paper presents a rank-based family of goodness-of-fit tests that is specialized to discrete distributions on high-dimensional domains. The test is readily implemented using a simulation-based, linear-time procedure. The testing procedure can be customized by the practitioner using knowledge of the underlying data domain. Unlike most existing test statistics, the proposed test statistic is distribution-free and its exact (non-asymptotic) sampling distribution is known in closed form. We establish consistency of the test against all alternatives by showing that the test statistic is distributed as a discrete uniform if and only if the samples were drawn from the candidate distribution. We illustrate its efficacy for assessing the sample quality of approximate sampling algorithms over combinatorially large spaces with intractable probabilities, including random partitions in Dirichlet process mixture models and random lattices in Ising models.

READ FULL TEXT
research
08/18/2022

Goodness of fit tests for Rayleigh distribution

We develop a new goodness fit test for Rayleigh distribution for complet...
research
10/08/2020

Conditional Goodness-of-Fit Tests for Discrete Distributions

In this paper, we address the problem of testing goodness-of-fit for dis...
research
12/21/2018

Multinomial Goodness-of-Fit Based on U-Statistics: High-Dimensional Asymptotic and Minimax Optimality

We consider multinomial goodness-of-fit tests in the high-dimensional re...
research
04/06/2019

Goodness of Fit Testing for Dynamic Networks

Numerous networks in the real world change over time, in the sense that ...
research
01/06/2023

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Many testing problems are readily amenable to randomised tests such as t...
research
08/03/2020

A monotonicity property of weighted log-rank tests

The logrank test is a well-known nonparametric test which is often used ...
research
06/05/2018

Distribution free goodness of fit tests for regularly varying tail distributions

We discuss in this paper a possibility of constructing a whole class of ...

Please sign up or login with your details

Forgot password? Click here to reset