Multinomial Goodness-of-Fit Based on U-Statistics: High-Dimensional Asymptotic and Minimax Optimality

12/21/2018
by   Ilmun Kim, et al.
0

We consider multinomial goodness-of-fit tests in the high-dimensional regime where the number of bins increases with the sample size. In this regime, Pearson's chi-squared test can suffer from low power due to the substantial bias as well as high variance of its statistic. To resolve these issues, we introduce a family of U-statistic for multinomial goodness-of-fit and study their asymptotic behaviors in high-dimensions. Specifically, we establish conditions under which the considered U-statistic is asymptotically Poisson or Gaussian, and investigate its power function under each asymptotic regime. Furthermore, we introduce a class of weights for the U-statistic that results in minimax rate optimal tests.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/24/2022

On the intermediate asymptotic efficiency of goodness-of-fit tests in multinomial distribution

We consider goodness-of-fit tests for uniformity of a multinomial distri...
research
05/10/2020

Statistical inference for the EU portfolio in high dimensions

In this paper, using the shrinkage-based approach for portfolio weights ...
research
11/10/2020

Dimension-agnostic inference

Classical asymptotic theory for statistical inference usually involves c...
research
12/03/2019

Evidence for goodness of fit in Karl Pearson chi-squared statistics

Chi-squared tests for lack of fit are traditionally employed to find evi...
research
02/26/2019

A Family of Exact Goodness-of-Fit Tests for High-Dimensional Discrete Distributions

The objective of goodness-of-fit testing is to assess whether a dataset ...
research
09/10/2020

Fast calculation of p-values for one-sided Kolmogorov-Smirnov type statistics

We present a method for computing exact p-values for a large family of o...
research
02/22/2018

Multidimensional multiscale scanning in Exponential Families: Limit theory and statistical consequences

In this paper we consider the problem of finding anomalies in a d-dimens...

Please sign up or login with your details

Forgot password? Click here to reset