A Linear-Time Kernel Goodness-of-Fit Test

05/22/2017
by Wittawat Jitkrittum, et al.

We propose a novel adaptive test of goodness-of-fit, with computational cost linear in the number of samples. We learn the test features that best indicate the differences between observed samples and a reference model, by minimizing the false negative rate. These features are constructed via Stein's method, meaning that it is not necessary to compute the normalising constant of the model. We analyse the asymptotic Bahadur efficiency of the new test, and prove that under a mean-shift alternative, our test always has greater relative efficiency than a previous linear-time kernel test, regardless of the choice of parameters for that test. In experiments, the performance of our method exceeds that of the earlier linear-time test, and matches or exceeds the power of a quadratic-time kernel test. In high dimensions and where model structure may be exploited, our goodness-of-fit test performs far better than a quadratic-time two-sample test based on the Maximum Mean Discrepancy, with samples drawn from the model.
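The abstract does not spell out the statistic, so the following Python sketch only illustrates the general idea of Stein-based features evaluated at a small set of test locations. Everything specific here is an assumption made for illustration: the Gaussian kernel, the bandwidth `sigma2`, the fixed test locations `V`, the function names `stein_features` and `linear_time_stat`, and the particular unbiased estimator. The model enters only through its score function (the gradient of the log density), which is why no normalising constant is needed, and the cost is linear in the number of samples because the pairwise sum is computed from a single pass over per-sample feature vectors.

```python
import numpy as np

def stein_features(X, V, score, sigma2=1.0):
    """Stein features of samples X (n, d) at test locations V (J, d).

    `score` maps an (n, d) array to the (n, d) array of gradients of the
    model's log density; an unnormalised density gives the same score.
    Assumes a Gaussian kernel with bandwidth sigma2 (illustrative choice).
    """
    n, d = X.shape
    J = V.shape[0]
    S = score(X)                                             # (n, d)
    diff = X[:, None, :] - V[None, :, :]                     # (n, J, d)
    K = np.exp(-np.sum(diff ** 2, axis=2) / (2.0 * sigma2))  # (n, J)
    grad_K = -diff / sigma2 * K[:, :, None]                  # d/dx k(x, v)
    Xi = S[:, None, :] * K[:, :, None] + grad_K              # (n, J, d)
    return Xi.reshape(n, J * d) / np.sqrt(d * J)

def linear_time_stat(X, V, score, sigma2=1.0):
    """Unbiased average of feature inner products over pairs i != j.

    Uses ||sum_i t_i||^2 - sum_i ||t_i||^2, so the cost is O(n), not O(n^2).
    """
    T = stein_features(X, V, score, sigma2)
    n = T.shape[0]
    total = T.sum(axis=0)
    return (total @ total - np.sum(T * T)) / (n * (n - 1))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    score_std_normal = lambda X: -X   # score of N(0, I), hypothetical model
    V = np.array([[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]])
    X_null = rng.standard_normal((2000, 2))         # drawn from the model
    X_shift = rng.standard_normal((2000, 2)) + 1.0  # mean-shifted alternative
    print("model samples:  ", linear_time_stat(X_null, V, score_std_normal))
    print("shifted samples:", linear_time_stat(X_shift, V, score_std_normal))
```

In this sketch the test locations are fixed by hand. The abstract instead describes learning the features so as to minimise the false negative rate, which would amount to tuning `V` and `sigma2` on held-out data before computing the statistic.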


research
06/18/2022

Efficient Aggregated Kernel Tests using Incomplete U-statistics

We propose a series of computationally efficient, nonparametric tests fo...
research
11/23/2014

On the High-dimensional Power of Linear-time Kernel Two-Sample Testing under Mean-difference Alternatives

Nonparametric two sample testing deals with the question of consistently...
research
11/27/2022

A Permutation-free Kernel Two-Sample Test

The kernel Maximum Mean Discrepancy (MMD) is a popular multivariate dist...
research
05/22/2016

Interpretable Distribution Features with Maximum Testing Power

Two semimetrics on probability distributions are proposed, given as the ...
research
01/23/2023

Using Excel software to calculate Bayesian factors: taking goodness of fit test (Chi-square test) as an example

Taking the goodness of fit test (Chi test) as an example, this paper att...
research
10/09/2018

A maximum-mean-discrepancy goodness-of-fit test for censored data

We introduce a kernel-based goodness-of-fit test for censored data, wher...
research
11/15/2021

Distribution Compression in Near-linear Time

In distribution compression, one aims to accurately summarize a probabil...
