Asymptotically Optimal One- and Two-Sample Testing with Kernels

08/27/2019
by Shengyu Zhu, et al.

We characterize the asymptotic performance of nonparametric one- and two-sample testing. The exponential decay rate, or error exponent, of the type-II error probability is used as the asymptotic performance metric, and an optimal test achieves the maximum rate subject to a constant level constraint on the type-I error probability. With Sanov's theorem, we derive a sufficient condition for one-sample tests to achieve the optimal error exponent in the universal setting, i.e., for any distribution defining the alternative hypothesis. We then show that two classes of Maximum Mean Discrepancy (MMD) based tests attain the optimal type-II error exponent on R^d, while the quadratic-time Kernel Stein Discrepancy (KSD) based tests achieve this optimality with an asymptotic level constraint. For general two-sample testing, however, Sanov's theorem is insufficient to obtain a similar sufficient condition. We proceed to establish an extended version of Sanov's theorem and derive an exact error exponent for the quadratic-time MMD based two-sample tests. The obtained error exponent is further shown to be optimal among all two-sample tests satisfying a given level constraint. Our results not only solve a long-standing open problem in information theory and statistics, but also provide an achievability result for optimal nonparametric one- and two-sample testing. Applications to off-line change detection and related issues are also discussed.
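
For readers unfamiliar with the quadratic-time MMD statistic mentioned in the abstract, the following is a minimal sketch of a standard biased MMD^2 estimate between two samples. It is an illustration under stated assumptions, not the authors' procedure: the Gaussian kernel, the `bandwidth` parameter, the function names, and the fixed `threshold` in the decision rule are all hypothetical choices, since the abstract does not specify the kernel or how the test threshold is set.

```python
import math

def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two points given as tuples (assumed kernel)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))

def mmd2_biased(xs, ys, bandwidth=1.0):
    """Biased quadratic-time estimate of the squared MMD between samples xs and ys.

    Computes mean k(x, x') + mean k(y, y') - 2 * mean k(x, y),
    with all pairs included (also the diagonal), hence O((n + m)^2) kernel calls.
    """
    n, m = len(xs), len(ys)
    k_xx = sum(gaussian_kernel(a, b, bandwidth) for a in xs for b in xs) / (n * n)
    k_yy = sum(gaussian_kernel(a, b, bandwidth) for a in ys for b in ys) / (m * m)
    k_xy = sum(gaussian_kernel(a, b, bandwidth) for a in xs for b in ys) / (n * m)
    return k_xx + k_yy - 2.0 * k_xy

def mmd_two_sample_test(xs, ys, threshold, bandwidth=1.0):
    """Reject 'same distribution' when the statistic exceeds a threshold.

    The threshold is a hypothetical input here; choosing it to meet a
    type-I level constraint is outside the scope of this sketch.
    """
    return mmd2_biased(xs, ys, bandwidth) > threshold

if __name__ == "__main__":
    same = [(0.0,), (0.5,), (1.0,)]
    shifted = [(10.0,), (10.5,), (11.0,)]
    print(mmd2_biased(same, same))     # identical samples give a value near zero
    print(mmd2_biased(same, shifted))  # well-separated samples give a larger value
```

The statistic is small when the two samples look alike under the kernel and grows as they separate; the test then compares it against a threshold chosen to satisfy the level constraint on the type-I error.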

research
02/21/2018

Universal Hypothesis Testing with Kernels: Asymptotically Optimal Tests for Goodness of Fit

We characterize the asymptotic performance of nonparametric goodness of ...
research
02/23/2018

Exponentially Consistent Kernel Two-Sample Tests

Given two sets of independent samples from unknown distributions P and Q...
research
10/28/2019

Finite Sample-Size Regime of Testing Against Independence with Communication Constraints

The central problem of Hypothesis Testing (HT) consists in determining t...
research
05/28/2018

Testing Against Independence and a Rényi Information Measure

The achievable error-exponent pairs for the type I and type II errors ar...
research
11/29/2021

Hypothesis Testing of Mixture Distributions using Compressed Data

In this paper we revisit the binary hypothesis testing problem with one-...
research
02/24/2023

On Stein's lemma in hypotheses testing in general non-asymptotic case

The problem of testing two simple hypotheses in a general probability sp...
research
07/05/2018

Smart Meter Privacy: Adversarial Hypothesis Testing Models

Smart meter privacy and privacy-preserving energy management are studied...
