Kernel Two-Sample Tests in High Dimension: Interplay Between Moment Discrepancy and Dimension-and-Sample Orders

12/31/2021
by   Jian Yan, et al.
0

Motivated by the increasing use of kernel-based metrics for high-dimensional and large-scale data, we study the asymptotic behavior of kernel two-sample tests when the dimension and sample sizes both diverge to infinity. We focus on the maximum mean discrepancy (MMD) with the kernel of the form k(x,y)=f(x-y_2^2/γ), including MMD with the Gaussian kernel and the Laplacian kernel, and the energy distance as special cases. We derive asymptotic expansions of the kernel two-sample statistics, based on which we establish the central limit theorem (CLT) under both the null hypothesis and the local and fixed alternatives. The new non-null CLT results allow us to perform asymptotic exact power analysis, which reveals a delicate interplay between the moment discrepancy that can be detected by the kernel two-sample tests and the dimension-and-sample orders. The asymptotic theory is further corroborated through numerical studies. Our findings complement those in the recent literature and shed new light on the use of kernel two-sample tests for high-dimensional and large-scale data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/30/2021

Two Sample Testing in High Dimension via Maximum Mean Discrepancy

Maximum Mean Discrepancy (MMD) has been widely used in the areas of mach...
research
02/11/2023

A High-dimensional Convergence Theorem for U-statistics with Applications to Kernel-based Testing

We prove a convergence theorem for U-statistics of degree two, where the...
research
02/19/2019

Interpoint Distance Based Two Sample Tests in High Dimension

In this paper, we study a class of two sample test statistics based on i...
research
10/07/2021

A Fast and Effective Large-Scale Two-Sample Test Based on Kernels

Kernel two-sample tests have been widely used and the development of eff...
research
10/05/2022

A uniform kernel trick for high-dimensional two-sample problems

We use a suitable version of the so-called "kernel trick" to devise two-...
research
02/08/2019

Asymptotics and practical aspects of testing normality with kernel methods

This paper is concerned with testing normality in a Hilbert space based ...
research
02/23/2018

Exponentially Consistent Kernel Two-Sample Tests

Given two sets of independent samples from unknown distributions P and Q...

Please sign up or login with your details

Forgot password? Click here to reset