A Permutation-Free Kernel Independence Test

12/18/2022
βˆ™
by   Shubhanshu Shekhar, et al.
βˆ™
0
βˆ™

In nonparametric independence testing, we observe i.i.d. data {(X_i,Y_i)}_i=1^n, where X βˆˆπ’³, Y βˆˆπ’΄ lie in any general spaces, and we wish to test the null that X is independent of Y. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of the underlying U-statistics. Thus, in practice, one often resorts to using permutation testing, which provides a nonasymptotic guarantee at the expense of recalculating the quadratic-time statistics (say) a few hundred times. This paper provides a simple but nontrivial modification of HSIC and dCov (called xHSIC and xdCov, pronounced β€œcross” HSIC/dCov) so that they have a limiting Gaussian distribution under the null, and thus do not require permutations. This requires building on the newly developed theory of cross U-statistics by Kim and Ramdas (2020), and in particular developing several nontrivial extensions of the theory in Shekhar et al. (2022), which developed an analogous permutation-free kernel two-sample test. We show that our new tests, like the originals, are consistent against fixed alternatives, and minimax rate optimal against smooth local alternatives. Numerical simulations demonstrate that compared to the full dCov or HSIC, our variants have the same power up to a √(2) factor, giving practitioners a new option for large problems or data-analysis pipelines where computation, not sample size, could be the bottleneck.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
βˆ™ 11/27/2022

A Permutation-free Kernel Two-Sample Test

The kernel Maximum Mean DiscrepancyΒ (MMD) is a popular multivariate dist...
research
βˆ™ 03/30/2020

Minimax optimality of permutation tests

Permutation tests are widely used in statistics, providing a finite-samp...
research
βˆ™ 10/14/2021

A Distribution-Free Independence Test for High Dimension Data

Test of independence is of fundamental importance in modern data analysi...
research
βˆ™ 12/27/2019

The Chi-Square Test of Distance Correlation

Distance correlation has gained much recent attention in the statistics ...
research
βˆ™ 10/17/2016

BET on Independence

We study the problem of nonparametric dependence detection. Many existin...
research
βˆ™ 12/15/2020

Computation-free Nonparametric testing for Local and Global Spatial Autocorrelation with application to the Canadian Electorate

Measures of local and global spatial association are key tools for explo...
research
βˆ™ 11/01/2019

Revisiting the random shift approach for testing in spatial statistics

We consider the problem of non-parametric testing of independence of two...

Please sign up or login with your details

Forgot password? Click here to reset