Two-sample Statistics Based on Anisotropic Kernels

09/14/2017
by   Xiuyuan Cheng, et al.
0

The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) statistic for measuring the distance between two distributions given finitely-many multivariate samples. When the distributions are locally low-dimensional, the proposed test can be made more powerful to distinguish certain alternatives by incorporating local covariance matrices and constructing an anisotropic kernel. The kernel matrix is asymmetric; it computes the affinity between n data points and a set of n_R reference points, where n_R can be drastically smaller than n. While the proposed statistic can be viewed as a special class of Reproducing Kernel Hilbert Space MMD, the consistency of the test is proved, under mild assumptions of the kernel, as long as p-q∼ O(n^-1/2+δ) for any δ>0, based on a result of convergence in distribution of the test statistic. Applications to flow cytometry and diffusion MRI datasets are demonstrated, which motivate the proposed approach to compare distributions.

READ FULL TEXT

page 6

page 23

page 25

page 27

page 28

page 30

research
10/20/2019

The Exact Equivalence of Independence Testing and Two-Sample Testing

Testing independence and testing equality of distributions are two tight...
research
02/10/2016

A Kernelized Stein Discrepancy for Goodness-of-fit Tests and Model Evaluation

We derive a new discrepancy statistic for measuring differences between ...
research
02/27/2019

Multiple Kernel Learning from U-Statistics of Empirical Measures in the Feature Space

We propose a novel data-driven method to learn multiple kernels in kerne...
research
05/05/2020

Measuring the Discrepancy between Conditional Distributions: Methods, Properties and Applications

We propose a simple yet powerful test statistic to quantify the discrepa...
research
04/11/2019

Comparing a Large Number of Multivariate Distributions

In this paper, we propose a test for the equality of multiple distributi...
research
12/11/2018

Bounding the Error From Reference Set Kernel Maximum Mean Discrepancy

In this paper, we bound the error induced by using a weighted skeletoniz...
research
06/06/2021

Neural Tangent Kernel Maximum Mean Discrepancy

We present a novel neural network Maximum Mean Discrepancy (MMD) statist...

Please sign up or login with your details

Forgot password? Click here to reset