A fast and accurate kernel-based independence test with applications to high-dimensional and functional data

01/03/2023
by   Jin-Ting Zhang, et al.
0

Testing the dependency between two random variables is an important inference problem in statistics since many statistical procedures rely on the assumption that the two samples are independent. To test whether two samples are independent, a so-called HSIC (Hilbert–Schmidt Independence Criterion)-based test has been proposed. Its null distribution is approximated either by permutation or a Gamma approximation. In this paper, a new HSIC-based test is proposed. Its asymptotic null and alternative distributions are established. It is shown that the proposed test is root-n consistent. A three-cumulant matched chi-squared approximation is adopted to approximate the null distribution of the test statistic. By choosing a proper reproducing kernel, the proposed test can be applied to many different types of data including multivariate, high-dimensional, and functional data. Three simulation studies and two real data applications show that in terms of level accuracy, power, and computational cost, the proposed test outperforms several existing tests for multivariate, high-dimensional, and functional data.

READ FULL TEXT
research
12/23/2022

Two-Sample Test for High-Dimensional Covariance Matrices: a normal-reference approach

Testing the equality of the covariance matrices of two high-dimensional ...
research
10/05/2022

A uniform kernel trick for high-dimensional two-sample problems

We use a suitable version of the so-called "kernel trick" to devise two-...
research
07/28/2023

Multivariate Differential Association Analysis

Identifying how dependence relationships vary across different condition...
research
09/07/2020

Anomaly Detection in Stationary Settings: A Permutation-Based Higher Criticism Approach

Anomaly detection when observing a large number of data streams is essen...
research
01/03/2023

Testing High-dimensional Multinomials with Applications to Text Analysis

Motivated by applications in text mining and discrete distribution infer...
research
08/24/2017

Multivariate Dependency Measure based on Copula and Gaussian Kernel

We propose a new multivariate dependency measure. It is obtained by cons...
research
12/13/2022

Testing the Graph of a Gaussian Graphical Model

The Gaussian graphical model is routinely employed to model the joint di...

Please sign up or login with your details

Forgot password? Click here to reset