Asymptotic Distributions of High-Dimensional Nonparametric Inference with Distance Correlation

10/28/2019
by   Lan Gao, et al.
0

Understanding the nonlinear association between a pair of potentially high-dimensional random vectors is encountered frequently in many contemporary big data applications. Distance correlation has become an increasingly popular tool for such a purpose. Most existing works have explored its asymptotic distributions under the independence assumption when only the sample size or the dimensionality diverges. Yet its asymptotic theory for the more realistic setting when both sample size and dimensionality diverge remains largely unexplored. In this paper, we fill such a gap and establish the central limit theorems and the associated rates of convergence for a rescaled test statistic based on the bias-corrected distance correlation in high dimensions under some mild regularity conditions and the null hypothesis of independence between the two random vectors. Our new theoretical results reveal an interesting phenomenon of blessing of dimensionality for high-dimensional nonparametric inference with distance correlation in the sense that the accuracy of normal approximation can increase with dimensionality. The finite-sample performance and advantages of the test statistic are illustrated with several simulation examples and a blockchain application.

READ FULL TEXT
research
08/11/2020

Test for mean matrix in GMANOVA model under heteroscedasticity and non-normality for high-dimensional data

This paper is concerned with the testing bilateral linear hypothesis on ...
research
05/05/2021

A nonparametric test of independence based on L_1-error

We propose a test of mutual independence between random vectors with arb...
research
10/26/2017

From Distance Correlation to Multiscale Generalized Correlation

Understanding and developing a correlation measure that can detect gener...
research
06/13/2016

Tuning-Free Heterogeneity Pursuit in Massive Networks

Heterogeneity is often natural in many contemporary applications involvi...
research
09/30/2021

Two Sample Testing in High Dimension via Maximum Mean Discrepancy

Maximum Mean Discrepancy (MMD) has been widely used in the areas of mach...
research
12/18/2021

High-Dimensional Knockoffs Inference for Time Series Data

The framework of model-X knockoffs provides a flexible tool for exact fi...
research
05/11/2015

Foundational principles for large scale inference: Illustrations through correlation mining

When can reliable inference be drawn in the "Big Data" context? This pap...

Please sign up or login with your details

Forgot password? Click here to reset