High-Dimensional Independence Testing and Maximum Marginal Correlation

01/04/2020
by   Cencheng Shen, et al.
0

A number of universally consistent dependence measures have been recently proposed for testing independence, such as distance correlation, kernel correlation, multiscale graph correlation, etc. They provide a satisfactory solution for dependence testing in low-dimensions, but often exhibit decreasing power for high-dimensional data, a phenomenon that has been recognized but remains mostly unchartered. In this paper, we aim to better understand the high-dimensional testing scenarios and explore a procedure that is robust against increasing dimension. To that end, we propose the maximum marginal correlation method and characterize high-dimensional dependence structures via the notion of dependent dimensions. We prove that the maximum method can be valid and universally consistent for testing high-dimensional dependence under regularity conditions, and demonstrate when and how the maximum method may outperform other methods. The methodology can be implemented by most existing dependence measures, has a superior testing power in a variety of common high-dimensional settings, and is computationally efficient for big data analysis when using the distance correlation chi-square test.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/30/2019

A New Framework for Distance and Kernel-based Metrics in High Dimensions

The paper presents new metrics to quantify and test for (i) the equality...
research
09/15/2017

Dependence Modeling in Ultra High Dimensions with Vine Copulas and the Graphical Lasso

To model high dimensional data, Gaussian methods are widely used since t...
research
06/19/2023

Invariant correlation under marginal transforms

The Pearson correlation coefficient is generally not invariant under com...
research
02/28/2022

Asymptotic Normality of Gini Correlation in High Dimension with Applications to the K-sample Problem

The categorical Gini correlation proposed by Dang et al. is a dependence...
research
12/27/2019

The Chi-Square Test of Distance Correlation

Distance correlation has gained much recent attention in the statistics ...
research
11/11/2021

Simulating High-Dimensional Multivariate Data using the bigsimr R Package

It is critical to accurately simulate data when employing Monte Carlo te...
research
12/27/2018

How to avoid the zero-power trap in testing for correlation

In testing for correlation of the errors in regression models the power ...

Please sign up or login with your details

Forgot password? Click here to reset