Asymptotic Normality of Gini Correlation in High Dimension with Applications to the K-sample Problem

02/28/2022
by   Yongli Sang, et al.
0

The categorical Gini correlation proposed by Dang et al. is a dependence measure between a categorical and a numerical variables, which can characterize independence of the two variables. The asymptotic distributions of the sample correlation under the dependence and independence have been established when the dimension of the numerical variable is fixed. However, its asymptotic distribution for high dimensional data has not been explored. In this paper, we develop the central limit theorem for the Gini correlation for the more realistic setting where the dimensionality of the numerical variable is diverging. We then construct a powerful and consistent test for the K-sample problem based on the asymptotic normality. The proposed test not only avoids computation burden but also gains power over the permutation procedure. Simulation studies and real data illustrations show that the proposed test is more competitive to existing methods across a broad range of realistic situations, especially in unbalanced cases.

READ FULL TEXT
research
08/01/2019

Jackknife Empirical Likelihood Approach for K-sample Tests

The categorical Gini correlation is an alternative measure of dependence...
research
09/26/2018

A new Gini correlation between quantitative and qualitative variables

We propose a new Gini correlation to measure dependence between a catego...
research
04/16/2019

Distribution and correlation free two-sample test of high-dimensional means

We propose a two-sample test for high-dimensional means that requires ne...
research
02/20/2023

On relationships between Chatterjee's and Spearman's correlation coefficients

In his seminal work, Chatterjee (2021) introduced a novel correlation me...
research
11/28/2017

Latent Association Mining in Binary Data

We consider the problem of identifying groups of mutually associated var...
research
01/04/2020

High-Dimensional Independence Testing and Maximum Marginal Correlation

A number of universally consistent dependence measures have been recentl...
research
10/14/2021

A Distribution-Free Independence Test for High Dimension Data

Test of independence is of fundamental importance in modern data analysi...

Please sign up or login with your details

Forgot password? Click here to reset