Measuring dependence between random vectors via optimal transport

04/28/2021
by   Gilles Mordant, et al.
0

To quantify the dependence between two random vectors of possibly different dimensions, we propose to rely on the properties of the 2-Wasserstein distance. We first propose two coefficients that are based on the Wasserstein distance between the actual distribution and a reference distribution with independent components. The coefficients are normalized to take values between 0 and 1, where 1 represents the maximal amount of dependence possible given the two multivariate margins. We then make a quasi-Gaussian assumption that yields two additional coefficients rooted in the same ideas as the first two. These different coefficients are more amenable for distributional results and admit attractive formulas in terms of the joint covariance or correlation matrix. Furthermore, maximal dependence is proved to occur at the covariance matrix with minimal von Neumann entropy given the covariance matrices of the two multivariate margins. This result also helps us revisit the RV coefficient by proposing a sharper normalisation. The two coefficients based on the quasi-Gaussian approach can be estimated easily via the empirical covariance matrix. The estimators are asymptotically normal and their asymptotic variances are explicit functions of the covariance matrix, which can thus be estimated consistently too. The results extend to the Gaussian copula case, in which case the estimators are rank-based. The results are illustrated through theoretical examples, Monte Carlo simulations, and a case study involving electroencephalography data.

READ FULL TEXT
research
03/21/2021

Asymptotic distribution for the proportional covariance model

Asymptotic distribution for the proportional covariance model under mult...
research
05/17/2021

Eigenvalue distribution of a high-dimensional distance covariance matrix with application

We introduce a new random matrix model called distance covariance matrix...
research
06/11/2021

Statistical Analysis from the Fourier Integral Theorem

Taking the Fourier integral theorem as our starting point, in this paper...
research
06/21/2022

Tyler's and Maronna's M-estimators: Non-Asymptotic Concentration Results

Tyler's and Maronna's M-estimators, as well as their regularized variant...
research
02/12/2021

Fast Non-Asymptotic Testing And Support Recovery For Large Sparse Toeplitz Covariance Matrices

We consider n independent p-dimensional Gaussian vectors with covariance...
research
02/27/2023

Parametric dependence between random vectors via copula-based divergence measures

This article proposes copula-based dependence quantification between mul...
research
03/23/2020

JPEG Steganography and Synchronization of DCT Coefficients for a Given Development Pipeline

This short paper proposes to use the statistical analysis of the correla...

Please sign up or login with your details

Forgot password? Click here to reset