Optimal Covariance Cleaning for Heavy-Tailed Distributions: Insights from Information Theory

04/27/2023
by   Christian Bongiorno, et al.
0

In optimal covariance cleaning theory, minimizing the Frobenius norm between the true population covariance matrix and a rotational invariant estimator is a key step. This estimator can be obtained asymptotically for large covariance matrices, without knowledge of the true covariance matrix. In this study, we demonstrate that this minimization problem is equivalent to minimizing the loss of information between the true population covariance and the rotational invariant estimator for normal multivariate variables. However, for Student's t distributions, the minimal Frobenius norm does not necessarily minimize the information loss in finite-sized matrices. Nevertheless, such deviations vanish in the asymptotic regime of large matrices, which might extend the applicability of random matrix theory results to Student's t distributions. These distributions are characterized by heavy tails and are frequently encountered in real-world applications such as finance, turbulence, or nuclear physics. Therefore, our work establishes a connection between statistical random matrix theory and estimation theory in physics, which is predominantly based on information theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2019

Affine Invariant Covariance Estimation for Heavy-Tailed Distributions

In this work we provide an estimator for the covariance matrix of a heav...
research
03/17/2023

Efficient nonparametric estimation of Toeplitz covariance matrices

A new nonparametric estimator for Toeplitz covariance matrices is propos...
research
06/06/2023

Entropic covariance models

In covariance matrix estimation, one of the challenges lies in finding a...
research
05/23/2016

Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

Estimation of the covariance matrix has attracted a lot of attention of ...
research
08/16/2021

Mean Test with Fewer Observation than Dimension and Ratio Unbiased Estimator for Correlation Matrix

Hotelling's T-squared test is a classical tool to test if the normal mea...
research
05/23/2021

Compressing Heavy-Tailed Weight Matrices for Non-Vacuous Generalization Bounds

Heavy-tailed distributions have been studied in statistics, random matri...
research
12/22/2021

Asymptotic Learning Requirements for Stealth Attacks

Information-theoretic stealth attacks are data injection attacks that mi...

Please sign up or login with your details

Forgot password? Click here to reset