Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

05/23/2016
by   Stanislav Minsker, et al.
0

Estimation of the covariance matrix has attracted a lot of attention of the statistical research community over the years, partially due to important applications such as Principal Component Analysis. However, frequently used empirical covariance estimator (and its modifications) is very sensitive to outliers in the data. As P. J. Huber wrote in 1964, "...This raises a question which could have been asked already by Gauss, but which was, as far as I know, only raised a few years ago (notably by Tukey): what happens if the true distribution deviates slightly from the assumed normal one? As is now well known, the sample mean then may have a catastrophically bad performance..." Motivated by this question, we develop a new estimator of the (element-wise) mean of a random matrix, which includes covariance estimation problem as a special case. Assuming that the entries of a matrix possess only finite second moment, this new estimator admits sub-Gaussian or sub-exponential concentration around the unknown mean in the operator norm. We will explain the key ideas behind our construction, as well as applications to covariance estimation and matrix completion problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2018

Robust Modifications of U-statistics and Applications to Covariance Estimation Problems

Let Y be a d-dimensional random vector with unknown mean μ and covarianc...
research
06/05/2020

Reliable Covariance Estimation

Covariance or scatter matrix estimation is ubiquitous in most modern sta...
research
04/27/2023

Optimal Covariance Cleaning for Heavy-Tailed Distributions: Insights from Information Theory

In optimal covariance cleaning theory, minimizing the Frobenius norm bet...
research
06/08/2020

Estimating High-dimensional Covariance and Precision Matrices under General Missing Dependence

A sample covariance matrix S of completely observed data is the key stat...
research
05/12/2015

Detecting the large entries of a sparse covariance matrix in sub-quadratic time

The covariance matrix of a p-dimensional random variable is a fundamenta...
research
08/30/2023

A Parameter-Free Two-Bit Covariance Estimator with Improved Operator Norm Error Rate

A covariance matrix estimator using two bits per entry was recently deve...
research
09/06/2022

A spectral least-squares-type method for heavy-tailed corrupted regression with unknown covariance & heterogeneous noise

We revisit heavy-tailed corrupted least-squares linear regression assumi...

Please sign up or login with your details

Forgot password? Click here to reset