Statistically Optimal Robust Mean and Covariance Estimation for Anisotropic Gaussians

01/21/2023
by   Arshak Minasyan, et al.
0

Assume that X_1, …, X_N is an ε-contaminated sample of N independent Gaussian vectors in ℝ^d with mean μ and covariance Σ. In the strong ε-contamination model we assume that the adversary replaced an ε fraction of vectors in the original Gaussian sample by any other vectors. We show that there is an estimator μ of the mean satisfying, with probability at least 1 - δ, a bound of the form μ - μ_2 ≤ c(√(Tr(Σ)/N) + √(Σlog(1/δ)/N) + ε√(Σ)), where c > 0 is an absolute constant and Σ denotes the operator norm of Σ. In the same contaminated Gaussian setup, we construct an estimator Σ of the covariance matrix Σ that satisfies, with probability at least 1 - δ, Σ - Σ≤ c(√(ΣTr(Σ)/N) + Σ√(log(1/δ)/N) + εΣ). Both results are optimal up to multiplicative constant factors. Despite the recent significant interest in robust statistics, achieving both dimension-free bounds in the canonical Gaussian case remained open. In fact, several previously known results were either dimension-dependent and required Σ to be close to identity, or had a sub-optimal dependence on the contamination level ε. As a part of the analysis, we derive sharp concentration inequalities for central order statistics of Gaussian, folded normal, and chi-squared distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2018

Robust covariance estimation under L_4-L_2 norm equivalence

Let X be a centered random vector taking values in R^d and let Σ= E(X⊗ X...
research
11/28/2019

Finite impulse response models: A non-asymptotic analysis of the least squares estimator

We consider a finite impulse response system with centered independent s...
research
05/17/2022

Covariance Estimation: Optimal Dimension-free Guarantees for Adversarial Corruption and Heavy Tails

We provide an estimator of the covariance matrix that achieves the optim...
research
08/28/2023

Sharper dimension-free bounds on the Frobenius distance between sample covariance and its expectation

We study properties of a sample covariance estimate Σ= (𝐗_1 𝐗_1^⊤ + … + ...
research
08/18/2021

Dimension-free Bounds for Sums of Independent Matrices and Simple Tensors via the Variational Principle

We consider the deviation inequalities for the sums of independent d by ...
research
04/05/2022

Nearly minimax robust estimator of the mean vector by iterative spectral dimension reduction

We study the problem of robust estimation of the mean vector of a sub-Ga...
research
09/01/2021

Nonasymptotic one-and two-sample tests in high dimension with unknown covariance structure

Let 𝐗 = (X_i)_1≤ i ≤ n be an i.i.d. sample of square-integrable variable...

Please sign up or login with your details

Forgot password? Click here to reset