Large Dimensional Analysis of Robust M-Estimators of Covariance with Outliers

03/04/2015
by   David Morales-Jimenez, et al.
0

A large dimensional characterization of robust M-estimators of covariance (or scatter) is provided under the assumption that the dataset comprises independent (essentially Gaussian) legitimate samples as well as arbitrary deterministic samples, referred to as outliers. Building upon recent random matrix advances in the area of robust statistics, we specifically show that the so-called Maronna M-estimator of scatter asymptotically behaves similar to well-known random matrices when the population and sample sizes grow together to infinity. The introduction of outliers leads the robust estimator to behave asymptotically as the weighted sum of the sample outer products, with a constant weight for all legitimate samples and different weights for the outliers. A fine analysis of this structure reveals importantly that the propensity of the M-estimator to attenuate (or enhance) the impact of outliers is mostly dictated by the alignment of the outliers with the inverse population covariance matrix of the legitimate samples. Thus, robust M-estimators can bring substantial benefits over more simplistic estimators such as the per-sample normalized version of the sample covariance matrix, which is not capable of differentiating the outlying samples. The analysis shows that, within the class of Maronna's estimators of scatter, the Huber estimator is most favorable for rejecting outliers. On the contrary, estimators more similar to Tyler's scale invariant estimator (often preferred in the literature) run the risk of inadvertently enhancing some outliers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2022

An Improvement on the Hotelling T^2 Test Using the Ledoit-Wolf Nonlinear Shrinkage Estimator

Hotelling's T^2 test is a classical approach for discriminating the mean...
research
04/26/2018

Large-dimensional behavior of regularized Maronna's M-estimators of covariance matrices

Robust estimators of large covariance matrices are considered, comprisin...
research
09/11/2023

A Note on Location Parameter Estimation using the Weighted Hodges-Lehmann Estimator

Robust design is one of the main tools employed by engineers for the fac...
research
12/08/2019

Improved Covariance Matrix Estimator using Shrinkage Transformation and Random Matrix Theory

One of the major challenges in multivariate analysis is the estimation o...
research
10/12/2019

Real-time outlier detection for large datasets by RT-DetMCD

Modern industrial machines can generate gigabytes of data in seconds, fr...
research
01/24/2019

Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians

We consider deep classifying neural networks. We expose a structure in t...
research
09/15/2022

The Influence Function of Graphical Lasso Estimators

The precision matrix that encodes conditional linear dependency relation...

Please sign up or login with your details

Forgot password? Click here to reset