Concentration study of M-estimators using the influence function

04/09/2021
by   Timothée Mathieu, et al.
0

We present a new finite-sample analysis of M-estimators of locations in ℝ^d using the tool of the influence function. In particular, we show that the deviations of an M-estimator can be controlled thanks to its influence function (or its score function) and then, we use concentration inequality on M-estimators to investigate the robust estimation of the mean in high dimension in a corrupted setting (adversarial corruption setting) for bounded and unbounded score functions. For a sample of size n and covariance matrix Σ, we attain the minimax speed √(Tr(Σ)/n)+√(Σ_oplog(1/δ)/n) with probability larger than 1-δ in a heavy-tailed setting. One of the major advantages of our approach compared to others recently proposed is that our estimator is tractable and fast to compute even in very high dimension with a complexity of O(ndlog(Tr(Σ))) where n is the sample size and Σ is the covariance matrix of the inliers. In practice, the code that we make available for this article proves to be very fast.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset