Generalized Resilience and Robust Statistics

09/19/2019
by   Banghua Zhu, et al.
0

Robust statistics traditionally focuses on outliers, or perturbations in total variation distance. However, a dataset could be corrupted in many other ways, such as systematic measurement errors and missing covariates. We generalize the robust statistics approach to consider perturbations under any Wasserstein distance, and show that robust estimation is possible whenever a distribution's population statistics are robust under a certain family of friendly perturbations. This generalizes a property called resilience previously employed in the special case of mean estimation with outliers. We justify the generalized resilience property by showing that it holds under moment or hypercontractive conditions. Even in the total variation case, these subsume conditions in the literature for mean estimation, regression, and covariance estimation; the resulting analysis simplifies and sometimes improves these known results in both population limit and finite-sample rate. Our robust estimators are based on minimum distance (MD) functionals (Donoho and Liu, 1988), which project onto a set of distributions under a discrepancy related to the perturbation. We present two approaches for designing MD estimators with good finite-sample rates: weakening the discrepancy and expanding the set of distributions. We also present connections to Gao et al. (2019)'s recent analysis of generative adversarial networks for robust estimation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2019

Finite sample properties of parametric MMD estimation: robustness to misspecification and dependence

Many works in statistics aim at designing a universal estimation procedu...
research
02/02/2022

Robust Estimation for Nonparametric Families via Generative Adversarial Networks

We provide a general framework for designing Generative Adversarial Netw...
research
10/06/2021

Robust Generalized Method of Moments: A Finite Sample Viewpoint

For many inference problems in statistics and econometrics, the unknown ...
research
06/11/2018

Robust test statistics for the two-way MANOVA based on the minimum covariance determinant estimator

Robust test statistics for the two-way MANOVA based on the minimum covar...
research
01/16/2023

Theoretical and computational aspects of robust optimal transportation, with applications to statistics and machine learning

Optimal transport (OT) theory and the related p-Wasserstein distance (W_...
research
10/04/2018

Robust Estimation and Generative Adversarial Nets

Robust estimation under Huber's ϵ-contamination model has become an impo...
research
03/15/2017

Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

We introduce a criterion, resilience, which allows properties of a datas...

Please sign up or login with your details

Forgot password? Click here to reset