Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection

06/26/2019
by   Yihe Dong, et al.
0

We study two problems in high-dimensional robust statistics: robust mean estimation and outlier detection. In robust mean estimation the goal is to estimate the mean μ of a distribution on R^d given n independent samples, an ε-fraction of which have been corrupted by a malicious adversary. In outlier detection the goal is to assign an outlier score to each element of a data set such that elements more likely to be outliers are assigned higher scores. Our algorithms for both problems are based on a new outlier scoring method we call QUE-scoring based on quantum entropy regularization. For robust mean estimation, this yields the first algorithm with optimal error rates and nearly-linear running time O(nd) in all parameters, improving on the previous fastest running time O((nd/ε^6, nd^2)). For outlier detection, we evaluate the performance of QUE-scoring via extensive experiments on synthetic and real data, and demonstrate that it often performs better than previously proposed algorithms. Code for these experiments is available at https://github.com/twistedcubic/que-outlier-detection .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2021

Robust Mean Estimation in High Dimensions via Global Outlier Pursuit

We study the robust mean estimation problem in high dimensions, where le...
research
07/30/2020

Outlier Robust Mean Estimation with Subgaussian Rates via Stability

We study the problem of outlier robust high-dimensional mean estimation ...
research
08/21/2020

Robust Mean Estimation in High Dimensions via ℓ_0 Minimization

We study the robust mean estimation problem in high dimensions, where α ...
research
01/19/2023

Robust Chauvenet Rejection: Powerful, but Easy to Use Outlier Detection for Heavily Contaminated Data Sets

In Maples et al. (2018) we introduced Robust Chauvenet Outlier Rejection...
research
06/22/2006

Outlier Robust ICP for Minimizing Fractional RMSD

We describe a variation of the iterative closest point (ICP) algorithm f...
research
12/22/2021

Robust learning of data anomalies with analytically-solvable entropic outlier sparsification

Entropic Outlier Sparsification (EOS) is proposed as a robust computatio...
research
09/22/2019

Outlier-Detection Based Robust Information Fusion for Networked Systems

We consider state estimation for networked systems where measurements fr...

Please sign up or login with your details

Forgot password? Click here to reset