Gaining Outlier Resistance with Progressive Quantiles: Fast Algorithms and Theoretical Studies

12/15/2021
by   Yiyuan She, et al.
0

Outliers widely occur in big-data applications and may severely affect statistical estimation and inference. In this paper, a framework of outlier-resistant estimation is introduced to robustify an arbitrarily given loss function. It has a close connection to the method of trimming and includes explicit outlyingness parameters for all samples, which in turn facilitates computation, theory, and parameter tuning. To tackle the issues of nonconvexity and nonsmoothness, we develop scalable algorithms with implementation ease and guaranteed fast convergence. In particular, a new technique is proposed to alleviate the requirement on the starting point such that on regular datasets, the number of data resamplings can be substantially reduced. Based on combined statistical and computational treatments, we are able to perform nonasymptotic analysis beyond M-estimation. The obtained resistant estimators, though not necessarily globally or even locally optimal, enjoy minimax rate optimality in both low dimensions and high dimensions. Experiments in regression, classification, and neural networks show excellent performance of the proposed methodology at the occurrence of gross outliers.

READ FULL TEXT

page 23

page 24

research
12/15/2021

On Generalization and Computation of Tukey's Depth: Part I

Tukey's depth offers a powerful tool for nonparametric inference and est...
research
06/24/2023

High-dimensional outlier detection and variable selection via adaptive weighted mean regression

This paper proposes an adaptive penalized weighted mean regression for o...
research
12/16/2019

Detecting and Classifying Outliers in Big Functional Data

This paper proposes two new outlier detection methods, which are useful ...
research
07/18/2017

Exploring Outliers in Crowdsourced Ranking for QoE

Outlier detection is a crucial part of robust evaluation for crowdsource...
research
10/08/2016

Indirect Gaussian Graph Learning beyond Gaussianity

This paper studies how to capture dependency graph structures from real ...
research
05/23/2020

A New Algorithm using Component-wise Adaptive Trimming For Robust Mixture Regression

Mixture regression provides a statistical model for teasing out latent h...

Please sign up or login with your details

Forgot password? Click here to reset