Distributed Adaptive Huber Regression

07/06/2021
by   Jiyu Luo, et al.
0

Distributed data naturally arise in scenarios involving multiple sources of observations, each stored at a different location. Directly pooling all the data together is often prohibited due to limited bandwidth and storage, or due to privacy protocols. This paper introduces a new robust distributed algorithm for fitting linear regressions when data are subject to heavy-tailed and/or asymmetric errors with finite second moments. The algorithm only communicates gradient information at each iteration and therefore is communication-efficient. Statistically, the resulting estimator achieves the centralized nonasymptotic error bound as if all the data were pooled together and came from a distribution with sub-Gaussian tails. Under a finite (2+δ)-th moment condition, we derive a Berry-Esseen bound for the distributed estimator, based on which we construct robust confidence intervals. Numerical studies further confirm that compared with extant distributed methods, the proposed methods achieve near-optimal accuracy with low variability and better coverage with tighter confidence width.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2022

Distributed Learning for Principle Eigenspaces without Moment Constraints

Distributed Principal Component Analysis (PCA) has been studied to deal ...
research
02/02/2022

Catoni-style confidence sequences for heavy-tailed mean estimation

A confidence sequence (CS) is a sequence of confidence intervals that is...
research
01/23/2023

Quantum Heavy-tailed Bandits

In this paper, we study multi-armed bandits (MAB) and stochastic linear ...
research
06/25/2023

Simple Estimation of Semiparametric Models with Measurement Errors

We develop a practical way of addressing the Errors-In-Variables (EIV) p...
research
10/26/2021

Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs

Despite a large amount of effort in dealing with heavy-tailed error in m...
research
09/01/2018

Optimal Bandwidth Choice for Robust Bias Corrected Inference in Regression Discontinuity Designs

Modern empirical work in Regression Discontinuity (RD) designs employs l...
research
08/22/2018

Sensitivity Analysis using Approximate Moment Condition Models

We consider inference in models defined by approximate moment conditions...

Please sign up or login with your details

Forgot password? Click here to reset