Robust Linear Regression for General Feature Distribution

02/04/2022
by   Tom Norman, et al.
2

We investigate robust linear regression where data may be contaminated by an oblivious adversary, i.e., an adversary than may know the data distribution but is otherwise oblivious to the realizations of the data samples. This model has been previously analyzed under strong assumptions. Concretely, (i) all previous works assume that the covariance matrix of the features is positive definite; and (ii) most of them assume that the features are centered (i.e. zero mean). Additionally, all previous works make additional restrictive assumption, e.g., assuming that the features are Gaussian or that the corruptions are symmetrically distributed. In this work we go beyond these assumptions and investigate robust regression under a more general set of assumptions: (i) we allow the covariance matrix to be either positive definite or positive semi definite, (ii) we do not necessarily assume that the features are centered, (iii) we make no further assumption beyond boundedness (sub-Gaussianity) of features and measurement noise. Under these assumption we analyze a natural SGD variant for this problem and show that it enjoys a fast convergence rate when the covariance matrix is positive definite. In the positive semi definite case we show that there are two regimes: if the features are centered we can obtain a standard convergence rate; otherwise the adversary can cause any learner to fail arbitrarily.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2023

Symmetric positive semi-definite Fourier estimator of instantaneous variance-covariance matrix

In this paper we propose an estimator of spot covariance matrix which en...
research
10/22/2020

Positive definiteness of the asymptotic covariance matrix of OLS estimators in parsimonious regressions

Recently, Ghysels, Hill, and Motegi (2020) proposed a test for examining...
research
10/25/2020

Learning Sparse Graph Laplacian with K Eigenvector Prior via Iterative GLASSO and Projection

Learning a suitable graph is an important precursor to many graph signal...
research
04/26/2018

Corrected Empirical Bayes Confidence Region in a Multivariate Fay-Herriot Model

In the small area estimation, the empirical best linear unbiased predict...
research
09/14/2023

Spectrum-Aware Adjustment: A New Debiasing Framework with Applications to Principal Components Regression

We introduce a new debiasing framework for high-dimensional linear regre...
research
03/14/2018

Block Diagonally Dominant Positive Definite Sub-optimal Filters and Smoothers

We examine stochastic dynamical systems where the transition matrix, Φ, ...
research
06/07/2019

Robust subgaussian estimation of a mean vector in nearly linear time

We construct an algorithm, running in nearly-linear time, which is robus...

Please sign up or login with your details

Forgot password? Click here to reset