Distributed Sparse Regression via Penalization

11/12/2021
by Yao Ji, et al.

We study sparse linear regression over a network of agents, modeled as an undirected graph with no centralized node. The estimation problem is formulated as the minimization of the sum of the local LASSO loss functions plus a quadratic penalty on the consensus constraint; the penalty is what makes distributed solution methods possible. While penalty-based consensus methods have been extensively studied in the optimization literature, their statistical and computational guarantees in the high-dimensional setting remain unclear. This work addresses that open problem. Our contribution is two-fold. First, we establish statistical consistency of the estimator: under a suitable choice of the penalty parameter, the optimal solution of the penalized problem achieves the near-optimal minimax rate 𝒪(s log d/N) in ℓ_2-loss, where s is the sparsity value, d is the ambient dimension, and N is the total sample size in the network. This matches centralized sample rates. Second, we show that the proximal-gradient algorithm applied to the penalized problem, which naturally leads to distributed implementations, converges linearly up to a tolerance of the order of the centralized statistical error. The rate scales as 𝒪(d), revealing an unavoidable speed-accuracy dilemma. Numerical results demonstrate the tightness of the derived sample rate and convergence rate scalings.
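The penalized formulation and the proximal-gradient iteration described above can be sketched in a few lines of NumPy. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not give the exact scaling of the objective, so the sketch assumes the form sum_i [ (1/(2 n_i)) ||y_i - A_i x_i||^2 + lam ||x_i||_1 ] + (1/(2 gamma)) tr(X^T (I - W) X), where W is a symmetric, doubly stochastic mixing matrix matching the graph. The names `penalized_lasso_prox_grad`, `objective`, `lam`, `gamma`, and `step` are hypothetical.

```python
import numpy as np

def soft_threshold(v, tau):
    # Proximal operator of tau * ||.||_1, applied entrywise.
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def objective(X, A, y, W, lam, gamma):
    # Assumed penalized objective: local LASSO losses plus quadratic consensus penalty.
    fit = sum(np.sum((A[i] @ X[i] - y[i]) ** 2) / (2 * A[i].shape[0])
              for i in range(len(A)))
    l1 = lam * np.abs(X).sum()
    penalty = np.sum(X * (X - W @ X)) / (2 * gamma)
    return fit + l1 + penalty

def penalized_lasso_prox_grad(A, y, W, lam, gamma, step, iters=500):
    """Proximal gradient on the assumed penalized problem.

    A, y  : lists of local designs (n_i, d) and responses (n_i,), one per agent.
    W     : (m, m) symmetric mixing matrix; W[i, j] != 0 only for neighbors.
    Row i of the iterate X is the local estimate held by agent i.
    """
    m, d = len(A), A[0].shape[1]
    X = np.zeros((m, d))
    for _ in range(iters):
        # Gradient of each local least-squares loss (uses only agent i's data).
        G = np.stack([A[i].T @ (A[i] @ X[i] - y[i]) / A[i].shape[0]
                      for i in range(m)])
        # Gradient of the consensus penalty: row i needs only neighbors' iterates.
        P = (X - W @ X) / gamma
        # Gradient step on the smooth part, then the l1 proximal step.
        X = soft_threshold(X - step * (G + P), step * lam)
    return X
```

Because row i of `W @ X` involves only the neighbors of agent i, each iteration needs one round of neighbor-to-neighbor communication, which is the sense in which the update is distributed. In this sketch a step size no larger than 1/L, with L at most the largest eigenvalue of any A_i^T A_i / n_i plus 2/gamma, keeps the objective non-increasing; a smaller gamma enforces consensus more strongly but increases L, which forces a smaller step.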

research
01/21/2022

High-Dimensional Inference over Networks: Linear Convergence and Statistical Guarantees

We study sparse linear regression over a network of agents, modeled as a...
research
08/17/2021

Non-Asymptotic Bounds for the ℓ_∞ Estimator in Linear Regression with Uniform Noise

The Chebyshev or ℓ_∞ estimator is an unconventional alternative to the o...
research
10/21/2019

High-dimensional robust approximated M-estimators for mean regression with asymmetric data

Asymmetry along with heteroscedasticity or contamination often occurs wi...
research
04/24/2011

Scaled Sparse Linear Regression

Scaled sparse linear regression jointly estimates the regression coeffic...
research
08/21/2022

High-Dimensional Composite Quantile Regression: Optimal Statistical Guarantees and Fast Algorithms

The composite quantile regression (CQR) was introduced by Zou and Yuan [...
research
12/07/2021

Mesh-Based Solutions for Nonparametric Penalized Regression

It is often of interest to estimate regression functions non-parametrica...
research
04/26/2023

A Statistical Interpretation of the Maximum Subarray Problem

Maximum subarray is a classical problem in computer science that given a...
