Distributed Sparse Linear Regression with Sublinear Communication

09/15/2022
by   Chen Amiraz, et al.
0

We study the problem of high-dimensional sparse linear regression in a distributed setting under both computational and communication constraints. Specifically, we consider a star topology network whereby several machines are connected to a fusion center, with whom they can exchange relatively short messages. Each machine holds noisy samples from a linear regression model with the same unknown sparse d-dimensional vector of regression coefficients θ. The goal of the fusion center is to estimate the vector θ and its support using few computations and limited communication at each machine. In this work, we consider distributed algorithms based on Orthogonal Matching Pursuit (OMP) and theoretically study their ability to exactly recover the support of θ. We prove that under certain conditions, even at low signal-to-noise-ratios where individual machines are unable to detect the support of θ, distributed-OMP methods correctly recover it with total communication sublinear in d. In addition, we present simulations that illustrate the performance of distributed OMP-based algorithms and show that they perform similarly to more sophisticated and computationally intensive methods, and in some cases even outperform them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2023

Distributed Sparse Linear Regression under Communication Constraints

In multiple domains, statistical tasks are performed in distributed sett...
research
02/05/2021

Sparse Normal Means Estimation with Sublinear Communication

We consider the problem of sparse normal means estimation in a distribut...
research
11/14/2017

Sparse High-Dimensional Linear Regression. Algorithmic Barriers and a Local Search Algorithm

We consider a sparse high dimensional regression model where the goal is...
research
11/22/2020

Online Orthogonal Matching Pursuit

Greedy algorithms for feature selection are widely used for recovering s...
research
06/29/2011

A Dirty Model for Multiple Sparse Regression

Sparse linear regression -- finding an unknown vector from linear measur...
research
02/28/2019

Distributed Learning with Sublinear Communication

In distributed statistical learning, N samples are split across m machin...
research
09/22/2017

Estimate Exchange over Network is Good for Distributed Hard Thresholding Pursuit

We investigate an existing distributed algorithm for learning sparse sig...

Please sign up or login with your details

Forgot password? Click here to reset