Distributed Sparse Linear Regression under Communication Constraints

01/09/2023
by Rodney Fonseca, et al.

In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and thus a tight communication budget. In this work we focus on distributed learning of a sparse linear regression model under severe communication constraints. We propose several two-round distributed schemes whose communication per machine is sublinear in the data dimension. In our schemes, individual machines compute debiased lasso estimators but send only very few values to the fusion center. On the theoretical front, we analyze one of these schemes and prove that, with high probability, it achieves exact support recovery at low signal-to-noise ratios, where individual machines fail to recover the support. We show in simulations that our scheme works as well as, and in some cases better than, more communication-intensive approaches.
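
The idea of sending only a few values per machine can be illustrated with a toy sketch. The abstract does not say which values are sent, so everything here is an assumption made for illustration: each machine reports the indices of its K largest-magnitude coefficients, and the fusion center keeps the indices that receive enough votes. The names `top_k_indices`, `fuse_votes`, and `simulate` are hypothetical, and the local debiased lasso estimate is replaced by a stand-in (true coefficients plus Gaussian noise) rather than computed from data.

```python
import random
from collections import Counter


def top_k_indices(estimate, k):
    """Indices of the k largest-magnitude entries of a local estimate.

    This is the only message a machine sends: k integers, not d floats.
    """
    order = sorted(range(len(estimate)),
                   key=lambda j: abs(estimate[j]), reverse=True)
    return order[:k]


def fuse_votes(messages, min_votes):
    """Fusion center: keep indices reported by at least min_votes machines."""
    votes = Counter(j for msg in messages for j in msg)
    return sorted(j for j, count in votes.items() if count >= min_votes)


def simulate(d=200, support=(3, 50, 120), signal=1.0, noise_sd=1.0,
             machines=40, k=5, min_votes=4, seed=0):
    """Toy run. ASSUMPTION: each local estimate is truth + Gaussian noise,
    standing in for a debiased lasso estimator that is not computed here."""
    rng = random.Random(seed)
    messages = []
    for _ in range(machines):
        estimate = [(signal if j in support else 0.0) + rng.gauss(0.0, noise_sd)
                    for j in range(d)]
        messages.append(top_k_indices(estimate, k))
    return messages, fuse_votes(messages, min_votes)


if __name__ == "__main__":
    messages, estimated_support = simulate()
    print("first machine sent:", messages[0])
    print("fusion center estimate:", estimated_support)
```

In this sketch each machine transmits K indices instead of a length-d vector, which is the sense in which communication is sublinear in the dimension. The vote threshold `min_votes` is a free parameter of the toy example, not a value taken from the paper.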

research
09/15/2022

Distributed Sparse Linear Regression with Sublinear Communication

We study the problem of high-dimensional sparse linear regression in a d...
research
02/05/2021

Sparse Normal Means Estimation with Sublinear Communication

We consider the problem of sparse normal means estimation in a distribut...
research
03/14/2015

Communication-efficient sparse regression: a one-shot approach

We devise a one-shot approach to distributed sparse regression in the hi...
research
02/28/2019

Distributed Learning with Sublinear Communication

In distributed statistical learning, N samples are split across m machin...
research
01/22/2021

Linear Regression with Distributed Learning: A Generalization Error Perspective

Distributed learning provides an attractive framework for scaling the le...
research
03/06/2021

Linear Regression over Networks with Communication Guarantees

A key functionality of emerging connected autonomous systems such as sma...
research
03/30/2016

Towards Geo-Distributed Machine Learning

Latency to end-users and regulatory requirements push large companies to...
