Generalization Error for Linear Regression under Distributed Learning

04/30/2020
by   Martin Hellkvist, et al.
0

Distributed learning facilitates the scaling-up of data processing by distributing the computational burden over several nodes. Despite the vast interest in distributed learning, generalization performance of such approaches is not well understood. We address this gap by focusing on a linear regression setting. We consider the setting where the unknowns are distributed over a network of nodes. We present an analytical characterization of the dependence of the generalization error on the partitioning of the unknowns over nodes. In particular, for the overparameterized case, our results show that while the error on training data remains in the same range as that of the centralized solution, the generalization error of the distributed solution increases dramatically compared to that of the centralized solution when the number of unknowns estimated at any node is close to the number of observations. We further provide numerical examples to verify our analytical expressions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/22/2021

Linear Regression with Distributed Learning: A Generalization Error Perspective

Distributed learning provides an attractive framework for scaling the le...
research
02/04/2022

Improved Information Theoretic Generalization Bounds for Distributed and Federated Learning

We consider information-theoretic bounds on expected generalization erro...
research
09/30/2018

Distributed linear regression by averaging

Modern massive datasets pose an enormous computational burden to practit...
research
02/20/2022

Memorize to Generalize: on the Necessity of Interpolation in High Dimensional Linear Regression

We examine the necessity of interpolation in overparameterized models, t...
research
07/07/2020

Coded Computing for Federated Learning at the Edge

Federated Learning (FL) is an exciting new paradigm that enables trainin...
research
05/26/2023

Generalization Error without Independence: Denoising, Linear Regression, and Transfer Learning

Studying the generalization abilities of linear models with real data is...
research
10/26/2020

Distributed Node-Specific Block-Diagonal LCMV Beamforming in Wireless Acoustic Sensor Networks

This paper derives the analytical solution of a novel distributed node-s...

Please sign up or login with your details

Forgot password? Click here to reset