Information Bottleneck Methods for Distributed Learning

10/26/2018
by   Parinaz Farajiparvar, et al.
0

We study a distributed learning problem in which Alice sends a compressed distillation of a set of training data to Bob, who uses the distilled version to best solve an associated learning problem. We formalize this as a rate-distortion problem in which the training set is the source and Bob's cross-entropy loss is the distortion measure. We consider this problem for unsupervised learning for batch and sequential data. In the batch data, this problem is equivalent to the information bottleneck (IB), and we show that reduced-complexity versions of standard IB methods solve the associated rate-distortion problem. For the streaming data, we present a new algorithm, which may be of independent interest, that solves the rate-distortion problem for Gaussian sources. Furthermore, to improve the results of the iterative algorithm for sequential data we introduce a two-pass version of this algorithm. Finally, we show the dependency of the rate on the number of samples k required for Gaussian sources to ensure cross-entropy loss that scales optimally with the growth of the training set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2022

On the Rényi Cross-Entropy

The Rényi cross-entropy measure between two distributions, a generalizat...
research
11/09/2018

Vector Gaussian CEO Problem Under Logarithmic Loss and Applications

We study the vector Gaussian CEO problem under logarithmic loss distorti...
research
12/18/2007

The source coding game with a cheating switcher

Motivated by the lossy compression of an active-vision video stream, we ...
research
11/27/2017

The Time-Invariant Multidimensional Gaussian Sequential Rate-Distortion Problem Revisited

We revisit the sequential rate-distortion (SRD) trade-off problem for ve...
research
01/17/2018

Rate-Distortion Performance of Sequential Massive Random Access to Gaussian Sources with Memory

In Sequential Massive Random Access (SMRA), a set of correlated sources ...
research
11/02/2020

On the Relevance-Complexity Region of Scalable Information Bottleneck

The Information Bottleneck method is a learning technique that seeks a r...

Please sign up or login with your details

Forgot password? Click here to reset