Pathologies in information bottleneck for deterministic supervised learning

08/23/2018
by   Artemy Kolchinsky, et al.
2

Information bottleneck (IB) is a method for extracting information from one random variable X that is relevant for predicting another random variable Y. To do so, IB identifies an intermediate "bottleneck" variable T that has low mutual information I(X;T) and high mutual information I(Y;T). The "IB curve" characterizes the set of bottleneck variables that achieve maximal I(Y;T) for a given I(X;T), and is typically explored by optimizing the "IB Lagrangian", I(Y;T) - β I(X;T). Recently, there has been interest in applying IB to supervised learning, particularly for classification problems that use neural networks. In most classification problems, the output class Y is a deterministic function of the input X, which we refer to as "deterministic supervised learning". We demonstrate three pathologies that arise when IB is used in any scenario where Y is a deterministic function of X: (1) the IB curve cannot be recovered by optimizing the IB Lagrangian for different values of β; (2) there are "uninteresting" solutions at all points of the IB curve; and (3) for classifiers that achieve low error rates, the activity of different hidden layers will not exhibit a strict trade-off between compression and prediction, contrary to a recent proposal. To address problem (1), we propose a functional that, unlike the IB Lagrangian, can recover the IB curve in all cases. We finish by demonstrating these issues on the MNIST dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Disentangled Information Bottleneck

The information bottleneck (IB) method is a technique for extracting inf...
research
11/25/2019

The Convex Information Bottleneck Lagrangian

The information bottleneck (IB) problem tackles the issue of obtaining r...
research
08/22/2023

Information Bottleneck Revisited: Posterior Probability Perspective with Optimal Transport

Information bottleneck (IB) is a paradigm to extract information in one ...
research
05/06/2017

Nonlinear Information Bottleneck

Information bottleneck [IB] is a technique for extracting information in...
research
04/01/2016

The deterministic information bottleneck

Lossy compression and clustering fundamentally involve a decision about ...
research
05/15/2020

On the Information Plane of Autoencoders

The training dynamics of hidden layers in deep learning are poorly under...
research
11/14/2017

The Multi-layer Information Bottleneck Problem

The muti-layer information bottleneck (IB) problem, where information is...

Please sign up or login with your details

Forgot password? Click here to reset