Information Bottleneck and its Applications in Deep Learning

04/07/2019
by   Hassan Hafez-Kolahi, et al.
0

Information Theory (IT) has been used in Machine Learning (ML) from early days of this field. In the last decade, advances in Deep Neural Networks (DNNs) have led to surprising improvements in many applications of ML. The result has been a paradigm shift in the community toward revisiting previous ideas and applications in this new framework. Ideas from IT are no exception. One of the ideas which is being revisited by many researchers in this new era, is Information Bottleneck (IB); a formulation of information extraction based on IT. The IB is promising in both analyzing and improving DNNs. The goal of this survey is to review the IB concept and demonstrate its applications in deep learning. The information theoretic nature of IB, makes it also a good candidate in showing the more general concept of how IT can be used in ML. Two important concepts are highlighted in this narrative on the subject, i) the concise and universal view that IT provides on seemingly unrelated methods of ML, demonstrated by explaining how IB relates to minimal sufficient statistics, stochastic gradient descent, and variational auto-encoders, and ii) the common technical mistakes and problems caused by applying ideas from IT, which is discussed by a careful study of some recent methods suffering from them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2021

A Critical Review of Information Bottleneck Theory and its Applications to Deep Learning

In the past decade, deep neural networks have seen unparalleled improvem...
research
04/30/2020

The Information Bottleneck Problem and Its Applications in Machine Learning

Inference capabilities of machine learning (ML) systems skyrocketed in r...
research
11/15/2022

The Lean Data Scientist: Recent Advances towards Overcoming the Data Bottleneck

Machine learning (ML) is revolutionizing the world, affecting almost eve...
research
08/17/2021

InfoGram and Admissible Machine Learning

We have entered a new era of machine learning (ML), where the most accur...
research
04/13/2023

MLOps Spanning Whole Machine Learning Life Cycle: A Survey

Google AlphaGos win has significantly motivated and sped up machine lear...
research
10/21/2020

Deep Neural Networks Are Congestion Games: From Loss Landscape to Wardrop Equilibrium and Beyond

The theoretical analysis of deep neural networks (DNN) is arguably among...
research
03/09/2023

Variational formulations of ODE-Net as a mean-field optimal control problem and existence results

This paper presents a mathematical analysis of ODE-Net, a continuum mode...

Please sign up or login with your details

Forgot password? Click here to reset