Compressed Predictive Information Coding

03/03/2022
by   Rui Meng, et al.
0

Unsupervised learning plays an important role in many fields, such as artificial intelligence, machine learning, and neuroscience. Compared to static data, methods for extracting low-dimensional structure for dynamic data are lagging. We developed a novel information-theoretic framework, Compressed Predictive Information Coding (CPIC), to extract useful representations from dynamic data. CPIC selectively projects the past (input) into a linear subspace that is predictive about the compressed data projected from the future (output). The key insight of our framework is to learn representations by minimizing the compression complexity and maximizing the predictive information in latent space. We derive variational bounds of the CPIC loss which induces the latent space to capture information that is maximally predictive. Our variational bounds are tractable by leveraging bounds of mutual information. We find that introducing stochasticity in the encoder robustly contributes to better representation. Furthermore, variational approaches perform better in mutual information estimation compared with estimates under a Gaussian assumption. We demonstrate that CPIC is able to recover the latent space of noisy dynamical systems with low signal-to-noise ratios, and extracts features predictive of exogenous variables in neuroscience data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2020

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

We propose Deep Autoencoding Predictive Components (DAPC) – a self-super...
research
07/10/2018

Representation Learning with Contrastive Predictive Coding

While supervised learning has enabled great progress in many application...
research
05/23/2019

Unsupervised Discovery of Temporal Structure in Noisy Data with Dynamical Components Analysis

Linear dimensionality reduction methods are commonly used to extract low...
research
01/01/2023

Deep Correlation-Aware Kernelized Autoencoders for Anomaly Detection in Cybersecurity

Unsupervised learning-based anomaly detection in latent space has gained...
research
03/01/2023

Implementing engrams from a machine learning perspective: matching for prediction

Despite evidence for the existence of engrams as memory support structur...
research
05/16/2020

Mutual Information Maximization for Robust Plannable Representations

Extending the capabilities of robotics to real-world complex, unstructur...
research
12/01/2020

Iterative VAE as a predictive brain model for out-of-distribution generalization

Our ability to generalize beyond training data to novel, out-of-distribu...

Please sign up or login with your details

Forgot password? Click here to reset