Information-Ordered Bottlenecks for Adaptive Semantic Compression

05/18/2023
by   Matthew Ho, et al.
0

We present the information-ordered bottleneck (IOB), a neural layer designed to adaptively compress data into latent variables ordered by likelihood maximization. Without retraining, IOB nodes can be truncated at any bottleneck width, capturing the most crucial information in the first latent variables. Unifying several previous approaches, we show that IOBs achieve near-optimal compression for a given encoding architecture and can assign ordering to latent signals in a manner that is semantically meaningful. IOBs demonstrate a remarkable ability to compress embeddings of image and text data, leveraging the performance of SOTA architectures such as CNNs, transformers, and diffusion models. Moreover, we introduce a novel theory for estimating global intrinsic dimensionality with IOBs and show that they recover SOTA dimensionality estimates for complex synthetic data. Furthermore, we showcase the utility of these models for exploratory analysis through applications on heterogeneous datasets, enabling computer-aided discovery of dataset complexity.

READ FULL TEXT

page 6

page 9

page 12

page 13

research
06/15/2020

Ordering Dimensions with Nested Dropout Normalizing Flows

The latent space of normalizing flows must be of the same dimensionality...
research
07/04/2022

Causal Structure Discovery between Clusters of Nodes Induced by Latent Factors

We consider the problem of learning the structure of a causal directed a...
research
05/06/2020

Stochastic Bottleneck: Rateless Auto-Encoder for Flexible Dimensionality Reduction

We propose a new concept of rateless auto-encoders (RL-AEs) that enable ...
research
09/11/2023

Data efficiency, dimensionality reduction, and the generalized symmetric information bottleneck

The Symmetric Information Bottleneck (SIB), an extension of the more fam...
research
09/14/2022

Lossy Image Compression with Conditional Diffusion Models

Denoising diffusion models have recently marked a milestone in high-qual...
research
10/16/2012

Latent Composite Likelihood Learning for the Structured Canonical Correlation Model

Latent variable models are used to estimate variables of interest quanti...

Please sign up or login with your details

Forgot password? Click here to reset