RevUp: Revise and Update Information Bottleneck for Event Representation

05/24/2022
by   Mehdi Rezaee, et al.
0

In machine learning, latent variables play a key role to capture the underlying structure of data, but they are often unsupervised. When we have side knowledge that already has high-level information about the input data, we can use that source to guide latent variables and capture the available background information in a process called "parameter injection." In that regard, we propose a semi-supervised information bottleneck-based model that enables the use of side knowledge, even if it is noisy and imperfect, to direct the learning of discrete latent variables. Fundamentally, we introduce an auxiliary continuous latent variable as a way to reparameterize the model's discrete variables with a light-weight hierarchical structure. With this reparameterization, the model's discrete latent variables are learned to minimize the mutual information between the observed data and optional side knowledge that is not already captured by the new, auxiliary variables. We theoretically show that our approach generalizes an existing method of parameter injection, and perform an empirical case study of our approach on language-based event modeling. We corroborate our theoretical results with strong empirical experiments, showing that the proposed method outperforms previous proposed approaches on multiple datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2022

Semantically-informed Hierarchical Event Modeling

Prior work has shown that coupling sequential latent variable models wit...
research
11/10/2015

Anchored Discrete Factor Analysis

We present a semi-supervised learning algorithm for learning discrete fa...
research
11/20/2020

Lightweight Data Fusion with Conjugate Mappings

We present an approach to data fusion that combines the interpretability...
research
06/18/2012

Modeling Latent Variable Uncertainty for Loss-based Learning

We consider the problem of parameter estimation using weakly supervised ...
research
05/12/2022

Towards Robust Unsupervised Disentanglement of Sequential Data – A Case Study Using Music Audio

Disentangled sequential autoencoders (DSAEs) represent a class of probab...
research
06/02/2022

Weakly Supervised Representation Learning with Sparse Perturbations

The theory of representation learning aims to build methods that provabl...
research
11/14/2018

Extractive Summary as Discrete Latent Variables

In this paper, we compare various methods to compress a text using a neu...

Please sign up or login with your details

Forgot password? Click here to reset