Semi-supervised Sequential Generative Models

06/30/2020
by   Michael Teng, et al.
10

We introduce a novel objective for training deep generative time-series models with discrete latent variables for which supervision is only sparsely available. This instance of semi-supervised learning is challenging for existing methods, because the exponential number of possible discrete latent configurations results in high variance gradient estimators. We first overcome this problem by extending the standard semi-supervised generative modeling objective with reweighted wake-sleep. However, we find that this approach still suffers when the frequency of available labels varies between training sequences. Finally, we introduce a unified objective inspired by teacher-forcing and show that this approach is robust to variable length supervision. We call the resulting method caffeinated wake-sleep (CWS) to emphasize its additional dependence on real data. We demonstrate its effectiveness with experiments on MNIST, handwriting, and fruit fly trajectory data.

READ FULL TEXT

page 4

page 8

research
07/25/2022

Advancing Semi-Supervised Task Oriented Dialog Systems by JSA Learning of Discrete Latent Variable Models

Developing semi-supervised task-oriented dialog (TOD) systems by leverag...
research
01/24/2019

Semi-Unsupervised Learning with Deep Generative Models: Clustering and Classifying using Ultra-Sparse Labels

We introduce semi-unsupervised learning, an extreme case of semi-supervi...
research
09/07/2023

A Probabilistic Semi-Supervised Approach with Triplet Markov Chains

Triplet Markov chains are general generative models for sequential data ...
research
10/03/2019

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

We present a novel generative model that combines state-of-the-art neura...
research
07/14/2017

Guiding InfoGAN with Semi-Supervision

In this paper we propose a new semi-supervised GAN architecture (ss-Info...
research
06/10/2018

A Structured Variational Autoencoder for Contextual Morphological Inflection

Statistical morphological inflectors are typically trained on fully supe...
research
10/29/2018

Semi-crowdsourced Clustering with Deep Generative Models

We consider the semi-supervised clustering problem where crowdsourcing p...

Please sign up or login with your details

Forgot password? Click here to reset