A Stable Variational Autoencoder for Text Modelling

11/13/2019
by   Ruizhe Li, et al.
0

Variational Autoencoder (VAE) is a powerful method for learning representations of high-dimensional data. However, VAEs can suffer from an issue known as latent variable collapse (or KL loss vanishing), where the posterior collapses to the prior and the model will ignore the latent codes in generative tasks. Such an issue is particularly prevalent when employing VAE-RNN architectures for text modelling (Bowman et al., 2016). In this paper, we present a simple architecture called holistic regularisation VAE (HR-VAE), which can effectively avoid latent variable collapse. Compared to existing VAE-RNN architectures, we show that our model can achieve much more stable training process and can generate text with significantly better quality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2020

Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation

The Variational Autoencoder (VAE) is a popular and powerful model applie...
research
08/17/2019

Improve variational autoEncoder with auxiliary softmax multiclassifier

As a general-purpose generative model architecture, VAE has been widely ...
research
09/14/2018

Unsupervised Abstractive Sentence Summarization using Length Controlled Variational Autoencoder

In this work we present a unsupervised approach to summarize sentences i...
research
10/31/2018

Dirichlet Variational Autoencoder for Text Modeling

We introduce an improved variational autoencoder (VAE) for text modeling...
research
07/11/2019

retina-VAE: Variationally Decoding the Spectrum of Macular Disease

In this paper, we seek a clinically-relevant latent code for representin...
research
11/15/2021

An adaptive dimension reduction algorithm for latent variables of variational autoencoder

Constructed by the neural network, variational autoencoder has the overf...
research
08/24/2019

Scalable Modeling of Spatiotemporal Data using the Variational Autoencoder: an Application in Glaucoma

As big spatial data becomes increasingly prevalent, classical spatiotemp...

Please sign up or login with your details

Forgot password? Click here to reset