Grammar Variational Autoencoder

03/06/2017
by Matt J. Kusner, et al.

Deep generative models have been wildly successful at learning coherent latent representations for continuous data such as video and audio. However, generative modeling of discrete data such as arithmetic expressions and molecular structures still poses significant challenges. Crucially, state-of-the-art methods often produce outputs that are not valid. We make the key observation that frequently, discrete data can be represented as a parse tree from a context-free grammar. We propose a variational autoencoder which encodes and decodes directly to and from these parse trees, ensuring the generated outputs are always valid. Surprisingly, we show that not only does our model more often generate valid outputs, it also learns a more coherent latent space in which nearby points decode to similar discrete outputs. We demonstrate the effectiveness of our learned models by showing their improved performance in Bayesian optimization for symbolic regression and molecular synthesis.
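
To make the idea of decoding to a parse tree concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a toy arithmetic grammar and a hypothetical `decode` helper; the per-step `scores` stand in for whatever a trained decoder network would emit. At each step only the production rules whose left-hand side matches the nonterminal on top of a stack are eligible, so any derivation that finishes is a valid expression by construction.

```python
# Toy context-free grammar for arithmetic expressions (an illustrative
# assumption, not the grammar used in the paper).
RULES = [
    ("S", ["S", "+", "T"]),  # 0
    ("S", ["T"]),            # 1
    ("T", ["T", "*", "F"]),  # 2
    ("T", ["F"]),            # 3
    ("F", ["(", "S", ")"]),  # 4
    ("F", ["x"]),            # 5
    ("F", ["1"]),            # 6
]
NONTERMINALS = {lhs for lhs, _ in RULES}


def flush_terminals(stack, output):
    """Move terminals from the top of the stack to the output."""
    while stack and stack[-1] not in NONTERMINALS:
        output.append(stack.pop())


def decode(scores):
    """Masked leftmost derivation driven by per-step rule scores.

    `scores` is a list of rows, one score per rule in RULES (hypothetical
    decoder output). Returns the derived string, or None if nonterminals
    remain after all rows are consumed.
    """
    stack, output = ["S"], []
    for row in scores:
        flush_terminals(stack, output)
        if not stack:
            break  # derivation already complete
        top = stack.pop()
        # Mask: only rules that expand the current nonterminal are allowed.
        allowed = [i for i, (lhs, _) in enumerate(RULES) if lhs == top]
        best = max(allowed, key=lambda i: row[i])
        # Push the right-hand side reversed so its first symbol is on top.
        stack.extend(reversed(RULES[best][1]))
    flush_terminals(stack, output)
    return "".join(output) if not stack else None


def one_hot(rule_ids):
    """Encode a rule sequence as score rows (the encoder-side view)."""
    return [[1.0 if i == r else 0.0 for i in range(len(RULES))]
            for r in rule_ids]


# The expression x+1 as its sequence of production rules.
print(decode(one_hot([0, 1, 3, 5, 3, 6])))  # x+1
```

Even when a row puts its highest score on a rule that does not apply, the mask redirects the choice to an applicable rule, which is why completed outputs cannot be syntactically invalid. A derivation can still run out of steps, which this sketch reports as `None`.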

research
12/10/2019

Learning to generate new indoor scenes

Deep generative models have been used in recent years to learn coherent ...
research
02/24/2018

Syntax-Directed Variational Autoencoder for Structured Data

Deep generative models have been enjoying success in modeling continuous...
research
09/08/2018

Molecular Hypergraph Grammar with its Application to Molecular Optimization

This paper is concerned with a molecular optimization framework using va...
research
08/15/2017

Actively Learning what makes a Discrete Sequence Valid

Deep learning techniques have been hugely successful for traditional sup...
research
12/05/2017

Learning a Generative Model for Validity in Complex Discrete Structures

Deep generative models have been successfully used to learn representati...
research
03/30/2020

AriEL: volume coding for sentence generation

Mapping sequences of discrete data to a point in a continuous space make...
research
09/13/2021

Deep Generative Models to Extend Active Directory Graphs with Honeypot Users

Active Directory (AD) is a crucial element of large organizations, given...
