Syntax-Directed Variational Autoencoder for Structured Data

02/24/2018
by   Hanjun Dai, et al.
0

Deep generative models have been enjoying success in modeling continuous data. However it remains challenging to capture the representations for discrete structures with formal grammars and semantics, e.g., computer programs and molecular structures. How to generate both syntactically and semantically correct data still remains largely an open problem. Inspired by the theory of compiler where the syntax and semantics check is done via syntax-directed translation (SDT), we propose a novel syntax-directed variational autoencoder (SD-VAE) by introducing stochastic lazy attributes. This approach converts the offline SDT check into on-the-fly generated guidance for constraining the decoder. Comparing to the state-of-the-art methods, our approach enforces constraints on the output space so that the output will be not only syntactically valid, but also semantically reasonable. We evaluate the proposed model with applications in programming language and molecules, including reconstruction and program/molecule optimization. The results demonstrate the effectiveness in incorporating syntactic and semantic constraints in discrete generative models, which is significantly better than current state-of-the-art approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/06/2017

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent ...
research
04/24/2019

D-VAE: A Variational Autoencoder for Directed Acyclic Graphs

Graph structured data are abundant in the real world. Among different gr...
research
08/27/2019

Text Modeling with Syntax-Aware Variational Autoencoders

Syntactic information contains structures and rules about how text sente...
research
06/05/2019

Syntax-Infused Variational Autoencoder for Text Generation

We present a syntax-infused variational autoencoder (SIVAE), that integr...
research
06/24/2019

SampleFix: Learning to Correct Programs by Sampling Diverse Fixes

Automatic program correction is an active topic of research, which holds...
research
09/07/2018

Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders

Deep generative models have achieved remarkable success in various data ...
research
12/05/2017

Learning a Generative Model for Validity in Complex Discrete Structures

Deep generative models have been successfully used to learn representati...

Please sign up or login with your details

Forgot password? Click here to reset