DeepAI AI Chat
Log In Sign Up

Actively Learning what makes a Discrete Sequence Valid

by   David Janz, et al.

Deep learning techniques have been hugely successful for traditional supervised and unsupervised machine learning problems. In large part, these techniques solve continuous optimization problems. Recently however, discrete generative deep learning models have been successfully used to efficiently search high-dimensional discrete spaces. These methods work by representing discrete objects as sequences, for which powerful sequence-based deep models can be employed. Unfortunately, these techniques are significantly hindered by the fact that these generative models often produce invalid sequences. As a step towards solving this problem, we propose to learn a deep recurrent validator model. Given a partial sequence, our model learns the probability of that sequence occurring as the beginning of a full valid sequence. Thus this identifies valid versus invalid sequences and crucially it also provides insight about how individual sequence elements influence the validity of discrete objects. To learn this model we propose an approach inspired by seminal work in Bayesian active learning. On a synthetic dataset, we demonstrate the ability of our model to distinguish valid and invalid sequences. We believe this is a key step toward learning generative models that faithfully produce valid discrete objects.


page 1

page 2

page 3

page 4


Learning a Generative Model for Validity in Complex Discrete Structures

Deep generative models have been successfully used to learn representati...

Discrete Object Generation with Reversible Inductive Construction

The success of generative modeling in continuous domains has led to a su...

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent ...

Dirichlet Diffusion Score Model for Biological Sequence Generation

Designing biological sequences is an important challenge that requires s...

An online sequence-to-sequence model for noisy speech recognition

Generative models have long been the dominant approach for speech recogn...

Lempel-Ziv Networks

Sequence processing has long been a central area of machine learning res...