DeepAI AI Chat
Log In Sign Up

Actively Learning what makes a Discrete Sequence Valid

08/15/2017
by   David Janz, et al.
0

Deep learning techniques have been hugely successful for traditional supervised and unsupervised machine learning problems. In large part, these techniques solve continuous optimization problems. Recently however, discrete generative deep learning models have been successfully used to efficiently search high-dimensional discrete spaces. These methods work by representing discrete objects as sequences, for which powerful sequence-based deep models can be employed. Unfortunately, these techniques are significantly hindered by the fact that these generative models often produce invalid sequences. As a step towards solving this problem, we propose to learn a deep recurrent validator model. Given a partial sequence, our model learns the probability of that sequence occurring as the beginning of a full valid sequence. Thus this identifies valid versus invalid sequences and crucially it also provides insight about how individual sequence elements influence the validity of discrete objects. To learn this model we propose an approach inspired by seminal work in Bayesian active learning. On a synthetic dataset, we demonstrate the ability of our model to distinguish valid and invalid sequences. We believe this is a key step toward learning generative models that faithfully produce valid discrete objects.

READ FULL TEXT

page 1

page 2

page 3

page 4

12/05/2017

Learning a Generative Model for Validity in Complex Discrete Structures

Deep generative models have been successfully used to learn representati...
07/18/2019

Discrete Object Generation with Reversible Inductive Construction

The success of generative modeling in continuous domains has led to a su...
03/06/2017

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent ...
05/18/2023

Dirichlet Diffusion Score Model for Biological Sequence Generation

Designing biological sequences is an important challenge that requires s...
06/16/2017

An online sequence-to-sequence model for noisy speech recognition

Generative models have long been the dominant approach for speech recogn...
11/23/2022

Lempel-Ziv Networks

Sequence processing has long been a central area of machine learning res...