Actively Learning what makes a Discrete Sequence Valid

08/15/2017
by   David Janz, et al.
0

Deep learning techniques have been hugely successful for traditional supervised and unsupervised machine learning problems. In large part, these techniques solve continuous optimization problems. Recently however, discrete generative deep learning models have been successfully used to efficiently search high-dimensional discrete spaces. These methods work by representing discrete objects as sequences, for which powerful sequence-based deep models can be employed. Unfortunately, these techniques are significantly hindered by the fact that these generative models often produce invalid sequences. As a step towards solving this problem, we propose to learn a deep recurrent validator model. Given a partial sequence, our model learns the probability of that sequence occurring as the beginning of a full valid sequence. Thus this identifies valid versus invalid sequences and crucially it also provides insight about how individual sequence elements influence the validity of discrete objects. To learn this model we propose an approach inspired by seminal work in Bayesian active learning. On a synthetic dataset, we demonstrate the ability of our model to distinguish valid and invalid sequences. We believe this is a key step toward learning generative models that faithfully produce valid discrete objects.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2017

Learning a Generative Model for Validity in Complex Discrete Structures

Deep generative models have been successfully used to learn representati...
research
07/18/2019

Discrete Object Generation with Reversible Inductive Construction

The success of generative modeling in continuous domains has led to a su...
research
03/06/2017

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent ...
research
06/16/2017

An online sequence-to-sequence model for noisy speech recognition

Generative models have long been the dominant approach for speech recogn...
research
04/18/2018

Deep Generative Networks For Sequence Prediction

This thesis investigates unsupervised time series representation learnin...
research
11/23/2022

Lempel-Ziv Networks

Sequence processing has long been a central area of machine learning res...
research
12/03/2018

Towards Solving Text-based Games by Producing Adaptive Action Spaces

To solve a text-based game, an agent needs to formulate valid text comma...

Please sign up or login with your details

Forgot password? Click here to reset