SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

09/18/2016
by Lantao Yu, et al.

As a new way of training generative models, Generative Adversarial Nets (GAN), which use a discriminative model to guide the training of the generative model, have enjoyed considerable success in generating real-valued data. However, GANs have limitations when the goal is to generate sequences of discrete tokens. A major reason is that the discrete outputs of the generative model make it difficult to pass the gradient update from the discriminative model to the generative model. Also, the discriminative model can only assess a complete sequence, while for a partially generated sequence it is non-trivial to balance its current score against the future score once the entire sequence has been generated. In this paper, we propose a sequence generation framework, called SeqGAN, to solve these problems. Modeling the data generator as a stochastic policy in reinforcement learning (RL), SeqGAN bypasses the generator differentiation problem by directly performing policy gradient updates. The RL reward signal comes from the GAN discriminator's judgment of a complete sequence, and is passed back to the intermediate state-action steps using Monte Carlo search. Extensive experiments on synthetic data and real-world tasks demonstrate significant improvements over strong baselines.
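The two ideas in the abstract, a policy gradient update for the generator and Monte Carlo rollouts that turn a whole-sequence discriminator score into per-step rewards, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the paper's implementation: the generator is a context-free softmax over a three-token vocabulary, `toy_discriminator` is a fixed hand-written scorer standing in for a trained discriminator, and the names `VOCAB`, `SEQ_LEN`, `rollout_reward`, and `policy_gradient_step` are hypothetical.

```python
import math
import random

VOCAB = 3      # hypothetical toy vocabulary size
SEQ_LEN = 4    # hypothetical fixed sequence length


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(theta, rng):
    """Sample one token from the generator policy.

    Assumption: the policy ignores the prefix (context-free) to keep the sketch short.
    """
    return rng.choices(range(VOCAB), weights=softmax(theta))[0]


def toy_discriminator(seq):
    """Stand-in for a trained discriminator; scores a COMPLETE sequence in [0, 1].

    Assumption: "real" data consists mostly of token 0.
    """
    return sum(1 for t in seq if t == 0) / len(seq)


def rollout_reward(prefix, theta, rng, n_rollouts=16):
    """Monte Carlo estimate of the reward for a partial sequence.

    The prefix is completed n_rollouts times with the current policy and
    the discriminator scores of the completed sequences are averaged.
    """
    if len(prefix) == SEQ_LEN:
        return toy_discriminator(prefix)
    total = 0.0
    for _ in range(n_rollouts):
        seq = list(prefix)
        while len(seq) < SEQ_LEN:
            seq.append(sample_token(theta, rng))
        total += toy_discriminator(seq)
    return total / n_rollouts


def policy_gradient_step(theta, rng, lr=0.5):
    """One REINFORCE-style update using rollout rewards at every step."""
    seq = []
    grad = [0.0] * VOCAB
    for _ in range(SEQ_LEN):
        probs = softmax(theta)
        tok = sample_token(theta, rng)
        seq.append(tok)
        q = rollout_reward(seq, theta, rng)
        # Gradient of log softmax w.r.t. the logits: one_hot(tok) - probs.
        for k in range(VOCAB):
            grad[k] += q * ((1.0 if k == tok else 0.0) - probs[k])
    return [t + lr * g for t, g in zip(theta, grad)]


def train(steps=200, seed=0):
    """Run repeated policy gradient steps from a uniform initial policy."""
    rng = random.Random(seed)
    theta = [0.0] * VOCAB
    for _ in range(steps):
        theta = policy_gradient_step(theta, rng)
    return theta
```

Calling `softmax(train())` should show the policy shifting probability toward the token the scorer prefers. Sampling a discrete token is not differentiable, yet the update needs only the gradient of the log-probability of the sampled token weighted by the reward, which is how the policy gradient view sidesteps the differentiation problem. In adversarial training the discriminator would be a learned model updated alongside the generator; it is held fixed here only to keep the sketch short.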
