TaylorGAN: Neighbor-Augmented Policy Update for Sample-Efficient Natural Language Generation

11/27/2020
by   Chun-Hsing Lin, et al.
0

Score function-based natural language generation (NLG) approaches such as REINFORCE, in general, suffer from low sample efficiency and training instability problems. This is mainly due to the non-differentiable nature of the discrete space sampling and thus these methods have to treat the discriminator as a black box and ignore the gradient information. To improve the sample efficiency and reduce the variance of REINFORCE, we propose a novel approach, TaylorGAN, which augments the gradient estimation by off-policy update and the first-order Taylor expansion. This approach enables us to train NLG models from scratch with smaller batch size – without maximum likelihood pre-training, and outperforms existing GAN-based methods on multiple metrics of quality and diversity. The source code and data are available at https://github.com/MiuLab/TaylorGAN

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2019

Training language GANs from Scratch

Generative Adversarial Networks (GANs) enjoy great success at image gene...
research
06/07/2021

Differentiable Quality Diversity

Quality diversity (QD) is a growing branch of stochastic optimization re...
research
01/06/2022

SABLAS: Learning Safe Control for Black-box Dynamical Systems

Control certificates based on barrier functions have been a powerful too...
research
07/05/2023

PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records

This paper describes PULSAR, our system submission at the ImageClef 2023...
research
07/06/2018

Memory Augmented Policy Optimization for Program Synthesis with Generalization

This paper presents Memory Augmented Policy Optimization (MAPO): a novel...
research
06/05/2023

Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences

We study the problem of optimizing biological sequences, e.g., proteins,...
research
06/01/2021

Exploring Dynamic Selection of Branch Expansion Orders for Code Generation

Due to the great potential in facilitating software development, code ge...

Please sign up or login with your details

Forgot password? Click here to reset