Efficient Reinforcement Learning for Unsupervised Controlled Text Generation

04/16/2022
by Bhargav Upadhyay, et al.

Controlled text generation tasks such as unsupervised text style transfer have increasingly adopted Reinforcement Learning (RL). A major challenge in applying RL to such tasks is the sparse reward, which is available only after the full text is generated. Sparse rewards, combined with a large action space, make RL training sample-inefficient and difficult to converge. Recently proposed reward-shaping strategies to address this issue have shown only negligible gains. In contrast, this work proposes a novel approach that provides dense rewards to each generated token. We evaluate our approach on unsupervised text style transfer. Averaged across datasets, our style transfer system improves upon current state-of-the-art systems by 21% on human evaluation and 12% on automatic evaluation. In an ablated comparison with the current reward-shaping approach (the 'roll-out strategy'), using dense rewards improves the overall style transfer quality by 22% based on human evaluation. Further, the RL training is 2.5 times as sample-efficient and 7 times faster.
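
To make the sparse-versus-dense distinction concrete, the following is a minimal sketch, not the paper's method. It assumes a toy 4-token output, an undiscounted return, and hand-picked per-token scores; the function names and the reward values are hypothetical and chosen only for illustration. It shows that a reward given only at the end of generation assigns every token the same return, while per-token rewards give each position a different learning signal.

```python
from typing import List


def returns_to_go(rewards: List[float], gamma: float = 1.0) -> List[float]:
    """Discounted sum of rewards from each position to the end of the sequence."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def sparse_rewards(num_tokens: int, final_score: float) -> List[float]:
    """Sparse scheme: zero reward everywhere except the last generated token."""
    return [0.0] * (num_tokens - 1) + [final_score]


# Hypothetical values: a sentence-level score of 1.0, and per-token scores
# that sum to the same total. Neither comes from the paper.
sparse = sparse_rewards(num_tokens=4, final_score=1.0)
dense = [0.25, 0.5, 0.125, 0.125]

sparse_returns = returns_to_go(sparse)  # identical signal at every token
dense_returns = returns_to_go(dense)    # signal varies by token position

print("sparse:", sparse_returns)
print("dense: ", dense_returns)
```

In this toy setting both schemes have the same total reward, but only the dense one distinguishes the second token (which received the largest hypothetical score) from the others. How the per-token rewards are actually computed is described in the full text, not in this abstract.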


research
05/11/2020

Reinforced Rewards Framework for Text Style Transfer

Style transfer deals with the algorithms to transfer the stylistic prope...
research
05/25/2022

RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning

Prompting has shown impressive success in enabling large pretrained lang...
research
05/07/2020

Learning Implicit Text Generation via Feature Matching

Generative feature matching network (GFMN) is an approach for training i...
research
09/19/2023

Specializing Small Language Models towards Complex Style Transfer via Latent Attribute Pre-Training

In this work, we introduce the concept of complex text style transfer ta...
research
12/20/2022

SimpleStyle: An Adaptable Style Transfer Approach

Attribute-controlled text rewriting, also known as text style-transfer, ...
research
10/06/2020

Plug and Play Autoencoders for Conditional Text Generation

Text autoencoders are commonly used for conditional generation tasks suc...
research
11/02/2018

Sequence Generation with Guider Network

Sequence generation with reinforcement learning (RL) has received signif...
