DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation

10/18/2022
by   Hanqing Zhang, et al.

Prompt learning with very large Causal Language Models (CLMs) has shown promise for attribute-controllable text generation (CTG). However, vanilla prompt tuning tends to imitate training-corpus characteristics beyond the control attributes, resulting in poor generalization. Moreover, it is less able to capture the relationship between different attributes, further limiting control performance. In this paper, we propose a new CTG approach, DisCup, which incorporates the attribute knowledge of a discriminator to optimize the control prompts, steering a frozen CLM to produce attribute-specific texts. Specifically, the frozen CLM, capable of producing a wide variety of texts, is first used to generate next-token candidates based on the context, so as to ensure the diversity of the tokens to be predicted. Then, we leverage an attribute discriminator to select desired and undesired tokens from those candidates, providing the inter-attribute knowledge. Finally, we bridge these two components with an unlikelihood objective for prompt tuning. Extensive experimental results show that DisCup achieves new state-of-the-art control performance while maintaining efficient, high-quality text generation, relying on only around 10 virtual tokens.
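
The three steps described in the abstract (candidate generation by the frozen CLM, candidate selection by the discriminator, and an unlikelihood objective) can be illustrated with a toy sketch in plain Python. This is not the paper's implementation: the function names, the top-k candidate rule, the 0.5 threshold, and the exact form of the loss are all assumptions made for illustration, and the probability lists stand in for real model outputs.

```python
import math

def top_k_candidates(frozen_probs, k):
    """Token ids of the k most probable next tokens under the frozen CLM.
    (Top-k selection is an assumption; the abstract only says 'candidates'.)"""
    order = sorted(range(len(frozen_probs)),
                   key=lambda i: frozen_probs[i], reverse=True)
    return order[:k]

def split_candidates(candidates, attr_scores, threshold=0.5):
    """Split candidates into desired/undesired tokens using hypothetical
    per-token discriminator scores in [0, 1]."""
    desired = [t for t in candidates if attr_scores[t] >= threshold]
    undesired = [t for t in candidates if attr_scores[t] < threshold]
    return desired, undesired

def unlikelihood_loss(prompted_probs, desired, undesired, eps=1e-8):
    """Likelihood term on desired tokens plus unlikelihood term on undesired
    tokens, evaluated on the prompt-conditioned distribution."""
    like = -sum(math.log(prompted_probs[t] + eps) for t in desired)
    unlike = -sum(math.log(1.0 - prompted_probs[t] + eps) for t in undesired)
    return like + unlike

# Toy vocabulary of four tokens (all numbers are made up).
frozen_probs = [0.4, 0.3, 0.2, 0.1]    # frozen CLM, no control prompt
attr_scores = [0.9, 0.2, 0.7, 0.1]     # hypothetical discriminator scores
prompted_probs = [0.5, 0.1, 0.3, 0.1]  # CLM conditioned on the control prompt

candidates = top_k_candidates(frozen_probs, k=3)
desired, undesired = split_candidates(candidates, attr_scores)
loss = unlikelihood_loss(prompted_probs, desired, undesired)
```

In an actual prompt-tuning setup, only the control-prompt embeddings would receive gradients from such a loss while the CLM weights stay frozen; the sketch only evaluates the objective for a single decoding step.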

research
06/01/2023

Focused Prefix Tuning for Controllable Text Generation

In a controllable text generation dataset, there exist unannotated attri...
research
04/25/2022

Which Discriminator for Cooperative Text Generation?

Language models generate texts by successively predicting probability di...
research
05/09/2022

CounterGeDi: A controllable approach to generate polite, detoxified and emotional counterspeech

Recently, many studies have tried to create generation models to assist ...
research
03/24/2022

Mix and Match: Learning-free Controllable Text Generation using Energy Language Models

Recent work on controlled text generation has either required attribute-...
research
05/06/2020

Token Manipulation Generative Adversarial Network for Text Generation

MaskGAN opens the query for the conditional language model by filling in...
research
10/19/2022

Language Detoxification with Attribute-Discriminative Latent Space

Transformer-based Language Models (LMs) achieve remarkable performances ...
research
09/14/2020

GeDi: Generative Discriminator Guided Sequence Generation

Class-conditional language models (CC-LMs) can be used to generate natur...
