A Distributional Approach to Controlled Text Generation

12/21/2020
by   Muhammad Khalifa, et al.
0

We propose a Distributional Approach to address Controlled Text Generation from pre-trained Language Models (LMs). This view permits to define, in a single formal framework, "pointwise" and "distributional" constraints over the target LM – to our knowledge, this is the first approach with such generality – while minimizing KL divergence with the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train the target controlled autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM (GPT-2). We then perform experiments over distributional constraints, a unique feature of our approach, demonstrating its potential as a remedy to the problem of Bias in Language Models. Through an ablation study we show the effectiveness of our adaptive technique for obtaining faster convergence.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2021

Energy-Based Models for Code Generation under Compilability Constraints

Neural language models can be successfully trained on source code, leadi...
research
02/16/2023

Aligning Language Models with Preferences through f-divergence Minimization

Aligning language models with preferences can be posed as approximating ...
research
04/15/2023

Tractable Control for Autoregressive Language Generation

Despite the success of autoregressive large language models in text gene...
research
12/13/2021

Controlled Cue Generation for Play Scripts

In this paper, we use a large-scale play scripts dataset to propose the ...
research
10/11/2012

Distributional Framework for Emergent Knowledge Acquisition and its Application to Automated Document Annotation

The paper introduces a framework for representation and acquisition of k...
research
05/19/2023

BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

Energy-based models (EBMs) have gained popularity for controlled text ge...
research
06/05/2023

Structured Voronoi Sampling

Recently, there has been a growing interest in the development of gradie...

Please sign up or login with your details

Forgot password? Click here to reset