Structured Voronoi Sampling

06/05/2023
by   Afra Amini, et al.
0

Recently, there has been a growing interest in the development of gradient-based sampling algorithms for text generation, especially in the context of controlled generation. However, there exists a lack of theoretically grounded and principled approaches for this task. In this paper, we take an important step toward building a principled approach for sampling from language models with gradient-based methods. We use discrete distributions given by language models to define densities and develop an algorithm based on Hamiltonian Monte Carlo to sample from them. We name our gradient-based technique Structured Voronoi Sampling (SVS). In an experimental setup where the reference distribution is known, we show that the empirical distribution of SVS samples is closer to the reference distribution compared to alternative sampling schemes. Furthermore, in a controlled generation task, SVS is able to generate fluent and diverse samples while following the control targets significantly better than other methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2019

Co-Generation with GANs using AIS based HMC

Inferring the most likely configuration for a subset of variables of a j...
research
06/04/2021

Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis–Hastings

While recent work has shown that scores from models trained by the ubiqu...
research
12/10/2021

Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs

Energy-Based Models (EBMs) allow for extremely flexible specifications o...
research
09/05/2021

SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks

Transformer-based pre-trained language models boost the performance of o...
research
06/20/2022

A Langevin-like Sampler for Discrete Distributions

We propose discrete Langevin proposal (DLP), a simple and scalable gradi...
research
12/21/2020

A Distributional Approach to Controlled Text Generation

We propose a Distributional Approach to address Controlled Text Generati...
research
09/22/2022

Training neural network ensembles via trajectory sampling

In machine learning, there is renewed interest in neural network ensembl...

Please sign up or login with your details

Forgot password? Click here to reset