Self-conditioned Embedding Diffusion for Text Generation

11/08/2022
by   Robin Strudel, et al.
1

Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as is standard in language modeling. We propose Self-conditioned Embedding Diffusion, a continuous diffusion mechanism that operates on token embeddings and allows to learn flexible and scalable diffusion models for both conditional and unconditional text generation. Through qualitative and quantitative evaluation, we show that our text diffusion models generate samples comparable with those produced by standard autoregressive language models - while being in theory more efficient on accelerator hardware at inference time. Our work paves the way for scaling up diffusion models for text, similarly to autoregressive models, and for improving performance with recent refinements to continuous diffusion.

READ FULL TEXT

page 5

page 7

page 8

page 9

page 13

page 15

research
05/15/2023

TESS: Text-to-Text Self-Conditioned Simplex Diffusion

Diffusion models have emerged as a powerful paradigm for generation, obt...
research
04/25/2023

RenderDiffusion: Text Generation as Image Generation

Diffusion models have become a new generative paradigm for text generati...
research
05/16/2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

Diffusion models have gained significant attention in the realm of image...
research
12/13/2021

Step-unrolled Denoising Autoencoders for Text Generation

In this paper we propose a new generative model of text, Step-unrolled D...
research
03/28/2023

Visual Chain-of-Thought Diffusion Models

Recent progress with conditional image diffusion models has been stunnin...
research
05/27/2022

Diffusion-LM Improves Controllable Text Generation

Controlling the behavior of language models (LMs) without re-training is...
research
08/08/2022

Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning

We present Bit Diffusion: a simple and generic approach for generating d...

Please sign up or login with your details

Forgot password? Click here to reset