Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!

05/08/2023
by Zecheng Tang, et al.

Diffusion models have been successfully adapted to text generation by mapping discrete text into a continuous space. However, non-negligible gaps remain between training and inference because the forward process is absent during inference: the model predicts from the noise it generated in previous reverse steps rather than from noise computed by the forward process. In addition, the downsampling strategy widely used to speed up inference causes a mismatch between the diffusion trajectories seen in training and those followed at inference. To understand and mitigate these two types of training-inference discrepancy, we conduct a thorough preliminary study. Based on our observations, we propose two simple yet effective methods to bridge these gaps, named Distance Penalty and Adaptive Decay Sampling. Extensive experiments on 6 generation tasks confirm the superiority of our methods, which achieve a 100× to 200× speedup with better performance.
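
To make the two discrepancies concrete, here is a minimal sketch of a standard Gaussian forward process and of uniform timestep downsampling. It is not the authors' Distance Penalty or Adaptive Decay Sampling, which the abstract does not specify. The linear beta schedule, the values T = 2000 and K = 20, and all function names are illustrative assumptions.

```python
import math
import random

def linear_alpha_bars(T, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear beta schedule (assumed schedule)."""
    alpha_bars, prod = [], 1.0
    for t in range(T):
        beta = beta_start + (beta_end - beta_start) * t / (T - 1)
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return alpha_bars

def forward_noise(x0, t, alpha_bars, rng):
    """Training-time forward process: x_t = sqrt(a_bar_t) * x0 + sqrt(1 - a_bar_t) * eps.

    At inference x0 is unknown, so this noising step is unavailable and the model
    must instead continue from its own previous reverse-step output.
    """
    a = alpha_bars[t]
    return [math.sqrt(a) * v + math.sqrt(1.0 - a) * rng.gauss(0.0, 1.0) for v in x0]

def uniform_downsample(T, K):
    """Pick K evenly spaced timesteps out of T, ordered from noisiest to cleanest."""
    stride = T // K
    return list(range(T - 1, -1, -stride))[:K]

# Hypothetical settings: T training steps, K inference steps.
T, K = 2000, 20
alpha_bars = linear_alpha_bars(T)
steps = uniform_downsample(T, K)      # inference visits only these timesteps
x_t = forward_noise([0.5, -1.0, 0.25], steps[0], alpha_bars, random.Random(0))
speedup = T / K                       # ratio of training steps to inference steps
```

Under these assumed settings, inference visits only every 100th timestep, so each reverse step jumps across many of the noise levels that adjacent training steps cover. That is the trajectory mismatch the abstract refers to.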


