InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training

02/08/2022
by   Zehua Chen, et al.
7

Denoising diffusion probabilistic models (diffusion models for short) require a large number of iterations in inference to achieve the generation quality that matches or surpasses the state-of-the-art generative models, which invariably results in slow inference speed. Previous approaches aim to optimize the choice of inference schedule over a few iterations to speed up inference. However, this results in reduced generation quality, mainly because the inference process is optimized separately, without jointly optimizing with the training process. In this paper, we propose InferGrad, a diffusion model for vocoder that incorporates inference process into training, to reduce the inference iterations while maintaining high generation quality. More specifically, during training, we generate data from random noise through a reverse process under inference schedules with a few iterations, and impose a loss to minimize the gap between the generated and ground-truth data samples. Then, unlike existing approaches, the training of InferGrad considers the inference process. The advantages of InferGrad are demonstrated through experiments on the LJSpeech dataset showing that InferGrad achieves better voice quality than the baseline WaveGrad under same conditions while maintaining the same voice quality as the baseline but with 3x speedup (2 iterations for InferGrad vs 6 iterations for WaveGrad).

READ FULL TEXT
research
06/15/2023

OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models

Diffusion probabilistic models (DPMs) are a new class of generative mode...
research
05/16/2023

Selective Guidance: Are All the Denoising Steps of Guided Diffusion Important?

This study examines the impact of optimizing the Stable Diffusion (SD) g...
research
08/28/2023

Voice Conversion with Denoising Diffusion Probabilistic GAN Models

Voice conversion is a method that allows for the transformation of speak...
research
07/17/2023

Autoregressive Diffusion Model for Graph Generation

Diffusion-based graph generative models have recently obtained promising...
research
02/14/2023

Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions

Diffusion-based generative models (DBGMs) perturb data to a target noise...
research
02/05/2023

ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval

Diffusion models show promising generation capability for a variety of d...
research
02/13/2023

Preconditioned Score-based Generative Models

Score-based generative models (SGMs) have recently emerged as a promisin...

Please sign up or login with your details

Forgot password? Click here to reset