Full-band General Audio Synthesis with Score-based Diffusion

10/26/2022
by   Santiago Pascual, et al.
0

Recent works have shown the capability of deep generative models to tackle general audio synthesis from a single label, producing a variety of impulsive, tonal, and environmental sounds. Such models operate on band-limited signals and, as a result of an autoregressive approach, they are typically conformed by pre-trained latent encoders and/or several cascaded modules. In this work, we propose a diffusion-based generative model for general audio synthesis, named DAG, which deals with full-band signals end-to-end in the waveform domain. Results show the superiority of DAG over existing label-conditioned generators in terms of both quality and diversity. More specifically, when compared to the state of the art, the band-limited and full-band versions of DAG achieve relative improvements that go up to 40 and 65 flexible enough to accommodate different conditioning schemas while providing good quality synthesis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2023

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

Deep generative models can generate high-fidelity audio conditioned on v...
research
11/09/2021

RAVE: A variational autoencoder for fast and high-quality neural audio synthesis

Deep generative models applied to audio have improved by a large margin ...
research
09/21/2020

DiffWave: A Versatile Diffusion Model for Audio Synthesis

In this work, we propose DiffWave, a versatile Diffusion probabilistic m...
research
09/27/2018

Conditional WaveGAN

Generative models are successfully used for image synthesis in the recen...
research
09/02/2022

Diffusion Models: A Comprehensive Survey of Methods and Applications

Diffusion models are a class of deep generative models that have shown i...
research
05/30/2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis

Binaural audio plays a significant role in constructing immersive augmen...
research
10/08/2022

STaSy: Score-based Tabular data Synthesis

Tabular data synthesis is a long-standing research topic in machine lear...

Please sign up or login with your details

Forgot password? Click here to reset