UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model

06/01/2023
by Anastasiia Iashchenko, et al.

This paper introduces UnDiff, a diffusion probabilistic model capable of solving various speech inverse tasks. Trained once for unconditional speech waveform generation, it can be adapted to different tasks, including degradation inversion, neural vocoding, and source separation. We first tackle the challenging problem of unconditional waveform generation by comparing different neural architectures and preconditioning domains. We then demonstrate how the trained unconditional diffusion model can be adapted to different speech processing tasks by means of recent developments in post-training conditioning of diffusion models. Finally, we evaluate the proposed technique on bandwidth extension, declipping, vocoding, and speech source separation, and compare it to the baselines. The code will be released soon.
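
To make the notion of a speech inverse task concrete, the sketch below writes three of the tasks named in the abstract as forward degradations of a clean waveform: clipping (inverted by declipping), low-pass filtering (inverted by bandwidth extension), and summation of sources (inverted by source separation). It is an illustration only and is not taken from the paper: the function names, the ideal FFT-mask low-pass filter, the test tones, and all parameter values are assumptions chosen for this example.

```python
import numpy as np

# Illustrative forward (degradation) operators; all names are hypothetical.

def clip_degradation(x, threshold):
    """Hard-clip a waveform to [-threshold, threshold]."""
    return np.clip(x, -threshold, threshold)

def lowpass_degradation(x, sample_rate, cutoff_hz):
    """Remove all frequency content above cutoff_hz with an ideal FFT mask."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / sample_rate)
    spectrum[freqs > cutoff_hz] = 0.0
    return np.fft.irfft(spectrum, n=len(x))

def mix_degradation(sources):
    """Sum several source waveforms into a single mixture."""
    return np.sum(np.stack(sources), axis=0)

# Toy "clean" signal: one second of two sinusoids (assumed values).
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
low = 0.5 * np.sin(2 * np.pi * 440 * t)
high = 0.5 * np.sin(2 * np.pi * 6000 * t)
clean = low + high

clipped = clip_degradation(clean, threshold=0.3)
narrowband = lowpass_degradation(clean, sample_rate, cutoff_hz=4000)
mixture = mix_degradation([low, high])
```

In each case the restoration task is to recover a plausible clean waveform from the degraded observation (`clipped`, `narrowband`, or `mixture`); the operators above only describe what is observed, not how the model inverts it.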

research
11/27/2019

Music Source Separation in the Waveform Domain

Source separation for music is the task of isolating contributions, or s...
research
02/13/2015

Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation

Monaural source separation is important for many real world applications...
research
01/25/2023

Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation

The problem of speech separation, also known as the cocktail party probl...
research
11/04/2022

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

Diffusion-based generative models have had a high impact on the computer...
research
09/21/2020

DiffWave: A Versatile Diffusion Model for Audio Synthesis

In this work, we propose DiffWave, a versatile Diffusion probabilistic m...
research
05/10/2023

Diffusion-based Signal Refiner for Speech Separation

We have developed a diffusion-based speech refiner that improves the ref...
research
10/27/2022

Solving Audio Inverse Problems with a Diffusion Model

This paper presents CQT-Diff, a data-driven generative audio model that ...
