VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance

09/13/2023
by   Carlos Hernandez-Olivan, et al.
0

Restoring degraded music signals is essential to enhance audio quality for downstream music manipulation. Recent diffusion-based music restoration methods have demonstrated impressive performance, and among them, diffusion posterior sampling (DPS) stands out given its intrinsic properties, making it versatile across various restoration tasks. In this paper, we identify that there are potential issues which will degrade current DPS-based methods' performance and introduce the way to mitigate the issues inspired by diverse diffusion guidance techniques including the RePaint (RP) strategy and the Pseudoinverse-Guided Diffusion Models (ΠGDM). We demonstrate our methods for the vocal declipping and bandwidth extension tasks under various levels of distortion and cutoff frequency, respectively. In both tasks, our methods outperform the current DPS-based music restoration benchmarks. We refer to <http://carlosholivan.github.io/demos/audio-restoration-2023.html> for examples of the restored audio samples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2022

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

Diffusion-based generative models have had a high impact on the computer...
research
11/08/2022

Unsupervised vocal dereverberation with diffusion-based generative models

Removing reverb from reverberant music is a necessary technique to clean...
research
04/03/2023

Generative Diffusion Prior for Unified Image Restoration and Enhancement

Existing image restoration methods mostly leverage the posterior distrib...
research
09/19/2023

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

Exploiting pre-trained diffusion models for restoration has recently bec...
research
05/23/2023

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

Latest diffusion-based methods for many image restoration tasks outperfo...
research
06/02/2023

Zero-Shot Blind Audio Bandwidth Extension

Audio bandwidth extension involves the realistic reconstruction of high-...
research
05/02/2019

Psychoacoustically Motivated Declipping Based on Weighted l1 Minimization

A novel method for audio declipping based on sparsity is presented. The ...

Please sign up or login with your details

Forgot password? Click here to reset