Beyond Griffin-Lim: Improved Iterative Phase Retrieval for Speech

05/11/2022
by Tal Peer, et al.

Phase retrieval is a problem encountered not only in speech and audio processing, but in many other fields such as optics. Iterative algorithms based on non-convex set projections are effective and frequently used for retrieving the phase when only STFT magnitudes are available. While the basic Griffin-Lim algorithm and its variants have been the prevalent method for decades, more recent advances, e.g. in optics, raise the question: Can we do better than Griffin-Lim for speech signals, using the same principle of iterative projection? In this paper we compare the classical algorithms in the speech domain with two modern methods from optics with respect to reconstruction quality and convergence rate. Based on this study, we propose to combine Griffin-Lim with the Difference Map algorithm in a hybrid approach which shows superior results, in terms of both convergence and quality of the final reconstruction.
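
For readers unfamiliar with the iterative projection principle mentioned in the abstract, here is a minimal sketch of the basic Griffin-Lim algorithm as alternating projections. It is not the hybrid method proposed in the paper, and the Difference Map update is not shown. The helper names (`stft`, `istft`, `griffin_lim`), the Hann window, the hop size, and the test signal are all illustrative assumptions, not taken from the paper.

```python
import numpy as np

def stft(x, win, hop):
    """Frame x, apply the window, and take the FFT of each frame."""
    n = len(win)
    starts = range(0, len(x) - n + 1, hop)
    return np.array([np.fft.fft(win * x[s:s + n]) for s in starts])

def istft(X, win, hop, length):
    """Least-squares inverse STFT via weighted overlap-add."""
    n = len(win)
    x = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(X):
        s = i * hop
        x[s:s + n] += win * np.real(np.fft.ifft(frame))
        norm[s:s + n] += win ** 2
    # Samples with zero window weight have a zero numerator as well.
    return x / np.maximum(norm, 1e-12)

def griffin_lim(mag, win, hop, length, n_iter=50, seed=0):
    """Alternate between the consistency and magnitude projections."""
    rng = np.random.default_rng(seed)
    # Start from the target magnitudes with random phase (an assumption;
    # other initializations are possible).
    X = mag * np.exp(2j * np.pi * rng.random(mag.shape))
    errors = []
    for _ in range(n_iter):
        # Projection onto the set of consistent spectrograms:
        # go to the time domain and back.
        x = istft(X, win, hop, length)
        C = stft(x, win, hop)
        # Relative mismatch between current and target magnitudes.
        errors.append(np.linalg.norm(np.abs(C) - mag) / np.linalg.norm(mag))
        # Projection onto the set of spectrograms with the target
        # magnitudes: keep the phase, replace the magnitude.
        X = mag * np.exp(1j * np.angle(C))
    return x, errors

# Hypothetical test signal: two sinusoids, 512 samples.
t = np.arange(512) / 512
signal = np.sin(2 * np.pi * 40 * t) + 0.5 * np.sin(2 * np.pi * 90 * t)
win = np.hanning(64)
mag = np.abs(stft(signal, win, 16))
estimate, errors = griffin_lim(mag, win, 16, len(signal))
print(errors[0], errors[-1])
```

Because the inverse here is the least-squares one, the magnitude mismatch recorded in `errors` should not increase from one iteration to the next. The result is only a local solution, though, and the recovered waveform can differ from the original, for example by a global sign flip.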

research
09/13/2023

A Flexible Online Framework for Projection-Based STFT Phase Retrieval

Several recent contributions in the field of iterative STFT phase retrie...
research
07/11/2020

Fast Griffin Lim based Waveform Generation Strategy for Text-to-Speech Synthesis

The performance of text-to-speech (TTS) systems heavily depends on spect...
research
11/08/2022

DiffPhase: Generative Diffusion-based STFT Phase Retrieval

Diffusion probabilistic models have been recently used in a variety of t...
research
10/02/2019

Iterative methods for signal reconstruction on graphs

We present two iterative algorithms to interpolate graph signals from on...
research
02/05/2020

Convex combination of alternating projection and Douglas-Rachford operators for phase retrieval

We present the convergence analysis of convex combination of the alterna...
research
07/16/2020

DeepInit Phase Retrieval

This paper shows how data-driven deep generative models can be utilized ...
research
11/22/2018

Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective

This study investigates phase reconstruction for deep learning based mon...