Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement

06/13/2020
by   Andong Li, et al.
0

The generative adversarial networks (GANs) have facilitated the development of speech enhancement recently. Nevertheless, the performance advantage is still limited when compared with state-of-the-art models. In this paper, we propose a powerful Dynamic Attention Recursive GAN called DARGAN for noise reduction in the time-frequency domain. Different from previous works, we have several innovations. First, recursive learning, an iterative training protocol, is used in the generator, which consists of multiple steps. By reusing the network in each step, the noise components are progressively reduced in a step-wise manner. Second, the dynamic attention mechanism is deployed, which helps to re-adjust the feature distribution in the noise reduction module. Third, we exploit the deep Griffin-Lim algorithm as the module for phase postprocessing, which facilitates further improvement in speech quality. Experimental results on Voice Bank corpus show that the proposed GAN achieves state-of-the-art performance than previous GAN- and non-GAN-based models

READ FULL TEXT

page 2

page 4

research
12/19/2020

DCCRGAN: Deep Complex Convolution Recurrent Generator Adversarial Network for Speech Enhancement

Generative adversarial network (GAN) still exists some problems in deali...
research
12/30/2020

Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network

In this work, we aim to learn an unpaired image enhancement model, which...
research
07/28/2021

CycleGAN-based Non-parallel Speech Enhancement with an Adaptive Attention-in-attention Mechanism

Non-parallel training is a difficult but essential task for DNN-based sp...
research
03/29/2020

A Recursive Network with Dynamic Attention for Monaural Speech Enhancement

A person tends to generate dynamic attention towards speech under compli...
research
10/21/2022

Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation

Deep generative models for Speech Enhancement (SE) received increasing a...
research
03/27/2018

Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition

We investigate the use of generative adversarial networks (GANs) in spee...
research
03/22/2020

A Time-domain Monaural Speech Enhancement with Recursive Learning

In this paper, we propose a type of neural network with recursive learni...

Please sign up or login with your details

Forgot password? Click here to reset