Perceptual Speech Enhancement via Generative Adversarial Networks

10/21/2019
by   Sherif Abdulatif, et al.
0

Automatic speech recognition (ASR) systems are of vital importance nowadays in commonplace tasks such as speech-to-text processing and language translation. This created the need of an ASR system that can operate in realistic crowded environments. Thus, speech enhancement is now considered as a fundamental building block in newly developed ASR systems. In this paper, a generative adversarial network (GAN) based framework is investigated for the task of speech enhancement of audio tracks. A new architecture based on CasNet generator and additional perceptual loss is incorporated to get realistically denoised speech phonetics. Finally, the proposed framework is shown to quantitatively outperform other GAN-based speech enhancement approaches.

READ FULL TEXT

page 2

page 3

page 4

research
11/15/2017

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition

We investigate the effectiveness of generative adversarial networks (GAN...
research
07/27/2020

On the Use of Audio Fingerprinting Features for Speech Enhancement with Generative Adversarial Network

The advent of learning-based methods in speech enhancement has revived t...
research
05/13/2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement

Adversarial loss in a conditional generative adversarial network (GAN) i...
research
03/24/2022

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Generative adversarial networks have recently demonstrated outstanding p...
research
01/15/2020

Improving GANs for Speech Enhancement

Generative adversarial networks (GAN) have recently been shown to be eff...
research
08/21/2019

Coarse-to-fine Optimization for Speech Enhancement

In this paper, we propose the coarse-to-fine optimization for the task o...
research
09/06/2021

Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes

In this paper machine learning networks are explored for their use in re...

Please sign up or login with your details

Forgot password? Click here to reset