A non-causal FFTNet architecture for speech enhancement

06/08/2020
by Muhammed PV Shifas, et al.

In this paper, we propose a new parallel, non-causal, and shallow waveform-domain architecture for speech enhancement based on FFTNet, a neural network for generating high-quality audio waveforms. In contrast to other waveform-based approaches such as WaveNet, FFTNet uses an initial wide dilation pattern. Such an architecture better represents the long-term correlated structure of speech in the time domain, where noise is usually highly uncorrelated, and is therefore suitable for waveform-domain speech enhancement. To further strengthen this feature of FFTNet, we suggest a non-causal FFTNet architecture, in which the present sample in each layer is estimated from the past and future samples of the previous layer. By using a shallow network and applying non-causality within certain limits, the suggested FFTNet for speech enhancement (SE-FFTNet) uses far fewer parameters than other neural-network-based approaches for speech enhancement such as WaveNet and SEGAN. Specifically, the suggested network has considerably reduced model parameters: 32% fewer compared to WaveNet and 87% fewer compared to SEGAN. Finally, based on subjective and objective metrics, SE-FFTNet outperforms WaveNet in terms of enhanced signal quality, while performing as well as SEGAN. A TensorFlow implementation of the architecture is provided at 1.
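To make the idea of a non-causal layer with a wide initial dilation concrete, the following is a minimal sketch in plain Python, not the authors' TensorFlow implementation. It assumes a three-tap layer (one past, one present, one future sample), scalar weights, zero padding at the edges, a ReLU nonlinearity, and a hypothetical dilation schedule of 8, 4, 2, 1; none of these specifics are stated in the abstract.

```python
def noncausal_layer(x, dilation, w_past, w_now, w_future):
    """One non-causal dilated layer (illustrative assumption, not the paper's exact layer).

    Each output sample mixes the input sample `dilation` steps in the past,
    the present input sample, and the input sample `dilation` steps in the future.
    """
    n = len(x)
    out = []
    for t in range(n):
        past = x[t - dilation] if t - dilation >= 0 else 0.0    # zero padding on the left
        future = x[t + dilation] if t + dilation < n else 0.0   # zero padding on the right
        z = w_past * past + w_now * x[t] + w_future * future
        out.append(max(z, 0.0))                                 # ReLU (assumed nonlinearity)
    return out


def noncausal_stack(x, dilations, weights):
    """Apply the layers in order; the first layer uses the widest dilation."""
    for d, (w_past, w_now, w_future) in zip(dilations, weights):
        x = noncausal_layer(x, d, w_past, w_now, w_future)
    return x


def receptive_field(dilations):
    """Number of input samples that can influence a single output sample."""
    return 1 + 2 * sum(dilations)


# Hypothetical schedule: wide dilation first, then progressively narrower.
DILATIONS = [8, 4, 2, 1]
WEIGHTS = [(1.0, 1.0, 1.0)] * len(DILATIONS)

# Feed a unit impulse to see which output samples it reaches.
impulse = [0.0] * 41
impulse[20] = 1.0
response = noncausal_stack(impulse, DILATIONS, WEIGHTS)
support = [i for i, v in enumerate(response) if v > 0.0]
```

With this schedule, four layers already cover a symmetric window of 31 samples around the current position, which illustrates how a shallow stack can draw on both past and future context.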
