Convolutional-Recurrent Neural Networks for Speech Enhancement

05/02/2018
by   Han Zhao, et al.
0

We propose an end-to-end model based on convolutional and recurrent neural networks for speech enhancement. Our model is purely data-driven and does not make any assumptions about the type or the stationarity of the noise. In contrast to existing methods that use multilayer perceptrons (MLPs), we employ both convolutional and recurrent neural network architectures. Thus, our approach allows us to exploit local structures in both the frequency and temporal domains. By incorporating prior knowledge of speech signals into the design of model structures, we build a model that is more data-efficient and achieves better generalization on both seen and unseen noise. Based on experiments with synthetic data, we demonstrate that our model outperforms existing methods, improving PESQ by up to 0.6 on seen noise and 0.64 on unseen noise.

READ FULL TEXT
research
02/02/2020

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks

In recent decades, neural network based methods have significantly impro...
research
01/24/2022

A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement

In acoustic signal processing, the target signals usually carry semantic...
research
02/16/2018

Constrained Convolutional-Recurrent Networks to Improve Speech Quality with Low Impact on Recognition Accuracy

For a speech-enhancement algorithm, it is highly desirable to simultaneo...
research
08/24/2017

CloudScan - A configuration-free invoice analysis system using recurrent neural networks

We present CloudScan; an invoice analysis system that requires zero conf...
research
07/28/2020

Neural Kalman Filtering for Speech Enhancement

Statistical signal processing based speech enhancement methods adopt exp...
research
01/22/2021

Towards efficient models for real-time deep noise suppression

With recent research advancements, deep learning models are becoming att...
research
06/09/2020

A fully recurrent feature extraction for single channel speech enhancement

Convolutional neural network (CNN) modules are widely being used to buil...

Please sign up or login with your details

Forgot password? Click here to reset