Convolutional Recurrent Neural Network Based Progressive Learning for Monaural Speech Enhancement

08/28/2019
by Andong Li, et al.

Recently, progressive learning has been shown to improve speech quality and speech intelligibility when combined with deep neural network (DNN) and long short-term memory (LSTM) based monaural speech enhancement algorithms, especially in low signal-to-noise ratio (SNR) conditions. Nevertheless, because of their large number of parameters and high computational complexity, these models are hard to implement on current resource-limited micro-controllers, so it is important to significantly reduce both the number of parameters and the computational load for practical applications. For this purpose, we propose a novel progressive learning framework with convolutional recurrent neural networks, called PL-CRNN, which takes advantage of both convolutional neural networks and recurrent neural networks to drastically reduce the number of parameters while simultaneously improving speech quality and speech intelligibility. Numerous experiments verify the effectiveness of the proposed PL-CRNN model and indicate that it yields consistently better performance than the PL-DNN and PL-LSTM algorithms, and that its results are close to, or even better than, those of the CRNN in terms of various evaluation metrics. Compared with the PL-DNN, PL-LSTM and state-of-the-art CRNN models, the proposed PL-CRNN algorithm reduces the number of parameters by up to 77%, 93% and 93%, respectively.
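To give a rough sense of why convolutional layers can shrink a model so sharply, the sketch below counts trainable parameters for a single fully connected, LSTM, and 2-D convolutional layer using the standard textbook formulas. The layer sizes (257 input features, 1024 hidden units, a 2x3 kernel with 16 output channels) are illustrative assumptions, not the configurations used in the paper, and the helper names are hypothetical.

```python
def dense_params(n_in: int, n_out: int) -> int:
    """Weights plus biases of one fully connected layer."""
    return n_in * n_out + n_out


def lstm_params(n_in: int, n_hidden: int) -> int:
    """Four gates, each with input weights, recurrent weights, and one bias vector.

    Note: some frameworks keep two bias vectors per gate, which adds
    4 * n_hidden parameters to this count.
    """
    return 4 * (n_hidden * (n_in + n_hidden) + n_hidden)


def conv2d_params(c_in: int, c_out: int, k_h: int, k_w: int) -> int:
    """Kernel weights plus one bias per output channel; independent of input size."""
    return c_out * c_in * k_h * k_w + c_out


def relative_reduction(baseline: int, proposed: int) -> float:
    """Fraction of parameters removed relative to the baseline model."""
    return 1.0 - proposed / baseline


if __name__ == "__main__":
    # Assumed, illustrative sizes only (not taken from the paper).
    dense = dense_params(257, 1024)
    lstm = lstm_params(257, 1024)
    conv = conv2d_params(1, 16, 2, 3)
    print(f"dense layer : {dense:>9,d} parameters")
    print(f"LSTM layer  : {lstm:>9,d} parameters")
    print(f"conv layer  : {conv:>9,d} parameters")
    print(f"conv vs LSTM reduction: {relative_reduction(lstm, conv):.4%}")
```

Because a convolution kernel is shared across all time-frequency positions, its parameter count does not grow with the feature dimension, whereas dense and recurrent layers scale with the product of input and hidden sizes. A reported reduction of 77% corresponds to keeping 23% of the baseline parameters.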

research
02/14/2020

Real-time speech enhancement using equilibriated RNN

We propose a speech enhancement method using a causal deep neural networ...
research
10/10/2020

A Model Compression Method with Matrix Product Operators for Speech Enhancement

The deep neural network (DNN) based speech enhancement approaches have a...
research
12/25/2018

Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement

In recent years, Long Short-Term Memory (LSTM) has become a popular choi...
research
01/24/2018

Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension

This paper presents a waveform modeling and generation method using hier...
research
03/19/2020

Convolutional Neural Networks for Continuous QoE Prediction in Video Streaming Services

In video streaming services, predicting the continuous user's quality of...
research
04/07/2020

SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement

This paper analyzes the generalization of speech enhancement algorithms ...
research
05/28/2022

Go Beyond Multiple Instance Neural Networks: Deep-learning Models based on Local Pattern Aggregation

Deep convolutional neural networks (CNNs) have brought breakthroughs in ...
