Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning

09/17/2023
by Zilu Guo, et al.

In this paper, we explore a continuous modeling approach to deep-learning-based speech enhancement that focuses on the denoising process itself. We represent this process with a state variable: the starting state is the noisy speech and the ending state is the clean speech. The noise component of the state variable decreases as the state index advances, reaching zero at the ending state. During training, a UNet-like neural network learns to estimate every state variable sampled from the continuous denoising process. At test time, a controlling factor ranging from zero to one is fed to the network as an embedding, which lets us control the level of noise reduction. This makes the enhancement controllable and adaptable to various application scenarios. Experimental results indicate that preserving a small amount of noise in the clean target benefits speech enhancement, as evidenced by improvements in both objective speech measures and automatic speech recognition performance.
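The state variable described above can be illustrated with a minimal sketch. The abstract does not give the exact schedule by which the noise component shrinks, so the linear interpolation below is an assumption made purely for illustration, as are the function name `state_variable`, the index `t`, and the synthetic signals.

```python
import numpy as np

def state_variable(clean, noise, t):
    """Return a hypothetical state at index t in [0, 1].

    Assumption: the noise component is scaled linearly by (1 - t),
    so t = 0 gives the noisy speech and t = 1 gives the clean speech.
    The paper's actual schedule may differ.
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    return clean + (1.0 - t) * noise

# Synthetic stand-ins for one second of speech and noise at 16 kHz.
rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
noise = 0.1 * rng.standard_normal(16000)
noisy = clean + noise

start = state_variable(clean, noise, 0.0)    # starting state: noisy speech
end = state_variable(clean, noise, 1.0)      # ending state: clean speech
partial = state_variable(clean, noise, 0.9)  # target that keeps a little noise
```

Under this assumed schedule, a target such as `partial` retains a small fraction of the original noise, which is the kind of target the abstract reports as beneficial. How the controlling factor at test time maps onto such an index is not specified in the abstract.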

research
05/20/2022

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

Acoustic echo cancellation (AEC) plays an important role in the full-dup...
research
05/25/2021

RNNoise-Ex: Hybrid Speech Enhancement System based on RNN and Spectral Features

Recent interest in exploiting Deep Learning techniques for Noise Suppres...
research
02/23/2021

Handling Background Noise in Neural Speech Generation

Recent advances in neural-network based generative modeling of speech ha...
research
11/08/2020

Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation

This paper presents a denoising and dereverberation hierarchical neural ...
research
09/07/2023

Spiking Structured State Space Model for Monaural Speech Enhancement

Speech enhancement seeks to extract clean speech from noisy signals. Tra...
research
01/06/2020

Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders

Deep learning-based models have greatly advanced the performance of spee...
research
06/02/2020

Dilated U-net based approach for multichannel speech enhancement from First-Order Ambisonics recordings

We present a CNN architecture for speech enhancement from multichannel f...
