Listening to Sounds of Silence for Speech Denoising

10/22/2020
by   Ruilin Xu, et al.
0

We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental silent intervals to learn a model for automatic speech denoising given only mono-channel audio. Detected silent intervals over time expose not just pure noise but its time-varying features, allowing the model to learn noise dynamics and suppress it from the speech signal. Experiments on multiple datasets confirm the pivotal role of silent interval detection for speech denoising, and our method outperforms several state-of-the-art denoising methods, including those that accept only audio input (like ours) and those that denoise based on audiovisual input (and hence require more information). We also show that our method enjoys excellent generalization properties, such as denoising spoken languages not seen during training.

READ FULL TEXT

page 4

page 6

page 7

page 18

research
10/18/2022

BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds

Audio denoising has been explored for decades using both traditional and...
research
06/27/2018

Speech Denoising with Deep Feature Losses

We present an end-to-end deep learning approach to denoising speech sign...
research
04/08/2021

Speech Denoising without Clean Training Data: a Noise2Noise Approach

This paper tackles the problem of the heavy dependence of clean speech d...
research
04/29/2021

Star DGT: a Robust Gabor Transform for Speech Denoising

In this paper, we address the speech denoising problem, where white Gaus...
research
08/22/2023

Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users

The hearing loss of almost half a billion people is commonly treated wit...
research
05/20/2020

SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning

Speech data collected in real-world scenarios often encounters two issue...
research
05/25/2021

RNNoise-Ex: Hybrid Speech Enhancement System based on RNN and Spectral Features

Recent interest in exploiting Deep Learning techniques for Noise Suppres...

Please sign up or login with your details

Forgot password? Click here to reset