Speech Denoising with Deep Feature Losses

06/27/2018
by   Francois G. Germain, et al.
0

We present an end-to-end deep learning approach to denoising speech signals by processing the raw waveform directly. Given input audio containing speech corrupted by an additive background signal, the system aims to produce a processed signal that contains only the speech content. Recent approaches have shown promising results using various deep network architectures. In this paper, we propose to train a fully-convolutional context aggregation network using a deep feature loss. That loss is based on comparing the internal feature activations in a different network, trained for acoustic environment detection and domestic audio tagging. Our approach outperforms the state-of-the-art in objective speech quality metrics and in large-scale perceptual experiments with human listeners. It also outperforms an identical network trained using traditional regression losses. The advantage of the new approach is particularly pronounced for the hardest data with the most intrusive background noise, for which denoising is most needed and most challenging.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2020

Deep Network Perceptual Losses for Speech Denoising

Contemporary speech enhancement predominantly relies on audio transforms...
research
06/10/2020

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Real-world audio recordings are often degraded by factors such as noise,...
research
10/22/2020

Listening to Sounds of Silence for Speech Denoising

We introduce a deep learning model for speech denoising, a long-standing...
research
06/23/2022

Restoring speech intelligibility for hearing aid users with deep learning

Almost half a billion people world-wide suffer from disabling hearing lo...
research
10/22/2020

Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models

Deep learning based speech denoising still suffers from the challenge of...
research
10/22/2019

Speech-VGG: A deep feature extractor for speech processing

A growing number of studies in the field of speech processing employ fea...
research
02/23/2018

Do WaveNets Dream of Acoustic Waves?

Various sources have reported the WaveNet deep learning architecture bei...

Please sign up or login with your details

Forgot password? Click here to reset