Deep neural network techniques for monaural speech enhancement: state of the art analysis

12/01/2022
by   Peter Ochieng, et al.
0

Deep neural networks (DNN) techniques have become pervasive in domains such as natural language processing and computer vision. They have achieved great success in these domains in task such as machine translation and image generation. Due to their success, these data driven techniques have been applied in audio domain. More specifically, DNN models have been applied in speech enhancement domain to achieve denosing, dereverberation and multi-speaker separation in monaural speech enhancement. In this paper, we review some dominant DNN techniques being employed to achieve speech separation. The review looks at the whole pipeline of speech enhancement from feature extraction, how DNN based tools are modelling both global and local features of speech and model training (supervised and unsupervised). We also review the use of speech-enhancement pre-trained models to boost speech enhancement process. The review is geared towards covering the dominant trends with regards to DNN application in speech enhancement in speech obtained via a single speaker.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/27/2021

On TasNet for Low-Latency Single-Speaker Speech Enhancement

In recent years, speech processing algorithms have seen tremendous progr...
research
05/23/2019

A Perceptual Weighting Filter Loss for DNN Training in Speech Enhancement

Single-channel speech enhancement with deep neural networks (DNNs) has s...
research
02/03/2020

Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network

We propose a tensor-to-vector regression approach to multi-channel speec...
research
07/24/2023

An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes

We investigate the objective performance of five high-end commercially a...
research
10/17/2022

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

Speaker verification (SV) suffers from unsatisfactory performance in far...
research
11/08/2021

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points

Numerous compression and acceleration strategies have achieved outstandi...
research
11/07/2018

On the use of DNN Autoencoder for Robust Speaker Recognition

In this paper, we present an analysis of a DNN-based autoencoder for spe...

Please sign up or login with your details

Forgot password? Click here to reset