Recycling an anechoic pre-trained speech separation deep neural network for binaural dereverberation of a single source

08/09/2022
by   Sania Gul, et al.
0

Reverberation results in reduced intelligibility for both normal and hearing-impaired listeners. This paper presents a novel psychoacoustic approach of dereverberation of a single speech source by recycling a pre-trained binaural anechoic speech separation neural network. As training the deep neural network (DNN) is a lengthy and computationally expensive process, the advantage of using a pre-trained separation network for dereverberation is that the network does not need to be retrained, saving both time and computational resources. The interaural cues of a reverberant source are given to this pretrained neural network to discriminate between the direct path signal and the reverberant speech. The results show an average improvement of 1.3 signal intelligibility, 0.83 dB in SRMR (signal to reverberation energy ratio) and 0.16 points in perceptual evaluation of speech quality (PESQ) over other state-of-the-art signal processing dereverberation algorithms and 14 intelligibility and 0.35 points in quality over orthogonal matching pursuit with spectral subtraction (OSS), a machine learning based dereverberation algorithm.

READ FULL TEXT
research
08/10/2022

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Reverberations are unavoidable in enclosures, resulting in reduced intel...
research
04/14/2023

On Data Sampling Strategies for Training Neural Network Speech Separation Models

Speech separation remains an important area of multi-speaker signal proc...
research
08/23/2020

Independent Vector Analysis with Deep Neural Network Source Priors

This paper studies the density priors for independent vector analysis (I...
research
09/11/2020

RECOApy: Data recording, pre-processing and phonetic transcription for end-to-end speech-based applications

Deep learning enables the development of efficient end-to-end speech pro...
research
11/20/2019

Perceptual Loss Function for Neural Modelling of Audio Systems

This work investigates alternate pre-emphasis filters used as part of th...
research
10/20/2022

Speech Dereverberation with a Reverberation Time Shortening Target

This work proposes a new learning target based on reverberation time sho...
research
08/07/2023

Improving Deep Attractor Network by BGRU and GMM for Speech Separation

Deep Attractor Network (DANet) is the state-of-the-art technique in spee...

Please sign up or login with your details

Forgot password? Click here to reset