LSTM based AE-DNN constraint for better late reverb suppression in multi-channel LP formulation

12/04/2018
by   Srikanth Raj Chetupalli, et al.
0

Prediction of late reverberation component using multi-channel linear prediction (MCLP) in short-time Fourier transform (STFT) domain is an effective means to enhance reverberant speech. Traditionally, a speech power spectral density (PSD) weighted prediction error (WPE) minimization approach is used to estimate the prediction filters. The method is sensitive to the estimate of the desired signal PSD. In this paper, we propose a deep neural network (DNN) based non-linear estimate for the desired signal PSD. An auto encoder trained on clean speech STFT coefficients is used as the desired signal prior. We explore two different architectures based on (i) fully-connected (FC) feed-forward, and (ii) recurrent long short-term memory (LSTM) layers. Experiments using real room impulse responses show that the LSTM-DNN based PSD estimate performs better than the traditional methods for late reverb suppression.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2017

Long Short-Term Memory (LSTM) networks with jet constituents for boosted top tagging at the LHC

Multivariate techniques based on engineered features have found wide ado...
research
06/03/2021

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

Recently, the convolutional weighted power minimization distortionless r...
research
05/26/2023

ElectrodeNet – A Deep Learning Based Sound Coding Strategy for Cochlear Implants

ElectrodeNet, a deep learning based sound coding strategy for the cochle...
research
04/10/2019

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

We propose a method using a long short-term memory (LSTM) network to est...
research
07/08/2021

A hybrid virtual sensing approach for approximating non-linear dynamic system behavior using LSTM networks

Modern Internet of Things solutions are used in a variety of different a...
research
06/25/2018

Single-channel Speech Dereverberation via Generative Adversarial Training

In this paper, we propose a single-channel speech dereverberation system...
research
04/11/2022

SAL-CNN: Estimate the Remaining Useful Life of Bearings Using Time-frequency Information

In modern industrial production, the prediction ability of the remaining...

Please sign up or login with your details

Forgot password? Click here to reset