Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

07/01/2020
by   Thomas Dietzen, et al.
0

Power spectral density (PSD) estimates of various microphone signal components are essential to many speech enhancement procedures. As speech is highly non-nonstationary, performance improvements may be gained by maintaining time-variations in PSD estimates. In this paper, we propose an instantaneous PSD estimation approach based on generalized principal components. Similarly to other eigenspace-based PSD estimation approaches, we rely on recursive averaging in order to obtain a microphone signal correlation matrix estimate to be decomposed. However, instead of estimating the PSDs directly from the temporally smooth generalized eigenvalues of this matrix, yielding temporally smooth PSD estimates, we propose to estimate the PSDs from newly defined instantaneous generalized eigenvalues, yielding instantaneous PSD estimates. The instantaneous generalized eigenvalues are defined from the generalized principal components, i.e. a generalized eigenvector-based transform of the microphone signals. We further show that the smooth generalized eigenvalues can be understood as a recursive average of the instantaneous generalized eigenvalues. Simulation results comparing the multi-channel Wiener filter (MWF) with smooth and instantaneous PSD estimates indicate better speech enhancement performance for the latter. A MATLAB implementation is available online.

READ FULL TEXT
research
10/07/2019

Impulsive Noise Detection for Intelligibility and Quality Improvement of Speech Enhancement Methods Applied in Time-Domain

This letter introduces a novel speech enhancement method in the Hilbert-...
research
02/20/2020

iSEGAN: Improved Speech Enhancement Generative Adversarial Networks

Popular neural network-based speech enhancement systems operate on the m...
research
09/07/2023

Causal Signal-Based DCCRN with Overlapped-Frame Prediction for Online Speech Enhancement

The aim of speech enhancement is to improve speech signal quality and in...
research
10/07/2022

Model-based estimation of in-car-communication feedback applied to speech zone detection

Modern cars provide versatile tools to enhance speech communication. Whi...
research
03/22/2020

Monaural Speech Enhancement with Recursive Learning in the Time Domain

In this paper, we propose a type of neural network with recursive learni...
research
03/22/2020

A Time-domain Monaural Speech Enhancement with Recursive Learning

In this paper, we propose a type of neural network with recursive learni...

Please sign up or login with your details

Forgot password? Click here to reset