Model-Based Speech Enhancement in the Modulation Domain

07/09/2017
by   Yu Wang, et al.
0

This paper presents algorithms for modulation-domain speech enhancement using a Kalman filter. The algorithms are derived using two alternative statistical models for the speech and noise spectral coefficients. The proposed models incorporate the estimated dynamics of the spectral amplitudes of speech and noise into the MMSE estimation of the amplitude spectrum of the clean speech. Both models assume that the speech and noise are additive in the complex domain. The difference between the two algorithms is that the the first algorithm models only the spectral dynamics of the clean speech while the second algorithm jointly models the spectral dynamics of both speech and noise. In the first algorithm, a closed-form estimator is derived under the assumption that speech amplitudes follow a form of generalized Gamma distribution and the noise amplitudes follow Gaussian distribution. In the second algorithm, in order to include the dynamics of noise amplitudes with that of speech amplitudes, we propose a statistical "Gaussring" model that comprises a mixture of Gaussians whose centres lie in a circle on the complex plane. The performance of the proposed algorithms are evaluated using the perceptual evaluation of speech quality (PESQ) measure and segmental SNR measure and shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms.

READ FULL TEXT

page 5

page 11

research
08/07/2017

Phase-Aware Single-Channel Speech Enhancement with Modulation-Domain Kalman Filtering

We present a single-channel phase-sensitive speech enhancement algorithm...
research
10/31/2018

On Single-Channel Speech Enhancement and On Non-Linear Modulation-Domain Kalman Filtering

This report focuses on algorithms that perform single-channel speech enh...
research
02/10/2022

Auditory Model based Phase-Aware Bayesian Spectral Amplitude Estimator for Single-Channel Speech Enhancement

Bayesian estimation of short-time spectral amplitude is one of the most ...
research
07/26/2018

Modulation-Domain Kalman Filtering for Monaural Blind Speech Denoising and Dereverberation

We describe a monaural speech enhancement algorithm based on modulation-...
research
06/13/2018

Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids

Speech intelligibility is often severely degraded among hearing impaired...
research
02/10/2022

Single-channel speech enhancement by using psychoacoustical model inspired fusion framework

When the parameters of Bayesian Short-time Spectral Amplitude (STSA) est...
research
07/22/2019

ML Estimation and CRBs for Reverberation, Speech and Noise PSDs in Rank-Deficient Noise-Field

Speech communication systems are prone to performance degradation in rev...

Please sign up or login with your details

Forgot password? Click here to reset