Nonnegative HMM for Babble Noise Derived from Speech HMM: Application to Speech Enhancement

09/16/2017
by   Nasser Mohammadiha, et al.
0

Deriving a good model for multitalker babble noise can facilitate different speech processing algorithms, e.g. noise reduction, to reduce the so-called cocktail party difficulty. In the available systems, the fact that the babble waveform is generated as a sum of N different speech waveforms is not exploited explicitly. In this paper, first we develop a gamma hidden Markov model for power spectra of the speech signal, and then formulate it as a sparse nonnegative matrix factorization (NMF). Second, the sparse NMF is extended by relaxing the sparsity constraint, and a novel model for babble noise (gamma nonnegative HMM) is proposed in which the babble basis matrix is the same as the speech basis matrix, and only the activation factors (weights) of the basis vectors are different for the two signals over time. Finally, a noise reduction algorithm is proposed using the derived speech and babble models. All of the stationary model parameters are estimated using the expectation-maximization (EM) algorithm, whereas the time-varying parameters, i.e. the gain parameters of speech and babble signals, are estimated using a recursive EM algorithm. The objective and subjective listening evaluations show that the proposed babble model and the final noise reduction algorithm significantly outperform the conventional methods.

READ FULL TEXT
research
09/15/2017

Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

Reducing the interference noise in a monaural noisy speech signal has be...
research
01/11/2014

An Online Expectation-Maximisation Algorithm for Nonnegative Matrix Factorisation Models

In this paper we formulate the nonnegative matrix factorisation (NMF) pr...
research
08/31/2017

A State-Space Approach to Dynamic Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) has been actively investigated an...
research
11/16/2018

Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization

In this paper we address speaker-independent multichannel speech enhance...
research
06/30/2020

A Speech Enhancement Algorithm based on Non-negative Hidden Markov Model and Kullback-Leibler Divergence

In this paper, we propose a novel supervised single-channel speech enhan...
research
09/16/2017

Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro temporal Modeling

This paper presents two single channel speech dereverberation methods to...
research
07/12/2012

Probabilistic index maps for modeling natural signals

One of the major problems in modeling natural signals is that signals wi...

Please sign up or login with your details

Forgot password? Click here to reset