Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro-temporal Modeling

09/16/2017
by Nasser Mohammadiha, et al.

This paper presents two single-channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space. For both methods, the room acoustics are modeled using a nonnegative approximation of the convolutive transfer function (NCTF). To additionally exploit the spectral properties of the speech signal, such as the low-rank nature of the speech spectrogram, the speech spectrogram is modeled using nonnegative matrix factorization (NMF). Two methods are described to combine the NCTF and NMF models. In the first method, referred to as the integrated method, a cost function is constructed by directly integrating the speech NMF model into the NCTF model. In the second method, referred to as the weighted method, the NCTF-based and NMF-based cost functions are weighted and summed. Efficient update rules are derived to solve both optimization problems. In addition, an extension of the integrated method is presented, which exploits the temporal dependencies of the speech signal. Several experiments are performed on reverberant speech signals with and without background noise, in which the integrated method yields considerably higher speech quality than the baseline NCTF method and a state-of-the-art spectral enhancement method. Moreover, the experimental results indicate that the weighted method can lead to even better performance in terms of instrumental quality measures, but that the optimal weighting parameter depends on the room acoustics and the NMF model used. Modeling the temporal dependencies in the integrated method was found to be useful only in highly reverberant conditions.
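To make the difference between the two combinations concrete, the following is a minimal NumPy sketch of how the integrated and weighted cost functions could be evaluated. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the divergence, so the generalized Kullback-Leibler divergence is assumed here, and the names `nctf_forward`, `integrated_cost`, `weighted_cost`, `W`, `A`, `H`, and `lam` are hypothetical. The update rules derived in the paper are not reproduced.

```python
import numpy as np

def nctf_forward(S, H):
    """NCTF model: per-frequency nonnegative convolution along time.

    S: (K, T) nonnegative clean-speech spectrogram.
    H: (K, L) nonnegative filter taps, one length-L filter per frequency bin.
    Returns the (K, T) approximation of the reverberant spectrogram,
    Y[k, t] = sum_tau H[k, tau] * S[k, t - tau].
    """
    K, T = S.shape
    L = H.shape[1]
    Y = np.zeros((K, T))
    for tau in range(min(L, T)):
        # Delay S by tau frames and scale each frequency bin by its tap.
        Y[:, tau:] += H[:, [tau]] * S[:, :T - tau]
    return Y

def kl_div(X, Y, eps=1e-12):
    """Generalized KL divergence (an assumed choice of cost)."""
    X = X + eps
    Y = Y + eps
    return float(np.sum(X * np.log(X / Y) - X + Y))

def integrated_cost(Y_obs, H, W, A):
    """Integrated method: the NMF model W @ A replaces S inside the NCTF model."""
    return kl_div(Y_obs, nctf_forward(W @ A, H))

def weighted_cost(Y_obs, H, S, W, A, lam):
    """Weighted method: NCTF cost plus lam times the NMF cost on S."""
    return kl_div(Y_obs, nctf_forward(S, H)) + lam * kl_div(S, W @ A)

# Toy usage with random nonnegative data (sizes chosen arbitrarily).
rng = np.random.default_rng(0)
K, T, L, R = 6, 20, 4, 3          # bins, frames, filter length, NMF rank
W = rng.random((K, R))            # speech basis matrix
A = rng.random((R, T))            # activations
H = rng.random((K, L))            # NCTF taps
S = W @ A                         # low-rank "clean" spectrogram
Y_obs = nctf_forward(S, H)        # synthetic reverberant observation

print(integrated_cost(Y_obs, H, W, A))
print(weighted_cost(Y_obs, H, S, W, A, lam=0.5))
```

In this sketch the integrated cost has no free speech spectrogram: the estimate is constrained to be exactly `W @ A`. The weighted cost keeps `S` as a separate variable and only penalizes its deviation from `W @ A`, with `lam` controlling how strongly the NMF model is enforced, which is consistent with the abstract's remark that the best weighting depends on the room acoustics and the NMF model.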

research
09/26/2022

Least-squares methods for nonnegative matrix factorization over rational functions

Nonnegative Matrix Factorization (NMF) models are widely used to recover...
research
09/15/2017

Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

Reducing the interference noise in a monaural noisy speech signal has be...
research
12/07/2020

Nonnegative Matrix Factorization with Toeplitz Penalty

Nonnegative Matrix Factorization (NMF) is an unsupervised learning algor...
research
08/31/2017

A State-Space Approach to Dynamic Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) has been actively investigated an...
research
09/19/2018

Switching divergences for spectral learning in blind speech dereverberation

When recorded in an enclosed room, a sound signal will most certainly ge...
research
09/16/2017

Nonnegative HMM for Babble Noise Derived from Speech HMM: Application to Speech Enhancement

Deriving a good model for multitalker babble noise can facilitate differ...
research
10/13/2016

Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network

In this paper, we propose a dictionary update method for Nonnegative Mat...
