Switching divergences for spectral learning in blind speech dereverberation

09/19/2018
by Francisco Ibarrola, et al.

When recorded in an enclosed room, a sound signal is almost inevitably affected by reverberation. This not only degrades audio quality, but also poses a problem for many human-machine interaction technologies that use speech as their input. In this work, a new blind, two-stage dereverberation approach is proposed, based on a generalized β-divergence as a fidelity term over a non-negative representation. The first stage consists of learning the spectral structure of the signal solely from the observed spectrogram, while the second stage is devoted to modeling reverberation. Both stages minimize a cost function in which the divergence is changed so that the emphasis falls either on constructing a dictionary or on obtaining a good representation. In addition, an approach for finding an optimal fidelity parameter for dictionary learning is proposed. An algorithm implementing the proposed method is described and tested against state-of-the-art methods. Results show improvements for both artificial reverberation and real recordings.
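
To make the role of the fidelity term concrete, the sketch below evaluates the standard β-divergence between a non-negative spectrogram and an approximation of it. It is an illustration under assumptions, not the authors' algorithm: the function names `beta_divergence` and `fidelity` are hypothetical, the formula is the textbook β-divergence (with the Itakura-Saito and generalized Kullback-Leibler limits at β = 0 and β = 1), and the exact generalized form and reverberation model used in the paper may differ.

```python
import math


def beta_divergence(x, y, beta):
    """Textbook beta-divergence d_beta(x | y) for strictly positive scalars.

    Assumption: this is the standard definition, not necessarily the
    generalized variant used in the paper.
    """
    if x <= 0 or y <= 0:
        raise ValueError("x and y must be strictly positive")
    if beta == 0:  # Itakura-Saito limit
        ratio = x / y
        return ratio - math.log(ratio) - 1.0
    if beta == 1:  # generalized Kullback-Leibler limit
        return x * math.log(x / y) - x + y
    numerator = x**beta + (beta - 1.0) * y**beta - beta * x * y ** (beta - 1.0)
    return numerator / (beta * (beta - 1.0))


def fidelity(S, S_hat, beta):
    """Sum of element-wise divergences between two equally sized
    non-negative matrices, each given as a list of rows (hypothetical helper)."""
    return sum(
        beta_divergence(s, t, beta)
        for row_s, row_t in zip(S, S_hat)
        for s, t in zip(row_s, row_t)
    )


if __name__ == "__main__":
    observed = [[1.0, 2.0], [3.0, 4.0]]   # toy "spectrogram"
    model = [[1.5, 2.0], [2.5, 4.5]]      # toy approximation
    # Changing beta changes how the same mismatch is penalized.
    for beta in (0, 1, 2):
        print(beta, fidelity(observed, model, beta))
```

Evaluating the same pair of matrices under different values of β shows how the choice of divergence changes the penalty assigned to a given mismatch, which is the quantity the abstract describes switching between stages.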

research
03/19/2019

Non-negative representation based discriminative dictionary learning for face recognition

In this paper, we propose a non-negative representation based discrimina...
research
02/17/2022

A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings

Enhancing the sound quality of historical music recordings is a long-sta...
research
04/05/2019

Unsupervised Low Latency Speech Enhancement with RT-GCC-NMF

In this paper, we present RT-GCC-NMF: a real-time (RT), two-channel blin...
research
09/16/2017

Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro temporal Modeling

This paper presents two single channel speech dereverberation methods to...
research
08/19/2018

Dynamic Temporal Alignment of Speech to Lips

Many speech segments in movies are re-recorded in a studio during postpr...
research
04/23/2016

An information theoretic formulation of the Dictionary Learning and Sparse Coding Problems on Statistical Manifolds

In this work, we propose a novel information theoretic framework for dic...
research
09/30/2016

Optimal spectral transportation with application to music transcription

Many spectral unmixing methods rely on the non-negative decomposition of...
