Blind Signal Dereverberation for Machine Speech Recognition

09/30/2022
by   Samik Sadhu, et al.
0

We present a method to remove unknown convolutive noise introduced to speech by reverberations of recording environments, utilizing some amount of training speech data from the reverberant environment, and any available non-reverberant speech data. Using Fourier transform computed over long temporal windows, which ideally cover the entire room impulse response, we convert room induced convolution to additions in the log spectral domain. Next, we compute a spectral normalization vector from statistics gathered over reverberated as well as over clean speech in the log spectral domain. During operation, this normalization vectors are used to alleviate reverberations from complex speech spectra recorded under the same reverberant conditions . Such dereverberated complex speech spectra are used to compute complex FDLP-spectrograms for use in automatic speech recognition.

READ FULL TEXT
research
04/14/2022

Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features

For a better understanding of the mechanisms underlying speech perceptio...
research
03/25/2021

Radically Old Way of Computing Spectra: Applications in End-to-End ASR

We propose a technique to compute spectrograms using Frequency Domain Li...
research
04/13/2018

Voices Obscured in Complex Environmental Settings (VOICES) corpus

This paper introduces the Voices Obscured In Complex Environmental Setti...
research
08/29/2019

Fraudulent White Noise: Flat power spectra belie arbitrarily complex processes

Power spectral densities are a common, convenient, and powerful way to a...
research
03/26/2018

Spectral feature mapping with mimic loss for robust speech recognition

For the task of speech enhancement, local learning objectives are agnost...
research
11/01/2019

Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization

This paper proposes to perform unsupervised detection of bioacoustic eve...
research
03/24/2022

Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech

Conventional Frequency Domain Linear Prediction (FDLP) technique models ...

Please sign up or login with your details

Forgot password? Click here to reset