TS-RIR: Translated synthetic room impulse responses for speech augmentation

03/31/2021
by   Anton Ratnarajah, et al.
0

We present a method for improving the quality of synthetic room impulse responses for far-field speech recognition. We bridge the gap between the fidelity of synthetic room impulse responses (RIRs) and the real room impulse responses using our novel, TS-RIRGAN architecture. Given a synthetic RIR in the form of raw audio, we use TS-RIRGAN to translate it into a real RIR. We also perform real-world sub-band room equalization on the translated synthetic RIR. Our overall approach improves the quality of synthetic RIRs by compensating low-frequency wave effects, similar to those in real RIRs. We evaluate the performance of improved synthetic RIRs on a far-field speech dataset augmented by convolving the LibriSpeech clean speech dataset [1] with RIRs and adding background noise. We show that far-field speech augmented using our improved synthetic RIRs reduces the word error rate by up to 19.9 automatic speech recognition benchmark [2].

READ FULL TEXT
research
10/23/2019

Low-frequency compensated synthetic impulse responses for improved far-field speech recognition

We propose a method for generating low-frequency compensated synthetic i...
research
10/25/2020

IR-GAN: Room Impulse Response Generator for Speech Augmentation

We present a Generative Adversarial Network (GAN) based room impulse res...
research
12/10/2022

Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation

We present a novel approach to improve the performance of learning-based...
research
11/08/2022

Towards Improved Room Impulse Response Estimation for Speech Recognition

We propose to characterize and improve the performance of blind room imp...
research
07/25/2021

Adding air attenuation to simulated room impulse responses: A modal approach

Air absorption is an important effect to consider when simulating room a...
research
12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...
research
11/01/2019

Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization

This paper proposes to perform unsupervised detection of bioacoustic eve...

Please sign up or login with your details

Forgot password? Click here to reset