FAST-RIR: Fast neural diffuse room impulse response generator

10/07/2021
by   Anton Ratnarajah, et al.
0

We present a neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment. Our FAST-RIR takes rectangular room dimensions, listener and speaker positions, and reverberation time as inputs and generates specular and diffuse reflections for a given acoustic environment. Our FAST-RIR is capable of generating RIRs for a given input reverberation time with an average error of 0.02s. We evaluate our generated RIRs in automatic speech recognition (ASR) applications using Google Speech API, Microsoft Speech API, and Kaldi tools. We show that our proposed FAST-RIR with batch size 1 is 400 times faster than a state-of-the-art diffuse acoustic simulator (DAS) on a CPU and gives similar performance to DAS in ASR experiments. Our FAST-RIR is 12 times faster than an existing GPU-based RIR generator (gpuRIR). We show that our FAST-RIR outperforms gpuRIR by 2.5

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2020

IR-GAN: Room Impulse Response Generator for Speech Augmentation

We present a Generative Adversarial Network (GAN) based room impulse res...
research
12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...
research
07/11/2018

A Fast-Converged Acoustic Modeling for Korean Speech Recognition: A Preliminary Study on Time Delay Neural Network

In this paper, a time delay neural network (TDNN) based acoustic model i...
research
11/08/2022

Towards Improved Room Impulse Response Estimation for Speech Recognition

We propose to characterize and improve the performance of blind room imp...
research
10/26/2018

gpuRIR: A python library for Room Impulse Response simulation with GPU acceleration

The Image Source Method (ISM) is one of the most employed techniques to ...
research
05/18/2022

MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes

We propose a mesh-based neural network (MESH2IR) to generate acoustic im...
research
04/15/2021

Towards a Fast and Accurate Model of Intercontact Times for Epidemic Routing

We present an accurate user-encounter trace generator based on analytica...

Please sign up or login with your details

Forgot password? Click here to reset