Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech

07/15/2021
by Christian J. Steinmetz, et al.

Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it were recorded in the same room as a reference recording, with applications in both audio post-production and augmented reality. In this work, we propose FiNS, a Filtered Noise Shaping network that directly estimates the time domain room impulse response (RIR) from reverberant speech. Our domain-inspired architecture features a time domain encoder and a filtered noise shaping decoder that models the RIR as a summation of decaying filtered noise signals, along with direct sound and early reflection components. Previous methods for acoustic matching either use large models to transform audio to match the target room or predict parameters for algorithmic reverberators. Instead, blind estimation of the RIR enables efficient and realistic transformation with a single convolution. Our evaluation shows that the model not only synthesizes RIRs that match parameters of the target room, such as the reverberation time (T_60) and direct-to-reverberant ratio (DRR), but also reproduces the perceptual characteristics of the target room more accurately than deep learning baselines, as shown in a listening test.
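To make the signal model in the abstract concrete, the following is a minimal NumPy sketch of an RIR built as a sum of decaying filtered noise signals plus a direct-sound impulse, then applied to a dry signal with a single convolution. It is an illustration under stated assumptions, not the FiNS network: the function names `synth_rir` and `apply_rir`, the equal-width ideal band masks, the exponential envelopes, and the hand-chosen per-band decay times and gains are all hypothetical stand-ins for quantities the paper's decoder would predict, and early reflections are omitted.

```python
import numpy as np

def synth_rir(t60s, gains, sr=16000, length_s=1.0, seed=0):
    """Toy RIR: direct impulse plus band-limited, exponentially decaying noise.

    t60s and gains hold one decay time (seconds) and one gain per band.
    All values are hand-chosen here (assumption); nothing is learned.
    """
    rng = np.random.default_rng(seed)
    n = int(sr * length_s)
    t = np.arange(n) / sr
    freqs = np.fft.rfftfreq(n, d=1.0 / sr)
    # Equal-width bands from 0 Hz to Nyquist (assumption, chosen for simplicity).
    edges = np.linspace(0.0, sr / 2, len(t60s) + 1)
    rir = np.zeros(n)
    for k, (t60, gain) in enumerate(zip(t60s, gains)):
        # Filter white noise with an ideal band mask in the frequency domain.
        spec = np.fft.rfft(rng.standard_normal(n))
        mask = (freqs >= edges[k]) & (freqs < edges[k + 1])
        band = np.fft.irfft(spec * mask, n=n)
        # Amplitude envelope that falls by 60 dB at t = t60.
        envelope = 10.0 ** (-3.0 * t / t60)
        rir += gain * envelope * band
    rir[0] += 1.0  # direct sound as a unit impulse at time zero
    return rir

def apply_rir(dry, rir):
    """Acoustic matching with a single convolution of the dry signal and the RIR."""
    return np.convolve(dry, rir)

if __name__ == "__main__":
    rir = synth_rir(t60s=[0.6, 0.4, 0.25], gains=[0.3, 0.2, 0.1])
    dry = np.random.default_rng(1).standard_normal(16000)  # stand-in for speech
    wet = apply_rir(dry, rir)
    print(rir.shape, wet.shape)
```

Giving the higher bands shorter decay times, as in the usage lines, reflects the common observation that high frequencies tend to decay faster in rooms; the specific values are illustrative only.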


research
10/27/2022

One-Shot Acoustic Matching Of Audio Signals – Learning to Hear Music In Any Room/ Concert Hall

The acoustic space in which a sound is created and heard plays an essent...
research
03/15/2023

Blind Estimation of Audio Processing Graph

Musicians and audio engineers sculpt and transform their sounds by conne...
research
09/03/2018

Deep Room Recognition Using Inaudible Echos

Recent years have seen the increasing need of location awareness by mobi...
research
11/17/2018

Multipath-enabled private audio with noise

We address the problem of privately communicating audio messages to mult...
research
08/16/2022

Enhancing Audio Perception of Music By AI Picked Room Acoustics

Every sound that we hear is the result of successive convolutional opera...
research
12/02/2022

Relative Acoustic Features for Distance Estimation in Smart-Homes

Any audio recording encapsulates the unique fingerprint of the associate...
research
05/05/2023

Blind identification of Ambisonic reduced room impulse response

Recently proposed Generalized Time-domain Velocity Vector (GTVV) is a ge...
