Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone

08/28/2023
by   Cagdas Tuna, et al.
0

Knowing the room geometry may be very beneficial for many audio applications, including sound reproduction, acoustic scene analysis, and sound source localization. Room geometry inference (RGI) deals with the problem of reflector localization (RL) based on a set of room impulse responses (RIRs). Motivated by the increasing popularity of commercially available soundbars, this article presents a data-driven 3D RGI method using RIRs measured from a linear loudspeaker array to a single microphone. A convolutional recurrent neural network (CRNN) is trained using simulated RIRs in a supervised fashion for RL. The Radon transform, which is equivalent to delay-and-sum beamforming, is applied to multi-channel RIRs, and the resulting time-domain acoustic beamforming map is fed into the CRNN. The room geometry is inferred from the microphone position and the reflector locations estimated by the network. The results obtained using measured RIRs show that the proposed data-driven approach generalizes well to unseen RIRs and achieves an accuracy level comparable to a baseline model-driven RGI method that involves intermediate semi-supervised steps, thereby offering a unified and fully automated RGI framework.

READ FULL TEXT
research
07/02/2019

Can a Robot Hear the Shape and Dimensions of a Room?

Knowing the geometry of a space is desirable for many applications, e.g....
research
07/21/2022

Room geometry blind inference based on the localization of real sound source and first order reflections

The conventional room geometry blind inference techniques with acoustic ...
research
09/03/2018

Deep Room Recognition Using Inaudible Echos

Recent years have seen the increasing need of location awareness by mobi...
research
09/04/2023

RGI-Net: 3D Room Geometry Inference from Room Impulse Responses in the Absence of First-order Echoes

Room geometry is important prior information for implementing realistic ...
research
09/01/2021

Mean absorption estimation from room impulse responses using virtually supervised learning

In the context of building acoustics and the acoustic diagnosis of an ex...
research
07/27/2021

Microphone Array Generalization for Multichannel Narrowband Deep Speech Enhancement

This paper addresses the problem of microphone array generalization for ...
research
06/09/2020

C-SL: Contrastive Sound Localization with Inertial-Acoustic Sensors

Human brain employs perceptual information about the head and eye moveme...

Please sign up or login with your details

Forgot password? Click here to reset