Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

09/21/2023
by   Jozef Coldenhoff, et al.

Previous methods for predicting room acoustic parameters and speech quality metrics have focused on the single-channel case, where room acoustics and Mean Opinion Score (MOS) are predicted for a single recording device. However, quality-based device selection in rooms with multiple recording devices may benefit from a multi-channel approach in which the descriptive metrics are predicted for multiple devices in parallel. Following our hypothesis that a model may benefit from multi-channel training, we develop a multi-channel model for joint MOS and room acoustics prediction (MOSRA) that handles five channels in parallel. Because multi-channel audio data with ground-truth labels is lacking, we create simulated data using an acoustic simulator. Room acoustic labels are extracted from the generated impulse responses, and MOS labels are generated in a student-teacher setup using a wav2vec2-based MOS prediction model. Our experiments show that, compared with the single-channel model, the multi-channel model improves the prediction of the direct-to-reverberation ratio, clarity, and speech transmission index while requiring roughly 5× less computation and suffering minimal losses on the other metrics.
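To illustrate how room acoustic labels such as clarity and the direct-to-reverberation ratio can be derived from an impulse response, here is a minimal Python sketch. It is not the authors' pipeline: the function names, the use of the strongest peak as the direct-path onset, the 50 ms early/late boundary (the common C50 convention), and the 2.5 ms direct-path window are all assumptions chosen for illustration.

```python
import numpy as np

def clarity_c50(rir, fs):
    """Clarity in dB: energy in the first 50 ms after the direct path vs. the remainder.

    Assumption: the direct path is the sample with the largest magnitude.
    """
    rir = np.asarray(rir, dtype=float)
    onset = int(np.argmax(np.abs(rir)))
    split = onset + int(round(0.050 * fs))   # 50 ms boundary (assumed convention)
    energy = rir ** 2
    early = energy[onset:split].sum()
    late = energy[split:].sum()
    return 10.0 * np.log10(early / late)

def direct_to_reverberation_ratio(rir, fs, window_ms=2.5):
    """DRR in dB: energy in a short window around the direct path vs. everything after it.

    Assumption: a symmetric window of +/- window_ms around the strongest peak
    captures the direct sound; the window length is a hypothetical default.
    """
    rir = np.asarray(rir, dtype=float)
    onset = int(np.argmax(np.abs(rir)))
    half = int(round(window_ms * 1e-3 * fs))
    lo = max(onset - half, 0)
    hi = onset + half + 1
    energy = rir ** 2
    direct = energy[lo:hi].sum()
    reverb = energy[hi:].sum()
    return 10.0 * np.log10(direct / reverb)

if __name__ == "__main__":
    # Synthetic impulse response: a direct path followed by a decaying noise tail.
    fs = 16000
    rng = np.random.default_rng(0)
    t = np.arange(fs) / fs
    rir = 0.05 * rng.standard_normal(fs) * np.exp(-t / 0.1)
    rir[0] = 1.0  # direct path
    print("C50 (dB):", clarity_c50(rir, fs))
    print("DRR (dB):", direct_to_reverberation_ratio(rir, fs))
```

In a multi-channel setting, the same functions would simply be applied to each simulated microphone's impulse response to obtain one label set per channel.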


research
04/04/2022

MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment

The acoustic environment can degrade speech quality during communication...
research
03/14/2021

Blind Estimation of Room Acoustic Parameters and Speech Transmission Index using MTF-based CNNs

This paper proposes a blind estimation method based on the modulation tr...
research
12/02/2022

Relative Acoustic Features for Distance Estimation in Smart-Homes

Any audio recording encapsulates the unique fingerprint of the associate...
research
07/29/2021

Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings

Knowing the geometrical and acoustical parameters of a room may benefit ...
research
10/07/2022

Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization

Due to the high performance of multi-channel speech processing, we can u...
research
11/03/2018

Multi-View Networks For Multi-Channel Audio Classification

In this paper we introduce the idea of multi-view networks for sound cla...
research
09/01/2021

Mean absorption estimation from room impulse responses using virtually supervised learning

In the context of building acoustics and the acoustic diagnosis of an ex...
