MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment

04/04/2022
by   Karl El Hajal, et al.
0

The acoustic environment can degrade speech quality during communication (e.g., video call, remote presentation, outside voice recording), and its impact is often unknown. Objective metrics for speech quality have proven challenging to develop given the multi-dimensionality of factors that affect speech quality and the difficulty of collecting labeled data. Hypothesizing the impact of acoustics on speech quality, this paper presents MOSRA: a non-intrusive multi-dimensional speech quality metric that can predict room acoustics parameters (SNR, STI, T60, DRR, and C50) alongside the overall mean opinion score (MOS) for speech quality. By explicitly optimizing the model to learn these room acoustics parameters, we can extract more informative features and improve the generalization for the MOS task when the training data is limited. Furthermore, we also show that this joint training method enhances the blind estimation of room acoustics, improving the performance of current state-of-the-art models. An additional side-effect of this joint prediction is the improvement in the explainability of the predictions, which is a valuable feature for many applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/21/2023

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

Previous methods for predicting room acoustic parameters and speech qual...
research
07/24/2020

Dereverberation using joint estimation of dry speech signal and acoustic system

The purpose of speech dereverberation is to remove quality-degrading eff...
research
11/02/2020

Perceptually Guided End-to-End Text-to-Speech

Several fast text-to-speech (TTS) models have been proposed for real-tim...
research
03/14/2021

Blind Estimation of Room Acoustic Parameters and Speech Transmission Index using MTF-based CNNs

This paper proposes a blind estimation method based on the modulation tr...
research
03/01/2023

Personalized Task Load Prediction in Speech Communication

Estimating the quality of remote speech communication is a complex task ...
research
08/23/2023

Analysis of XLS-R for Speech Quality Assessment

In online conferencing applications, estimating the perceived quality of...
research
09/29/2021

A Universal Deep Room Acoustics Estimator

Speech audio quality is subject to degradation caused by an acoustic env...

Please sign up or login with your details

Forgot password? Click here to reset