Non-intrusive speech quality assessment using neural networks

03/16/2019
by   Anderson R. Avila, et al.

Estimating the perceived quality of an audio signal is critical for many multimedia and audio processing systems. Providers strive to offer optimal and reliable services in order to increase the user quality of experience (QoE). In this work, we investigate the applicability of neural networks to non-intrusive audio quality assessment. We propose three neural network-based approaches for mean opinion score (MOS) estimation and compare them to three instrumental measures: the perceptual evaluation of speech quality (PESQ), the ITU-T Recommendation P.563, and the speech-to-reverberation energy ratio. Our evaluation uses a speech dataset contaminated with convolutive and additive noise and labeled through a crowd-based QoE evaluation. Performance is measured by the Pearson correlation between the estimated MOS and the MOS labels, and by the mean squared error of the estimated MOS. Our proposed approaches outperform the aforementioned instrumental measures, with a fully connected deep neural network using Mel-frequency features providing the best correlation (0.87) and the lowest mean squared error (0.15).
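
The two figures of merit named in the abstract, Pearson correlation and mean squared error between estimated and subjective MOS, can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names and the sample scores are hypothetical, and the numbers are invented purely to show the calling convention.

```python
import math


def pearson_correlation(estimated, labels):
    """Pearson correlation between estimated MOS and subjective MOS labels."""
    n = len(estimated)
    if n != len(labels) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_e = sum(estimated) / n
    mean_l = sum(labels) / n
    # Covariance and variances are left unnormalized; the 1/n factors cancel.
    cov = sum((e - mean_e) * (l - mean_l) for e, l in zip(estimated, labels))
    var_e = sum((e - mean_e) ** 2 for e in estimated)
    var_l = sum((l - mean_l) ** 2 for l in labels)
    if var_e == 0 or var_l == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_e * var_l)


def mean_squared_error(estimated, labels):
    """Mean squared error of the estimated MOS against the MOS labels."""
    if len(estimated) != len(labels) or not estimated:
        raise ValueError("need two equal-length, non-empty sequences")
    return sum((e - l) ** 2 for e, l in zip(estimated, labels)) / len(estimated)


if __name__ == "__main__":
    # Hypothetical scores on the usual 1-5 MOS scale (illustrative only).
    mos_labels = [1.8, 2.5, 3.1, 3.9, 4.4]
    mos_estimates = [2.0, 2.4, 3.4, 3.6, 4.5]
    print("Pearson correlation:", pearson_correlation(mos_estimates, mos_labels))
    print("Mean squared error:", mean_squared_error(mos_estimates, mos_labels))
```

The two metrics are complementary: correlation rewards estimates that rank and scale with the labels even if they are offset, while mean squared error penalizes any absolute deviation from the labeled MOS.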

