Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings

11/12/2022
by Karl El Hajal, et al.
Automatic speech quality assessment is essential for audio researchers, developers, speech and language pathologists, and system quality engineers. The current state-of-the-art systems are based on framewise speech features (hand-engineered or learnable) combined with time dependency modeling. This paper proposes an efficient system with results comparable to the best performing model in the ConferencingSpeech 2022 challenge. The proposed system is characterized by fewer parameters (40-60x), fewer FLOPS (100x), lower memory consumption (10-15x), and lower latency (30x). Speech quality practitioners can therefore iterate much faster and deploy the system on resource-limited hardware. Overall, the proposed system contributes to sustainable machine learning. The paper also concludes that framewise embeddings outperform utterance-level embeddings, and that multi-task training with acoustic conditions modeling does not degrade speech quality prediction while making the predictions easier to interpret.


research
09/18/2023

Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Automated assessment of speech intelligibility in hearing aid (HA) devic...
research
11/04/2022

CCATMos: Convolutional Context-aware Transformer Network for Non-intrusive Speech Quality Assessment

Speech quality assessment has been a critical component in many voice co...
research
03/01/2023

Personalized Task Load Prediction in Speech Communication

Estimating the quality of remote speech communication is a complex task ...
research
08/23/2023

Analysis of XLS-R for Speech Quality Assessment

In online conferencing applications, estimating the perceived quality of...
research
01/03/2019

Quality Assessment and Improvement of Helm Charts for Kubernetes-Based Cloud Applications

Helm has recently been proposed by practitioners as technology to packag...
research
05/16/2020

Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models

Many applications of speech technology require more and more audio data....
