SESQA: semi-supervised learning for speech quality assessment

10/01/2020
by Joan Serrà, et al.

Automatic speech quality assessment is an important, transversal task whose progress is hampered by the scarcity of human annotations, poor generalization to unseen recording conditions, and a lack of flexibility of existing approaches. In this work, we tackle these problems with a semi-supervised learning approach, combining available annotations with programmatically generated data, and using 3 different optimization criteria together with 5 complementary auxiliary tasks. Our results show that such a semi-supervised approach can cut the error of existing methods by more than 36%, while providing additional benefits in terms of reusable features or auxiliary outputs. Improvement is further corroborated with an out-of-sample test showing promising generalization capabilities.


