InQSS: a speech intelligibility assessment model using a multi-task learning network

11/04/2021
by   Yu-Wen Chen, et al.

Speech intelligibility assessment models are essential tools for researchers to evaluate and improve speech processing models. In this study, we propose InQSS, a speech intelligibility assessment model that uses both spectrograms and scattering coefficients as input features. In addition, InQSS uses a multi-task learning network in which quality scores guide the training of the intelligibility assessment. The resulting model predicts both the intelligibility score and the quality score of a speech utterance. The experimental results confirm that the scattering coefficients and quality scores are informative for intelligibility prediction. Moreover, we release TMHINT-QI, a Chinese speech dataset that records the quality and intelligibility scores of clean, noisy, and enhanced speech.
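To make the multi-task idea concrete, the following is a minimal sketch and not the InQSS architecture, which the abstract does not specify. It assumes the two feature types are concatenated, passed through one shared layer, and scored by two separate heads, with training driven by a weighted sum of squared errors. The class `TwoHeadScorer`, the function `multitask_loss`, the parameter `quality_weight`, the layer sizes, and all feature and target values are hypothetical and chosen only for illustration.

```python
import random


def linear(weights, bias, x):
    """Apply a dense layer: one output per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in x]


def init_layer(n_out, n_in, rng):
    """Small random weights and zero biases (illustrative initialisation)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class TwoHeadScorer:
    """Hypothetical toy model: one shared layer, then a quality head and an intelligibility head."""

    def __init__(self, n_spec, n_scat, n_hidden=8, seed=0):
        rng = random.Random(seed)
        self.shared = init_layer(n_hidden, n_spec + n_scat, rng)
        self.quality_head = init_layer(1, n_hidden, rng)
        self.intell_head = init_layer(1, n_hidden, rng)

    def predict(self, spec_feats, scat_feats):
        # Assumption: both feature types are concatenated into one input vector.
        hidden = relu(linear(*self.shared, spec_feats + scat_feats))
        quality = linear(*self.quality_head, hidden)[0]
        intelligibility = linear(*self.intell_head, hidden)[0]
        return quality, intelligibility


def multitask_loss(pred, target, quality_weight=1.0):
    """Squared error on intelligibility plus a weighted squared error on quality."""
    (pred_q, pred_i), (true_q, true_i) = pred, target
    return (pred_i - true_i) ** 2 + quality_weight * (pred_q - true_q) ** 2


# Made-up feature vectors and target scores, for illustration only.
model = TwoHeadScorer(n_spec=4, n_scat=3)
pred = model.predict([0.2, 0.5, 0.1, 0.9], [0.3, 0.7, 0.4])
loss = multitask_loss(pred, target=(3.5, 0.8), quality_weight=0.5)
```

Because both heads read the same hidden representation, the quality term in the loss also shapes the features the intelligibility head relies on, which is the sense in which one task can guide the other. A real system would train these weights by gradient descent and operate on full time-frequency inputs rather than short vectors.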

