MultiPA: a multi-task speech pronunciation assessment system for a closed and open response scenario

08/24/2023
by   Yu-Wen Chen, et al.
0

The design of automatic speech pronunciation assessment can be categorized into closed and open response scenarios, each with strengths and limitations. A system with the ability to function in both scenarios can cater to diverse learning needs and provide a more precise and holistic assessment of pronunciation skills. In this study, we propose a Multi-task Pronunciation Assessment model called MultiPA. MultiPA provides an alternative to Kaldi-based systems in that it has simpler format requirements and better compatibility with other neural network models. Compared with previous open response systems, MultiPA provides a wider range of evaluations, encompassing assessments at both the sentence and word-level. Our experimental results show that MultiPA achieves comparable performance when working in closed response scenarios and maintains more robust performance when directly used for open responses.

READ FULL TEXT
research
11/04/2021

InQSS: a speech intelligibility assessment model using a multi-task learning network

Speech intelligibility assessment models are essential tools for researc...
research
08/18/2023

Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model

This study proposes a multi-task pseudo-label learning (MPL)-based non-i...
research
10/27/2022

Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Automatic assessment of dysarthric speech is essential for sustained tre...
research
12/24/2020

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Many purely neural network based speech separation approaches have been ...
research
09/18/2023

Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Automated assessment of speech intelligibility in hearing aid (HA) devic...
research
07/18/2018

Hierarchical Multi Task Learning With CTC

In Automatic Speech Recognition, it is still challenging to learn useful...
research
05/31/2019

Content Word-based Sentence Decoding and Evaluating for Open-domain Neural Response Generation

Various encoder-decoder models have been applied to response generation ...

Please sign up or login with your details

Forgot password? Click here to reset