DDSupport: Language Learning Support System that Displays Differences and Distances from Model Speech

12/08/2022
by   Kazuki Kawamura, et al.
0

When beginners learn to speak a non-native language, it is difficult for them to judge for themselves whether they are speaking well. Therefore, computer-assisted pronunciation training systems are used to detect learner mispronunciations. These systems typically compare the user's speech with that of a specific native speaker as a model in units of rhythm, phonemes, or words and calculate the differences. However, they require extensive speech data with detailed annotations or can only compare with one specific native speaker. To overcome these problems, we propose a new language learning support system that calculates speech scores and detects mispronunciations by beginners based on a small amount of unannotated speech data without comparison to a specific person. The proposed system uses deep learning–based speech processing to display the pronunciation score of the learner's speech and the difference/distance between the learner's and a group of models' pronunciation in an intuitively visual manner. Learners can gradually improve their pronunciation by eliminating differences and shortening the distance from the model until they become sufficiently proficient. Furthermore, since the pronunciation score and difference/distance are not calculated compared to specific sentences of a particular model, users are free to study the sentences they wish to study. We also built an application to help non-native speakers learn English and confirmed that it can improve users' speech intelligibility.

READ FULL TEXT

page 1

page 3

page 5

research
11/09/2018

Native Language Identification using i-vector

The task of determining a speaker's native language based only on his sp...
research
04/20/2019

Self-imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training

Self-imitating feedback is an effective and learner-friendly method for ...
research
08/30/2021

Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring

Automatic Speech Scoring (ASS) is the computer-assisted evaluation of a ...
research
10/05/2020

Assessing the Helpfulness of Learning Materials with Inference-Based Learner-Like Agent

Many English-as-a-second language learners have trouble using near-synon...
research
01/31/2019

Rhythm Zone Theory: Speech Rhythms are Physical after all

Speech rhythms have been dealt with in three main ways: from the introsp...
research
10/19/2022

A Data-Driven Investigation of Noise-Adaptive Utterance Generation with Linguistic Modification

In noisy environments, speech can be hard to understand for humans. Spok...
research
08/17/2023

Is Argument Structure of Learner Chinese Understandable: A Corpus-Based Analysis

This paper presents a corpus-based analysis of argument structure errors...

Please sign up or login with your details

Forgot password? Click here to reset